Search | VHL CLAP/WR-PAHO/WHO

1.

Useful blunders: Can automated speech recognition errors improve downstream dementia classification?

Li, Changye; Xu, Weizhe; Cohen, Trevor; Pakhomov, Serguei.

J Biomed Inform ; 150: 104598, 2024 02.

Article in English | MEDLINE | ID: mdl-38253228

ABSTRACT

OBJECTIVES: We aimed to investigate how errors from automatic speech recognition (ASR) systems affect dementia classification accuracy, specifically in the "Cookie Theft" picture description task. We aimed to assess whether imperfect ASR-generated transcripts could provide valuable information for distinguishing between language samples from cognitively healthy individuals and those with Alzheimer's disease (AD). METHODS: We conducted experiments using various ASR models, refining their transcripts with post-editing techniques. Both these imperfect ASR transcripts and manually transcribed ones were used as inputs for the downstream dementia classification. We conducted comprehensive error analysis to compare model performance and assess ASR-generated transcript effectiveness in dementia classification. RESULTS: Imperfect ASR-generated transcripts surprisingly outperformed manual transcription for distinguishing between individuals with AD and those without in the "Cookie Theft" task. These ASR-based models surpassed the previous state-of-the-art approach, indicating that ASR errors may contain valuable cues related to dementia. The synergy between ASR and classification models improved overall accuracy in dementia classification. CONCLUSION: Imperfect ASR transcripts effectively capture linguistic anomalies linked to dementia, improving accuracy in classification tasks. This synergy between ASR and classification models underscores ASR's potential as a valuable tool in assessing cognitive impairment and related clinical applications.

Subject(s)

Alzheimer Disease , Cognitive Dysfunction , Speech Perception , Humans , Speech , Language , Alzheimer Disease/diagnosis

2.

Retrieval augmentation of large language models for lay language generation.

Guo, Yue; Qiu, Wei; Leroy, Gondy; Wang, Sheng; Cohen, Trevor.

J Biomed Inform ; 149: 104580, 2024 01.

Article in English | MEDLINE | ID: mdl-38163514

ABSTRACT

The complex linguistic structures and specialized terminology of expert-authored content limit the accessibility of biomedical literature to the general public. Automated methods have the potential to render this literature more interpretable to readers with different educational backgrounds. Prior work has framed such lay language generation as a summarization or simplification task. However, adapting biomedical text for the lay public includes the additional and distinct task of background explanation: adding external content in the form of definitions, motivation, or examples to enhance comprehensibility. This task is especially challenging because the source document may not include the required background knowledge. Furthermore, background explanation capabilities have yet to be formally evaluated, and little is known about how best to enhance them. To address this problem, we introduce Retrieval-Augmented Lay Language (RALL) generation, which intuitively fits the need for external knowledge beyond that in expert-authored source documents. In addition, we introduce CELLS, the largest (63k pairs) and broadest-ranging (12 journals) parallel corpus for lay language generation. To evaluate RALL, we augmented state-of-the-art text generation models with information retrieval of either term definitions from the UMLS and Wikipedia, or embeddings of explanations from Wikipedia documents. Of these, embedding-based RALL models improved summary quality and simplicity while maintaining factual correctness, suggesting that Wikipedia is a helpful source for background explanation in this context. We also evaluated the ability of both an open-source Large Language Model (Llama 2) and a closed-source Large Language Model (GPT-4) in background explanation, with and without retrieval augmentation. Results indicate that these LLMs can generate simplified content, but that the summary quality is not ideal. Taken together, this work presents the first comprehensive study of background explanation for lay language generation, paving the path for disseminating scientific knowledge to a broader audience. Our code and data are publicly available at: https://github.com/LinguisticAnomalies/pls_retrieval.

Subject(s)

Language , Natural Language Processing , Information Storage and Retrieval , Linguistics , Unified Medical Language System

3.

Opportunities for incorporating intersectionality into biomedical informatics.

Bear Don't Walk, Oliver J; Paullada, Amandalynne; Everhart, Avery; Casanova-Perez, Reggie; Cohen, Trevor; Veinot, Tiffany.

J Biomed Inform ; 154: 104653, 2024 Jun.

Article in English | MEDLINE | ID: mdl-38734158

ABSTRACT

Many approaches in biomedical informatics (BMI) rely on the ability to define, gather, and manipulate biomedical data to support health through a cyclical research-practice lifecycle. Researchers within this field are often fortunate to work closely with healthcare and public health systems to influence data generation and capture and have access to a vast amount of biomedical data. Many informaticists also have the expertise to engage with stakeholders, develop new methods and applications, and influence policy. However, research and policy that explicitly seeks to address the systemic drivers of health would more effectively support health. Intersectionality is a theoretical framework that can facilitate such research. It holds that individual human experiences reflect larger socio-structural level systems of privilege and oppression, and cannot be truly understood if these systems are examined in isolation. Intersectionality explicitly accounts for the interrelated nature of systems of privilege and oppression, providing a lens through which to examine and challenge inequities. In this paper, we propose intersectionality as an intervention into how we conduct BMI research. We begin by discussing intersectionality's history and core principles as they apply to BMI. We then elaborate on the potential for intersectionality to stimulate BMI research. Specifically, we posit that our efforts in BMI to improve health should address intersectionality's five key considerations: (1) systems of privilege and oppression that shape health; (2) the interrelated nature of upstream health drivers; (3) the nuances of health outcomes within groups; (4) the problematic and power-laden nature of categories that we assign to people in research and in society; and (5) research to inform and support social change.

Subject(s)

Medical Informatics , Humans , Medical Informatics/methods , Biomedical Research

4.

Discerning conversational context in online health communities for personalized digital behavior change solutions using Pragmatics to Reveal Intent in Social Media (PRISM) framework.

Singh, Tavleen; Roberts, Kirk; Cohen, Trevor; Cobb, Nathan; Franklin, Amy; Myneni, Sahiti.

J Biomed Inform ; 140: 104324, 2023 04.

Article in English | MEDLINE | ID: mdl-36842490

ABSTRACT

BACKGROUND: Online health communities (OHCs) have emerged as prominent platforms for behavior modification, and the digitization of online peer interactions has afforded researchers with unique opportunities to model multilevel mechanisms that drive behavior change. Existing studies, however, have been limited by a lack of methods that allow the capture of conversational context and socio-behavioral dynamics at scale, as manifested in these digital platforms. OBJECTIVE: We develop, evaluate, and apply a novel methodological framework, Pragmatics to Reveal Intent in Social Media (PRISM), to facilitate granular characterization of peer interactions by combining multidimensional facets of human communication. METHODS: We developed and applied PRISM to analyze peer interactions (N = 2.23 million) in QuitNet, an OHC for tobacco cessation. First, we generated a labeled set of peer interactions (n = 2,005) through manual annotation along three dimensions: communication themes (CTs), behavior change techniques (BCTs), and speech acts (SAs). Second, we used deep learning models to apply our qualitative codes at scale. Third, we applied our validated model to perform a retrospective analysis. Finally, using social network analysis (SNA), we portrayed large-scale patterns and relationships among the aforementioned communication dimensions embedded in peer interactions in QuitNet. RESULTS: Qualitative analysis showed that the themes of social support and behavioral progress were common. The most used BCTs were feedback and monitoring and comparison of behavior, and users most commonly expressed their intentions using SAs-expressive and emotion. With additional in-domain pre-training, bidirectional encoder representations from Transformers (BERT) outperformed other deep learning models on the classification tasks. Content-specific SNA revealed that users' engagement or abstinence status is associated with the prevalence of various categories of BCTs and SAs, which also was evident from the visualization of network structures. CONCLUSIONS: Our study describes the interplay of multilevel characteristics of online communication and their association with individual health behaviors.

Subject(s)

Social Media , Humans , Retrospective Studies , Intention , Social Support , Communication

5.

Predicting decompression surgery by applying multimodal deep learning to patients' structured and unstructured health data.

Jujjavarapu, Chethan; Suri, Pradeep; Pejaver, Vikas; Friedly, Janna; Gold, Laura S; Meier, Eric; Cohen, Trevor; Mooney, Sean D; Heagerty, Patrick J; Jarvik, Jeffrey G.

BMC Med Inform Decis Mak ; 23(1): 2, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36609379

ABSTRACT

BACKGROUND: Low back pain (LBP) is a common condition made up of a variety of anatomic and clinical subtypes. Lumbar disc herniation (LDH) and lumbar spinal stenosis (LSS) are two subtypes highly associated with LBP. Patients with LDH/LSS are often started with non-surgical treatments and if those are not effective then go on to have decompression surgery. However, recommendation of surgery is complicated as the outcome may depend on the patient's health characteristics. We developed a deep learning (DL) model to predict decompression surgery for patients with LDH/LSS. MATERIALS AND METHOD: We used datasets of 8387 and 8620 patients from a prospective study that collected data from four healthcare systems to predict early (within 2 months) and late surgery (within 12 months after a 2 month gap), respectively. We developed a DL model to use patients' demographics, diagnosis and procedure codes, drug names, and diagnostic imaging reports to predict surgery. For each prediction task, we evaluated the model's performance using classical and generalizability evaluation. For classical evaluation, we split the data into training (80%) and testing (20%). For generalizability evaluation, we split the data based on the healthcare system. We used the area under the curve (AUC) to assess performance for each evaluation. We compared results to a benchmark model (i.e. LASSO logistic regression). RESULTS: For classical performance, the DL model outperformed the benchmark model for early surgery with an AUC of 0.725 compared to 0.597. For late surgery, the DL model outperformed the benchmark model with an AUC of 0.655 compared to 0.635. For generalizability performance, the DL model outperformed the benchmark model for early surgery. For late surgery, the benchmark model outperformed the DL model. CONCLUSIONS: For early surgery, the DL model was preferred for classical and generalizability evaluation. However, for late surgery, the benchmark and DL model had comparable performance. Depending on the prediction task, the balance of performance may shift between DL and a conventional ML method. As a result, thorough assessment is needed to quantify the value of DL, a relatively computationally expensive, time-consuming and less interpretable method.

Subject(s)

Deep Learning , Intervertebral Disc Displacement , Low Back Pain , Spinal Stenosis , Humans , Decompression, Surgical/adverse effects , Decompression, Surgical/methods , Prospective Studies , Lumbar Vertebrae/surgery , Low Back Pain/diagnosis , Low Back Pain/surgery , Low Back Pain/complications , Intervertebral Disc Displacement/surgery , Spinal Stenosis/surgery , Treatment Outcome , Retrospective Studies

6.

Intervention to Promote Communication About Goals of Care for Hospitalized Patients With Serious Illness: A Randomized Clinical Trial.

Curtis, J Randall; Lee, Robert Y; Brumback, Lyndia C; Kross, Erin K; Downey, Lois; Torrence, Janaki; LeDuc, Nicole; Mallon Andrews, Kasey; Im, Jennifer; Heywood, Joanna; Brown, Crystal E; Sibley, James; Lober, William B; Cohen, Trevor; Weiner, Bryan J; Khandelwal, Nita; Abedini, Nauzley C; Engelberg, Ruth A.

JAMA ; 329(23): 2028-2037, 2023 06 20.

Article in English | MEDLINE | ID: mdl-37210665

ABSTRACT

Importance: Discussions about goals of care are important for high-quality palliative care yet are often lacking for hospitalized older patients with serious illness. Objective: To evaluate a communication-priming intervention to promote goals-of-care discussions between clinicians and hospitalized older patients with serious illness. Design, Setting, and Participants: A pragmatic, randomized clinical trial of a clinician-facing communication-priming intervention vs usual care was conducted at 3 US hospitals within 1 health care system, including a university, county, and community hospital. Eligible hospitalized patients were aged 55 years or older with any of the chronic illnesses used by the Dartmouth Atlas project to study end-of-life care or were aged 80 years or older. Patients with documented goals-of-care discussions or a palliative care consultation between hospital admission and eligibility screening were excluded. Randomization occurred between April 2020 and March 2021 and was stratified by study site and history of dementia. Intervention: Physicians and advance practice clinicians who were treating the patients randomized to the intervention received a 1-page, patient-specific intervention (Jumpstart Guide) to prompt and guide goals-of-care discussions. Main Outcomes and Measures: The primary outcome was the proportion of patients with electronic health record-documented goals-of-care discussions within 30 days. There was also an evaluation of whether the effect of the intervention varied by age, sex, history of dementia, minoritized race or ethnicity, or study site. Results: Of 3918 patients screened, 2512 were enrolled (mean age, 71.7 [SD, 10.8] years and 42% were women) and randomized (1255 to the intervention group and 1257 to the usual care group). The patients were American Indian or Alaska Native (1.8%), Asian (12%), Black (13%), Hispanic (6%), Native Hawaiian or Pacific Islander (0.5%), non-Hispanic (93%), and White (70%). The proportion of patients with electronic health record-documented goals-of-care discussions within 30 days was 34.5% (433 of 1255 patients) in the intervention group vs 30.4% (382 of 1257 patients) in the usual care group (hospital- and dementia-adjusted difference, 4.1% [95% CI, 0.4% to 7.8%]). The analyses of the treatment effect modifiers suggested that the intervention had a larger effect size among patients with minoritized race or ethnicity. Among 803 patients with minoritized race or ethnicity, the hospital- and dementia-adjusted proportion with goals-of-care discussions was 10.2% (95% CI, 4.0% to 16.5%) higher in the intervention group than in the usual care group. Among 1641 non-Hispanic White patients, the adjusted proportion with goals-of-care discussions was 1.6% (95% CI, -3.0% to 6.2%) higher in the intervention group than in the usual care group. There was no evidence of differential treatment effects of the intervention on the primary outcome by age, sex, history of dementia, or study site. Conclusions and Relevance: Among hospitalized older adults with serious illness, a pragmatic clinician-facing communication-priming intervention significantly improved documentation of goals-of-care discussions in the electronic health record, with a greater effect size in racially or ethnically minoritized patients. Trial Registration: ClinicalTrials.gov Identifier: NCT04281784.

Subject(s)

Dementia , Terminal Care , Humans , Female , Aged , Male , Communication , Hospitalization , Dementia/therapy , Patient Care Planning

7.

Fully automated detection of formal thought disorder with Time-series Augmented Representations for Detection of Incoherent Speech (TARDIS).

Xu, Weizhe; Wang, Weichen; Portanova, Jake; Chander, Ayesha; Campbell, Andrew; Pakhomov, Serguei; Ben-Zeev, Dror; Cohen, Trevor.

J Biomed Inform ; 126: 103998, 2022 02.

Article in English | MEDLINE | ID: mdl-35063668

ABSTRACT

Formal thought disorder (ThD) is a clinical sign of schizophrenia amongst other serious mental health conditions. ThD can be recognized by observing incoherent speech - speech in which it is difficult to perceive connections between successive utterances and lacks a clear global theme. Automated assessment of the coherence of speech in patients with schizophrenia has been an active area of research for over a decade, in an effort to develop an objective and reliable instrument through which to quantify ThD. However, this work has largely been conducted in controlled settings using structured interviews and depended upon manual transcription services to render audio recordings amenable to computational analysis. In this paper, we present an evaluation of such automated methods in the context of a fully automated system using Automated Speech Recognition (ASR) in place of a manual transcription service, with "audio diaries" collected in naturalistic settings from participants experiencing Auditory Verbal Hallucinations (AVH). We show that performance lost due to ASR errors can often be restored through the application of Time-Series Augmented Representations for Detection of Incoherent Speech (TARDIS), a novel approach that involves treating the sequence of coherence scores from a transcript as a time-series, providing features for machine learning. With ASR, TARDIS improves average AUC across coherence metrics for detection of severe ThD by 0.09; average correlation with human-labeled derailment scores by 0.10; and average correlation between coherence estimates from manual and ASR-derived transcripts by 0.29. In addition, TARDIS improves the agreement between coherence estimates from manual transcripts and human judgment and correlation with self-reported estimates of AVH symptom severity. As such, TARDIS eliminates a fundamental barrier to the deployment of automated methods to detect linguistic indicators of ThD to monitor and improve clinical care in serious mental illness.

Subject(s)

Schizophrenia , Speech , Hallucinations , Humans , Linguistics , Machine Learning

8.

Augmenting aer2vec: Enriching distributed representations of adverse event report data with orthographic and lexical information.

Ding, Xiruo; Mower, Justin; Subramanian, Devika; Cohen, Trevor.

J Biomed Inform ; 119: 103833, 2021 07.

Article in English | MEDLINE | ID: mdl-34111555

ABSTRACT

Adverse Drug Events (ADEs) are prevalent, costly, and sometimes preventable. Post-marketing drug surveillance aims to monitor ADEs that occur after a drug is released to market. Reports of such ADEs are aggregated by reporting systems, such as the Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS). In this paper, we consider the topic of how best to represent data derived from reports in FAERS for the purpose of detecting post-marketing surveillance signals, in order to inform regulatory decision making. In our previous work, we developed aer2vec, a method for deriving distributed representations (concept embeddings) of drugs and side effects from ADE reports, establishing the utility of distributional information for pharmacovigilance signal detection. In this paper, we advance this line of research further by evaluating the utility of encoding orthographic and lexical information. We do so by adapting two Natural Language Processing methods, subword embedding and vector retrofitting, which were developed to encode such information into word embeddings. Models were compared for their ability to distinguish between positive and negative examples in a set of manually curated drug/ADE relationships, with both aer2vec enhancements offering advantages in performances over baseline models, and best performance obtained when retrofitting and subword embeddings were applied in concert. In addition, this work demonstrates that models leveraging distributed representations do not require extensive manual preprocessing to perform well on this pharmacovigilance signal detection task, and may even benefit from information that would otherwise be lost during the normalization and standardization process.

Subject(s)

Drug-Related Side Effects and Adverse Reactions , Pharmacovigilance , Adverse Drug Reaction Reporting Systems , Humans , Natural Language Processing , United States , United States Food and Drug Administration

9.

Using computable knowledge mined from the literature to elucidate confounders for EHR-based pharmacovigilance.

Malec, Scott A; Wei, Peng; Bernstam, Elmer V; Boyce, Richard D; Cohen, Trevor.

J Biomed Inform ; 117: 103719, 2021 05.

Article in English | MEDLINE | ID: mdl-33716168

ABSTRACT

INTRODUCTION: Drug safety research asks causal questions but relies on observational data. Confounding bias threatens the reliability of studies using such data. The successful control of confounding requires knowledge of variables called confounders affecting both the exposure and outcome of interest. However, causal knowledge of dynamic biological systems is complex and challenging. Fortunately, computable knowledge mined from the literature may hold clues about confounders. In this paper, we tested the hypothesis that incorporating literature-derived confounders can improve causal inference from observational data. METHODS: We introduce two methods (semantic vector-based and string-based confounder search) that query literature-derived information for confounder candidates to control, using SemMedDB, a database of computable knowledge mined from the biomedical literature. These methods search SemMedDB for confounders by applying semantic constraint search for indications treated by the drug (exposure) and that are also known to cause the adverse event (outcome). We then include the literature-derived confounder candidates in statistical and causal models derived from free-text clinical notes. For evaluation, we use a reference dataset widely used in drug safety containing labeled pairwise relationships between drugs and adverse events and attempt to rediscover these relationships from a corpus of 2.2 M NLP-processed free-text clinical notes. We employ standard adjustment and causal inference procedures to predict and estimate causal effects by informing the models with varying numbers of literature-derived confounders and instantiating the exposure, outcome, and confounder variables in the models with dichotomous EHR-derived data. Finally, we compare the results from applying these procedures with naive measures of association (χ2 and reporting odds ratio) and with each other. RESULTS AND CONCLUSIONS: We found semantic vector-based search to be superior to string-based search at reducing confounding bias. However, the effect of including more rather than fewer literature-derived confounders was inconclusive. We recommend using targeted learning estimation methods that can address treatment-confounder feedback, where confounders also behave as intermediate variables, and engaging subject-matter experts to adjudicate the handling of problematic covariates.

Subject(s)

Models, Theoretical , Pharmacovigilance , Bias , Causality , Reproducibility of Results

10.

Pragmatics to Reveal Intent in Social Media Peer Interactions: Mixed Methods Study.

Singh, Tavleen; Olivares, Sofia; Cohen, Trevor; Cobb, Nathan; Wang, Jing; Franklin, Amy; Myneni, Sahiti.

J Med Internet Res ; 23(11): e32167, 2021 11 17.

Article in English | MEDLINE | ID: mdl-34787578

ABSTRACT

BACKGROUND: Online health communities (OHCs) have emerged as the leading venues for behavior change and health-related information seeking. The soul and success of these digital platforms lie in their ability to foster social togetherness and a sense of community by providing personalized support. However, we have a minimal understanding of how conversational posts in these settings lead to collaborative societies and ultimately result in positive health changes through social influence. OBJECTIVE: Our objective is to develop a content-specific and intent-sensitive methodological framework for analyzing peer interactions in OHCs. METHODS: We developed and applied a mixed-methods approach to understand the manifestation of expressions in peer interactions in OHCs. We applied our approach to describe online social dialogue in the context of two online communities, QuitNet (QN) and the American Diabetes Association (ADA) support community. A total of 3011 randomly selected peer interactions (n=2005 from QN, n=1006 from ADA) were analyzed. Specifically, we conducted thematic analysis to characterize communication content and linguistic expressions (speech acts) embedded within the two data sets. We also developed an empirical user persona based on their engagement levels and behavior profiles. Further, we examined the association between speech acts and communication themes across observed tiers of user engagement and self-reported behavior profiles using the chi-square test or the Fisher test. RESULTS: Although social support, the most prevalent communication theme in both communities, was expressed in several subtle manners, the prevalence of emotions was higher in the tobacco cessation community and assertions were higher in the diabetes self-management (DSM) community. Specific communication theme-speech act relationships were revealed, such as the social support theme was significantly associated (P<.05) with 9 speech acts from a total of 10 speech acts (ie, assertion, commissive, declarative, desire, directive, expressive, question, stance, and statement) within the QN community. Only four speech acts (ie, commissive, emotion, expressive, and stance) were significantly associated (P<.05) with the social support theme in the ADA community. The speech acts were also significantly associated with the users' abstinence status within the QN community and with the users' lifestyle status within the ADA community (P<.05). CONCLUSIONS: Such an overlay of communication intent implicit in online peer interactions alongside content-specific theory-linked characterizations of social media discourse can inform the development of effective digital health technologies in the field of health promotion and behavior change. Our analysis revealed a rich gradient of expressions across a standardized thematic vocabulary, with a distinct variation in emotional and informational needs, depending on the behavioral and disease management profiles within and across the communities. This signifies the need and opportunities for coupling pragmatic messaging in digital therapeutics and care management pathways for personalized support.

Subject(s)

Social Media , Health Behavior , Humans , Intention , Peer Group , Social Support

11.

Behavioral Activation and Depression Symptomatology: Longitudinal Assessment of Linguistic Indicators in Text-Based Therapy Sessions.

Burkhardt, Hannah A; Alexopoulos, George S; Pullmann, Michael D; Hull, Thomas D; Areán, Patricia A; Cohen, Trevor.

J Med Internet Res ; 23(7): e28244, 2021 07 14.

Article in English | MEDLINE | ID: mdl-34259637

ABSTRACT

BACKGROUND: Behavioral activation (BA) is rooted in the behavioral theory of depression, which states that increased exposure to meaningful, rewarding activities is a critical factor in the treatment of depression. Assessing constructs relevant to BA currently requires the administration of standardized instruments, such as the Behavioral Activation for Depression Scale (BADS), which places a burden on patients and providers, among other potential limitations. Previous work has shown that depressed and nondepressed individuals may use language differently and that automated tools can detect these differences. The increasing use of online, chat-based mental health counseling presents an unparalleled resource for automated longitudinal linguistic analysis of patients with depression, with the potential to illuminate the role of reward exposure in recovery. OBJECTIVE: This work investigated how linguistic indicators of planning and participation in enjoyable activities identified in online, text-based counseling sessions relate to depression symptomatology over time. METHODS: Using distributional semantics methods applied to a large corpus of text-based online therapy sessions, we devised a set of novel BA-related categories for the Linguistic Inquiry and Word Count (LIWC) software package. We then analyzed the language used by 10,000 patients in online therapy chat logs for indicators of activation and other depression-related markers using LIWC. RESULTS: Despite their conceptual and operational differences, both previously established LIWC markers of depression and our novel linguistic indicators of activation were strongly associated with depression scores (Patient Health Questionnaire [PHQ]-9) and longitudinal patient trajectories. Emotional tone; pronoun rates; words related to sadness, health, and biology; and BA-related LIWC categories appear to be complementary, explaining more of the variance in the PHQ score together than they do independently. CONCLUSIONS: This study enables further work in automated diagnosis and assessment of depression, the refinement of BA psychotherapeutic strategies, and the development of predictive models for decision support.

Subject(s)

Depression , Linguistics , Depression/diagnosis , Depression/therapy , Emotions , Humans , Language , Semantics

12.

Perceived Utility and Characterization of Personal Google Search Histories to Detect Data Patterns Proximal to a Suicide Attempt in Individuals Who Previously Attempted Suicide: Pilot Cohort Study.

Areán, Patricia A; Pratap, Abhishek; Hsin, Honor; Huppert, Tierney K; Hendricks, Karin E; Heagerty, Patrick J; Cohen, Trevor; Bagge, Courtney; Comtois, Katherine Anne.

J Med Internet Res ; 23(5): e27918, 2021 05 06.

Article in English | MEDLINE | ID: mdl-33955838

ABSTRACT

BACKGROUND: Despite decades of research to better understand suicide risk and to develop detection and prevention methods, suicide is still one of the leading causes of death globally. While large-scale studies using real-world evidence from electronic health records can identify who is at risk, they have not been successful at pinpointing when someone is at risk. Personalized social media and online search history data, by contrast, could provide an ongoing real-world datastream revealing internal thoughts and personal states of mind. OBJECTIVE: We conducted this study to determine the feasibility and acceptability of using personalized online information-seeking behavior in the identification of risk for suicide attempts. METHODS: This was a cohort survey study to assess attitudes of participants with a prior suicide attempt about using web search data for suicide prevention purposes, dates of lifetime suicide attempts, and an optional one-time download of their past web searches on Google. The study was conducted at the University of Washington School of Medicine Psychiatry Research Offices. The main outcomes were participants' opinions on internet search data for suicide prediction and intervention and any potential change in online information-seeking behavior proximal to a suicide attempt. Individualized nonparametric association analysis was used to assess the magnitude of difference in web search data features derived from time periods proximal (7, 15, 30, and 60 days) to the suicide attempts versus the typical (baseline) search behavior of participants. RESULTS: A total of 62 participants who had attempted suicide in the past agreed to participate in the study. Internet search activity varied from person to person (median 2-24 searches per day). Changes in online search behavior proximal to suicide attempts were evident up to 60 days before attempt. For a subset of attempts (7/30, 23%) search features showed associations from 2 months to a week before the attempt. The top 3 search constructs associated with attempts were online searching patterns (9/30 attempts, 30%), semantic relatedness of search queries to suicide methods (7/30 attempts, 23%), and anger (7/30 attempts, 23%). Participants (40/59, 68%) indicated that use of this personalized web search data for prevention purposes was acceptable with noninvasive potential interventions such as connection to a real person (eg, friend, family member, or counselor); however, concerns were raised about detection accuracy, privacy, and the potential for overly invasive intervention. CONCLUSIONS: Changes in online search behavior may be a useful and acceptable means of detecting suicide risk. Personalized analysis of online information-seeking behavior showed notable changes in search behavior and search terms that are tied to early warning signs of suicide and are evident 2 months to 7 days before a suicide attempt.

Subject(s)

Search Engine , Suicide, Attempted , Cohort Studies , Humans , Information Seeking Behavior , Internet , Pilot Projects

13.

Mining HPV Vaccine Knowledge Structures of Young Adults From Reddit Using Distributional Semantics and Pathfinder Networks.

Amith, Muhammad; Cohen, Trevor; Cunningham, Rachel; Savas, Lara S; Smith, Nina; Cuccaro, Paula; Gabay, Efrat; Boom, Julie; Schvaneveldt, Roger; Tao, Cui.

Cancer Control ; 27(1): 1073274819891442, 2020.

Article in English | MEDLINE | ID: mdl-31912742

ABSTRACT

The human papillomavirus (HPV) vaccine protects adolescents and young adults from 9 high-risk HPV virus types that cause 90% of cervical and anal cancers and 70% of oropharyngeal cancers. This study extends our previous research analyzing online content concerning the HPV vaccination in social media platforms used by young adults, in which we used Pathfinder network scaling and methods of distributional semantics to characterize differences in knowledge organization reflected in consumer- and expert-generated online content. The current study extends this approach to evaluate HPV vaccine perceptions among young adults who populate Reddit, a major social media platform. We derived Pathfinder networks from estimates of semantic relatedness obtained by learning word embeddings from Reddit posts and compared these to networks derived from human expert estimation of the relationship between key concepts. Results revealed that users of Reddit, predominantly comprising young adults in the vaccine catch up age-group 18 through 26 years of age, perceived the HPV vaccine domain from a virus-framed perspective that could impact their lifestyle choices and that their awareness of the HPV vaccine for cancer prevention is also lacking. Further differences in knowledge structures were elucidated, with implications for future health communication initiatives.

Subject(s)

Health Knowledge, Attitudes, Practice , Papillomavirus Vaccines/genetics , Semantics , Child , Female , Humans , Male

14.

More Than a Myoma: A Review of Myomatous Erythrocytosis Syndrome.

Mui, Justin; Yang, Michael M H; Cohen, Trevor; McDonald, David I; Hunt, Harold.

J Obstet Gynaecol Can ; 42(2): 198-203.e3, 2020 Feb.

Article in English | MEDLINE | ID: mdl-30904341

ABSTRACT

Myomatous erythrocytosis syndrome (MES) is gynaecological condition marked by isolated erythrocytosis and a fibroid uterus. This report presents a case of MES and reviews common clinical presentations, hematological trends, and patient outcomes. This study was a combined case report and review of published cases of MES. Cases were identified using Medline and EMBASE databases. Binomial statistics were used to compare clinical characteristics among patients with MES. Kruskal-Wallis one-way analysis of variance was used to compare hematological values across time points (Canadian Task Force Classification III). A total of 57 cases of MES were reviewed. The mean age at presentation was 48.7 years. Commonly reported signs or symptoms at presentation include abdominopelvic distension or mass (93%), skin discolouration (33%), and menstrual irregularities (25%). There was no difference in parity (Pâ¯=â¯0.42), menopausal status (Pâ¯=â¯0.87), or hydronephrosis on imaging (Pâ¯=â¯0.48) among patients. Preoperative phlebotomy to reduce the risk of thromboembolic complications was performed in half of all cases. On average, a 51% reduction in serum erythropoietin levels was observed following surgical resection (Pâ¯=â¯0.004). In conclusion, patients with MES present with signs and symptoms attributed to either an abdominopelvic mass or erythrocytosis. Preoperative phlebotomy to decrease the severity of erythrocytosis has been used to mitigate the risk of thrombotic complications. Surgical resection of the offending leiomyoma is a valid approach for the treatment of MES.

Subject(s)

Leiomyoma/diagnosis , Polycythemia/diagnosis , Uterine Neoplasms/diagnosis , Adult , Diagnosis, Differential , Female , Humans , Hysterectomy , Leiomyoma/diagnostic imaging , Leiomyoma/pathology , Parity , Polycythemia/blood , Polycythemia/diagnostic imaging , Syndrome , Uterine Neoplasms/diagnostic imaging , Uterine Neoplasms/pathology

15.

No. 279-Female Sexual Health Consensus Clinical Guidelines.

Lamont, John; Bajzak, Krisztina; Bouchard, Céline; Burnett, Margaret; Byers, Sandra; Cohen, Trevor; Fisher, William; Holzapfel, Stephen; Senikas, Vyta.

J Obstet Gynaecol Can ; 40(6): e451-e503, 2018 06.

Article in English | MEDLINE | ID: mdl-29861087

ABSTRACT

OBJECTIVE: To establish national guidelines for the assessment of women's sexual health concerns and the provision of sexual health care for women. EVIDENCE: Published literature was retrieved through searches of PubMed, CINAHL, and the Cochrane Library from May to October 2010, using appropriate controlled vocabulary (e.g., sexuality, "sexual dysfunction," "physiological," dyspareunia) and key words (e.g., sexual dysfunction, sex therapy, anorgasmia). Results were restricted, where possible, to systematic reviews, randomized control trials/controlled clinical trials, and observational studies. There were no language restrictions. Searches were updated on a regular basis and incorporated in the guideline to December 2010. Grey (unpublished) literature was identified through searching the websites of health technology assessment and health technology assessment-related agencies, clinical practice guideline collections, clinical trial registries, and national and international medical specialty societies. Each article was screened for relevance and the full text acquired if determined to be relevant. The evidence obtained was reviewed and evaluated by the members of the Expert Workgroup established by The Society of Obstetricians and Gynaecologists of Canada. VALUES: The quality of evidence was evaluated and recommendations made using the use of criteria described by the Canadian Task Force on Preventive Health Care (Table).

Subject(s)

Consensus , Sexual Health , Women's Health , Canada , Dyspareunia , Female , Gynecology , Humans , Obstetrics , Sexual Behavior , Sexual Dysfunction, Physiological , Sexual Dysfunctions, Psychological , Sexuality

16.

Embedding of semantic predications.

Cohen, Trevor; Widdows, Dominic.

J Biomed Inform ; 68: 150-166, 2017 04.

Article in English | MEDLINE | ID: mdl-28284761

ABSTRACT

This paper concerns the generation of distributed vector representations of biomedical concepts from structured knowledge, in the form of subject-relation-object triplets known as semantic predications. Specifically, we evaluate the extent to which a representational approach we have developed for this purpose previously, known as Predication-based Semantic Indexing (PSI), might benefit from insights gleaned from neural-probabilistic language models, which have enjoyed a surge in popularity in recent years as a means to generate distributed vector representations of terms from free text. To do so, we develop a novel neural-probabilistic approach to encoding predications, called Embedding of Semantic Predications (ESP), by adapting aspects of the Skipgram with Negative Sampling (SGNS) algorithm to this purpose. We compare ESP and PSI across a number of tasks including recovery of encoded information, estimation of semantic similarity and relatedness, and identification of potentially therapeutic and harmful relationships using both analogical retrieval and supervised learning. We find advantages for ESP in some, but not all of these tasks, revealing the contexts in which the additional computational work of neural-probabilistic modeling is justified.

Subject(s)

Algorithms , Natural Language Processing , Semantics , Humans

17.

Measuring content overlap during handoff communication using distributional semantics: An exploratory study.

Abraham, Joanna; Kannampallil, Thomas G; Srinivasan, Vignesh; Galanter, William L; Tagney, Gail; Cohen, Trevor.

J Biomed Inform ; 65: 132-144, 2017 01.

Article in English | MEDLINE | ID: mdl-27913246

ABSTRACT

OBJECTIVE: We develop and evaluate a methodological approach to measure the degree and nature of overlap in handoff communication content within and across clinical professions. This extensible, exploratory approach relies on combining techniques from conversational analysis and distributional semantics. MATERIALS AND METHODS: We audio-recorded handoff communication of residents and nurses on the General Medicine floor of a large academic hospital (n=120 resident and n=120 nurse handoffs). We measured semantic similarity, a proxy for content overlap, between resident-resident and nurse-nurse communication using multiple steps: a qualitative conversational content analysis; an automated semantic similarity analysis using Reflective Random Indexing (RRI); and comparing semantic similarity generated by RRI analysis with human ratings of semantic similarity. RESULTS: There was significant association between the semantic similarity as computed by the RRI method and human rating (ρ=0.88). Based on the semantic similarity scores, content overlap was relatively higher for content related to patient active problems, assessment of active problems, patient-identifying information, past medical history, and medications/treatments. In contrast, content overlap was limited on content related to allergies, family-related information, code status, and anticipatory guidance. CONCLUSIONS: Our approach using RRI analysis provides new opportunities for characterizing the nature and degree of overlap in handoff communication. Although exploratory, this method provides a basis for identifying content that can be used for determining shared understanding across clinical professions. Additionally, this approach can inform the development of flexibly standardized handoff tools that reflect clinical content that are most appropriate for fostering shared understanding during transitions of care.

Subject(s)

Communication , Patient Handoff , Semantics , Humans , Natural Language Processing , Physician-Nurse Relations , Physicians

18.

Using Pathfinder networks to discover alignment between expert and consumer conceptual knowledge from online vaccine content.

Amith, Muhammad; Cunningham, Rachel; Savas, Lara S; Boom, Julie; Schvaneveldt, Roger; Tao, Cui; Cohen, Trevor.

J Biomed Inform ; 74: 33-45, 2017 10.

Article in English | MEDLINE | ID: mdl-28823922

ABSTRACT

This study demonstrates the use of distributed vector representations and Pathfinder Network Scaling (PFNETS) to represent online vaccine content created by health experts and by laypeople. By analyzing a target audience's conceptualization of a topic, domain experts can develop targeted interventions to improve the basic health knowledge of consumers. The underlying assumption is that the content created by different groups reflects the mental organization of their knowledge. Applying automated text analysis to this content may elucidate differences between the knowledge structures of laypeople (heath consumers) and professionals (health experts). This paper utilizes vaccine information generated by laypeople and health experts to investigate the utility of this approach. We used an established technique from cognitive psychology, Pathfinder Network Scaling to infer the structure of the associational networks between concepts learned from online content using methods of distributional semantics. In doing so, we extend the original application of PFNETS to infer knowledge structures from individual participants, to infer the prevailing knowledge structures within communities of content authors. The resulting graphs reveal opportunities for public health and vaccination education experts to improve communication and intervention efforts directed towards health consumers. Our efforts demonstrate the feasibility of using an automated procedure to examine the manifestation of conceptual models within large bodies of free text, revealing evidence of conflicting understanding of vaccine concepts among health consumers as compared with health experts. Additionally, this study provides insight into the differences between consumer and expert abstraction of domain knowledge, revealing vaccine-related knowledge gaps that suggest opportunities to improve provider-patient communication.

Subject(s)

Community Participation , Online Systems , Vaccines , Automation , Humans

19.

No. 279-Female Sexual Health Consensus Clinical Guidelines.

Lamont, John; Bajzak, Krisztina; Bouchard, Céline; Burnett, Margaret; Byers, Sandra; Cohen, Trevor; Fisher, William; Holzapfel, Stephen; Senikas, Vyta.

J Obstet Gynaecol Can ; 39(12): e535-e541, 2017 Dec.

Article in English | MEDLINE | ID: mdl-29197489

ABSTRACT

OBJECTIVE: To establish national guidelines for the assessment of women's sexual health concerns and the provision of sexual health care for women. EVIDENCE: Published literature was retrieved through searches of PubMed, CINAHL, and the Cochrane Library from May to October 2010, using appropriate controlled vocabulary (e .g., sexuality, "sexual dysfunction," "physiological," dyspareunia) and key words (e .g ., sexual dysfunction, sex therapy, anorgasmia). Results were restricted, where possible, to systematic reviews, randomized control trials/controlled clinical trials, and observational studies. There were no language restrictions. Searches were updated on a regular basis and incorporated in the guideline to December 2010. Grey (unpublished) literature was identified through searching the websites of health technology assessment and health technology assessment-related agencies, clinical practice guideline collections, clinical trial registries, and national and international medical specialty societies. Each article was screened for relevance and the full text acquired if determined to be relevant. The evidence obtained was reviewed and evaluated by the members of the Expert Workgroup established by The Society of Obstetricians and Gynaecologists of Canada. VALUES: The quality of evidence was evaluated and recommendations made using the use of criteria described by the Canadian Task Force on Preventive Health Care (Table).

Subject(s)

Sexual Health , Women's Health

20.

An active learning-enabled annotation system for clinical named entity recognition.

Chen, Yukun; Lask, Thomas A; Mei, Qiaozhu; Chen, Qingxia; Moon, Sungrim; Wang, Jingqi; Nguyen, Ky; Dawodu, Tolulola; Cohen, Trevor; Denny, Joshua C; Xu, Hua.

BMC Med Inform Decis Mak ; 17(Suppl 2): 82, 2017 Jul 05.

Article in English | MEDLINE | ID: mdl-28699546

ABSTRACT

BACKGROUND: Active learning (AL) has shown the promising potential to minimize the annotation cost while maximizing the performance in building statistical natural language processing (NLP) models. However, very few studies have investigated AL in a real-life setting in medical domain. METHODS: In this study, we developed the first AL-enabled annotation system for clinical named entity recognition (NER) with a novel AL algorithm. Besides the simulation study to evaluate the novel AL algorithm, we further conducted user studies with two nurses using this system to assess the performance of AL in real world annotation processes for building clinical NER models. RESULTS: The simulation results show that the novel AL algorithm outperformed traditional AL algorithm and random sampling. However, the user study tells a different story that AL methods did not always perform better than random sampling for different users. CONCLUSIONS: We found that the increased information content of actively selected sentences is strongly offset by the increased time required to annotate them. Moreover, the annotation time was not considered in the querying algorithms. Our future work includes developing better AL algorithms with the estimation of annotation time and evaluating the system with larger number of users.

Subject(s)

Medical Informatics , Natural Language Processing , Problem-Based Learning , Computer Simulation , Humans

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL