Pesquisa | Biblioteca Virtual em Saúde Fiocruz

1.

Social Marketing Perspective on Participant Recruitment in Informatics-Based Intervention Studies.

Idnay, Betina; Cordoba, Evette; Ramirez, Sergio Ozoria; Xiao, Eugenia; Wood, Olivia R; Batey, D Scott; Garofalo, Robert; Schnall, Rebecca.

AIDS Behav ; 2024 May 04.

Artigo em Inglês | MEDLINE | ID: mdl-38703337

RESUMO

Effective recruitment strategies are pivotal for informatics-based intervention trials success, particularly for people living with HIV (PLWH), where engagement can be challenging. Although informatics interventions are recognized for improving health outcomes, the effectiveness of their recruitment strategies remains unclear. We investigated the application of a social marketing framework in navigating the nuances of recruitment for informatics-based intervention trials for PLWH by examining participant experiences and perceptions. We used qualitative descriptive methodology to conduct semi-structured interviews with 90 research participants from four informatics-based intervention trials. Directed inductive and deductive content analyses were guided by Howcutt et al.'s social marketing framework on applying the decision-making process to research recruitment. The majority were male (86.7%), living in the Northeast United States (56%), and identified as Black (32%) or White (32%). Most participants (60%) completed the interview remotely. Sixteen subthemes emerged from five themes: motivation, perception, attitude formation, integration, and learning. Findings from our interview data suggest that concepts from Howcutt et al.'s framework informed participants' decisions to participate in an informatics-based intervention trial. We found that the participants' perceptions of trust in the research process were integral to the participants across the four trials. However, the recruitment approach and communication medium preferences varied between older and younger age groups. Social marketing framework can provide insight into improving the research recruitment process. Future work should delve into the complex interplay between the type of informatics-based interventions, trust in the research process, and communication preferences, and how these factors collectively influence participants' willingness to engage.

2.

Promoting equity in clinical research: The role of social determinants of health.

Idnay, Betina; Fang, Yilu; Stanley, Edward; Ruotolo, Brenda; Chung, Wendy K; Marder, Karen; Weng, Chunhua.

J Biomed Inform ; 156: 104663, 2024 Jun 04.

Artigo em Inglês | MEDLINE | ID: mdl-38838949

RESUMO

OBJECTIVE: This study aims to investigate the association between social determinants of health (SDoH) and clinical research recruitment outcomes and recommends evidence-based strategies to enhance equity. MATERIALS AND METHODS: Data were collected from the internal clinical study manager database, clinical data warehouse, and clinical research registry. Study characteristics (e.g., study phase) and sociodemographic information were extracted. Median neighborhood income, distance from the study location, and Area Deprivation Index (ADI) were calculated. Mixed effect generalized regression was used for clustering effects and false discovery rate adjustment for multiple testing. A stratified analysis was performed to examine the impact in distinct medical departments. RESULTS: The study sample consisted of 3,962 individuals, with a mean age of 61.5 years, 53.6 % male, 54.2 % White, and 49.1 % non-Hispanic or Latino. Study characteristics revealed a variety of protocols across different departments, with cardiology having the highest percentage of participants (46.4 %). Industry funding was the most common (74.5 %), and digital advertising and personal outreach were the main recruitment methods (58.9 % and 90.8 %). DISCUSSION: The analysis demonstrated significant associations between participant characteristics and research participation, including biological sex, age, ethnicity, and language. The stratified analysis revealed other significant associations for recruitment strategies. SDoH is crucial to clinical research recruitment, and this study presents evidence-based solutions for equity and inclusivity. Researchers can tailor recruitment strategies to overcome barriers and increase participant diversity by identifying participant characteristics and research involvement status. CONCLUSION: The findings highlight the relevance of clinical research inequities and equitable representation of historically underrepresented populations. We need to improve recruitment strategies to promote diversity and inclusivity in research.

3.

Criteria2Query 3.0: Leveraging generative large language models for clinical trial eligibility query generation.

Park, Jimyung; Fang, Yilu; Ta, Casey; Zhang, Gongbo; Idnay, Betina; Chen, Fangyi; Feng, David; Shyu, Rebecca; Gordon, Emily R; Spotnitz, Matthew; Weng, Chunhua.

J Biomed Inform ; 154: 104649, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38697494

RESUMO

OBJECTIVE: Automated identification of eligible patients is a bottleneck of clinical research. We propose Criteria2Query (C2Q) 3.0, a system that leverages GPT-4 for the semi-automatic transformation of clinical trial eligibility criteria text into executable clinical database queries. MATERIALS AND METHODS: C2Q 3.0 integrated three GPT-4 prompts for concept extraction, SQL query generation, and reasoning. Each prompt was designed and evaluated separately. The concept extraction prompt was benchmarked against manual annotations from 20 clinical trials by two evaluators, who later also measured SQL generation accuracy and identified errors in GPT-generated SQL queries from 5 clinical trials. The reasoning prompt was assessed by three evaluators on four metrics: readability, correctness, coherence, and usefulness, using corrected SQL queries and an open-ended feedback questionnaire. RESULTS: Out of 518 concepts from 20 clinical trials, GPT-4 achieved an F1-score of 0.891 in concept extraction. For SQL generation, 29 errors spanning seven categories were detected, with logic errors being the most common (n = 10; 34.48 %). Reasoning evaluations yielded a high coherence rating, with the mean score being 4.70 but relatively lower readability, with a mean of 3.95. Mean scores of correctness and usefulness were identified as 3.97 and 4.37, respectively. CONCLUSION: GPT-4 significantly improves the accuracy of extracting clinical trial eligibility criteria concepts in C2Q 3.0. Continued research is warranted to ensure the reliability of large language models.

Assuntos

Ensaios Clínicos como Assunto , Humanos , Processamento de Linguagem Natural , Software , Seleção de Pacientes

4.

Using Natural Language Processing to Identify Stigmatizing Language in Labor and Birth Clinical Notes.

Barcelona, Veronica; Scharp, Danielle; Moen, Hans; Davoudi, Anahita; Idnay, Betina R; Cato, Kenrick; Topaz, Maxim.

Matern Child Health J ; 28(3): 578-586, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-38147277

RESUMO

INTRODUCTION: Stigma and bias related to race and other minoritized statuses may underlie disparities in pregnancy and birth outcomes. One emerging method to identify bias is the study of stigmatizing language in the electronic health record. The objective of our study was to develop automated natural language processing (NLP) methods to identify two types of stigmatizing language: marginalizing language and its complement, power/privilege language, accurately and automatically in labor and birth notes. METHODS: We analyzed notes for all birthing people > 20 weeks' gestation admitted for labor and birth at two hospitals during 2017. We then employed text preprocessing techniques, specifically using TF-IDF values as inputs, and tested machine learning classification algorithms to identify stigmatizing and power/privilege language in clinical notes. The algorithms assessed included Decision Trees, Random Forest, and Support Vector Machines. Additionally, we applied a feature importance evaluation method (InfoGain) to discern words that are highly correlated with these language categories. RESULTS: For marginalizing language, Decision Trees yielded the best classification with an F-score of 0.73. For power/privilege language, Support Vector Machines performed optimally, achieving an F-score of 0.91. These results demonstrate the effectiveness of the selected machine learning methods in classifying language categories in clinical notes. CONCLUSION: We identified well-performing machine learning methods to automatically detect stigmatizing language in clinical notes. To our knowledge, this is the first study to use NLP performance metrics to evaluate the performance of machine learning methods in discerning stigmatizing language. Future studies should delve deeper into refining and evaluating NLP methods, incorporating the latest algorithms rooted in deep learning.

Assuntos

Algoritmos , Processamento de Linguagem Natural , Feminino , Humanos , Registros Eletrônicos de Saúde , Aprendizado de Máquina , Idioma

5.

A data-driven approach to optimizing clinical study eligibility criteria.

Fang, Yilu; Liu, Hao; Idnay, Betina; Ta, Casey; Marder, Karen; Weng, Chunhua.

J Biomed Inform ; 142: 104375, 2023 06.

Artigo em Inglês | MEDLINE | ID: mdl-37141977

RESUMO

OBJECTIVE: Feasible, safe, and inclusive eligibility criteria are crucial to successful clinical research recruitment. Existing expert-centered methods for eligibility criteria selection may not be representative of real-world populations. This paper presents a novel model called OPTEC (OPTimal Eligibility Criteria) based on the Multiple Attribute Decision Making method boosted by an efficient greedy algorithm. METHODS: It systematically identifies the optimal criteria combination for a given medical condition with the optimal tradeoff among feasibility, patient safety, and cohort diversity. The model offers flexibility in attribute configurations and generalizability to various clinical domains. The model was evaluated on two clinical domains (i.e., Alzheimer's disease and Neoplasm of pancreas) using two datasets (i.e., MIMIC-III dataset and NewYork-Presbyterian/Columbia University Irving Medical Center (NYP/CUIMC) database). RESULTS: We simulated the process of automatically optimizing eligibility criteria according to user-specified prioritization preferences and generated recommendations based on the top-ranked criteria combination accordingly (top 0.41-2.75%) with OPTEC. Harnessing the power of the model, we designed an interactive criteria recommendation system and conducted a case study with an experienced clinical researcher using the think-aloud protocol. CONCLUSIONS: The results demonstrated that OPTEC could be used to recommend feasible eligibility criteria combinations, and to provide actionable recommendations for clinical study designers to construct a feasible, safe, and diverse cohort definition during early study design.

Assuntos

Algoritmos , Projetos de Pesquisa , Humanos , Seleção de Pacientes , Definição da Elegibilidade , Pesquisadores

6.

A qualitative analysis of stigmatizing language in birth admission clinical notes.

Barcelona, Veronica; Scharp, Danielle; Idnay, Betina R; Moen, Hans; Goffman, Dena; Cato, Kenrick; Topaz, Maxim.

Nurs Inq ; 30(3): e12557, 2023 07.

Artigo em Inglês | MEDLINE | ID: mdl-37073504

RESUMO

The presence of stigmatizing language in the electronic health record (EHR) has been used to measure implicit biases that underlie health inequities. The purpose of this study was to identify the presence of stigmatizing language in the clinical notes of pregnant people during the birth admission. We conducted a qualitative analysis on N = 1117 birth admission EHR notes from two urban hospitals in 2017. We identified stigmatizing language categories, such as Disapproval (39.3%), Questioning patient credibility (37.7%), Difficult patient (21.3%), Stereotyping (1.6%), and Unilateral decisions (1.6%) in 61 notes (5.4%). We also defined a new stigmatizing language category indicating Power/privilege. This was present in 37 notes (3.3%) and signaled approval of social status, upholding a hierarchy of bias. The stigmatizing language was most frequently identified in birth admission triage notes (16%) and least frequently in social work initial assessments (13.7%). We found that clinicians from various disciplines recorded stigmatizing language in the medical records of birthing people. This language was used to question birthing people's credibility and convey disapproval of decision-making abilities for themselves or their newborns. We reported a Power/privilege language bias in the inconsistent documentation of traits considered favorable for patient outcomes (e.g., employment status). Future work on stigmatizing language may inform tailored interventions to improve perinatal outcomes for all birthing people and their families.

Assuntos

Idioma , Estereotipagem , Recém-Nascido , Gravidez , Feminino , Humanos , Registros Eletrônicos de Saúde

7.

Identifying stigmatizing language in clinical documentation: A scoping review of emerging literature.

Barcelona, Veronica; Scharp, Danielle; Idnay, Betina R; Moen, Hans; Cato, Kenrick; Topaz, Maxim.

PLoS One ; 19(6): e0303653, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38941299

RESUMO

BACKGROUND: Racism and implicit bias underlie disparities in health care access, treatment, and outcomes. An emerging area of study in examining health disparities is the use of stigmatizing language in the electronic health record (EHR). OBJECTIVES: We sought to summarize the existing literature related to stigmatizing language documented in the EHR. To this end, we conducted a scoping review to identify, describe, and evaluate the current body of literature related to stigmatizing language and clinician notes. METHODS: We searched PubMed, Cumulative Index of Nursing and Allied Health Literature (CINAHL), and Embase databases in May 2022, and also conducted a hand search of IEEE to identify studies investigating stigmatizing language in clinical documentation. We included all studies published through April 2022. The results for each search were uploaded into EndNote X9 software, de-duplicated using the Bramer method, and then exported to Covidence software for title and abstract screening. RESULTS: Studies (N = 9) used cross-sectional (n = 3), qualitative (n = 3), mixed methods (n = 2), and retrospective cohort (n = 1) designs. Stigmatizing language was defined via content analysis of clinical documentation (n = 4), literature review (n = 2), interviews with clinicians (n = 3) and patients (n = 1), expert panel consultation, and task force guidelines (n = 1). Natural language processing was used in four studies to identify and extract stigmatizing words from clinical notes. All of the studies reviewed concluded that negative clinician attitudes and the use of stigmatizing language in documentation could negatively impact patient perception of care or health outcomes. DISCUSSION: The current literature indicates that NLP is an emerging approach to identifying stigmatizing language documented in the EHR. NLP-based solutions can be developed and integrated into routine documentation systems to screen for stigmatizing language and alert clinicians or their supervisors. Potential interventions resulting from this research could generate awareness about how implicit biases affect communication patterns and work to achieve equitable health care for diverse populations.

Assuntos

Documentação , Registros Eletrônicos de Saúde , Humanos , Idioma , Estereotipagem , Racismo

8.

Retrieval augmented scientific claim verification.

Liu, Hao; Soroush, Ali; Nestor, Jordan G; Park, Elizabeth; Idnay, Betina; Fang, Yilu; Pan, Jane; Liao, Stan; Bernard, Marguerite; Peng, Yifan; Weng, Chunhua.

JAMIA Open ; 7(1): ooae021, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-38455840

RESUMO

Objective: To automate scientific claim verification using PubMed abstracts. Materials and Methods: We developed CliVER, an end-to-end scientific Claim VERification system that leverages retrieval-augmented techniques to automatically retrieve relevant clinical trial abstracts, extract pertinent sentences, and use the PICO framework to support or refute a scientific claim. We also created an ensemble of three state-of-the-art deep learning models to classify rationale of support, refute, and neutral. We then constructed CoVERt, a new COVID VERification dataset comprising 15 PICO-encoded drug claims accompanied by 96 manually selected and labeled clinical trial abstracts that either support or refute each claim. We used CoVERt and SciFact (a public scientific claim verification dataset) to assess CliVER's performance in predicting labels. Finally, we compared CliVER to clinicians in the verification of 19 claims from 6 disease domains, using 189 648 PubMed abstracts extracted from January 2010 to October 2021. Results: In the evaluation of label prediction accuracy on CoVERt, CliVER achieved a notable F1 score of 0.92, highlighting the efficacy of the retrieval-augmented models. The ensemble model outperforms each individual state-of-the-art model by an absolute increase from 3% to 11% in the F1 score. Moreover, when compared with four clinicians, CliVER achieved a precision of 79.0% for abstract retrieval, 67.4% for sentence selection, and 63.2% for label prediction, respectively. Conclusion: CliVER demonstrates its early potential to automate scientific claim verification using retrieval-augmented strategies to harness the wealth of clinical trial abstracts in PubMed. Future studies are warranted to further test its clinical utility.

9.

Sociotechnical feasibility of natural language processing-driven tools in clinical trial eligibility prescreening for Alzheimer's disease and related dementias.

Idnay, Betina; Liu, Jianfang; Fang, Yilu; Hernandez, Alex; Kaw, Shivani; Etwaru, Alicia; Juarez Padilla, Janeth; Ramírez, Sergio Ozoria; Marder, Karen; Weng, Chunhua; Schnall, Rebecca.

J Am Med Inform Assoc ; 31(5): 1062-1073, 2024 Apr 19.

Artigo em Inglês | MEDLINE | ID: mdl-38447587

RESUMO

BACKGROUND: Alzheimer's disease and related dementias (ADRD) affect over 55 million globally. Current clinical trials suffer from low recruitment rates, a challenge potentially addressable via natural language processing (NLP) technologies for researchers to effectively identify eligible clinical trial participants. OBJECTIVE: This study investigates the sociotechnical feasibility of NLP-driven tools for ADRD research prescreening and analyzes the tools' cognitive complexity's effect on usability to identify cognitive support strategies. METHODS: A randomized experiment was conducted with 60 clinical research staff using three prescreening tools (Criteria2Query, Informatics for Integrating Biology and the Bedside [i2b2], and Leaf). Cognitive task analysis was employed to analyze the usability of each tool using the Health Information Technology Usability Evaluation Scale. Data analysis involved calculating descriptive statistics, interrater agreement via intraclass correlation coefficient, cognitive complexity, and Generalized Estimating Equations models. RESULTS: Leaf scored highest for usability followed by Criteria2Query and i2b2. Cognitive complexity was found to be affected by age, computer literacy, and number of criteria, but was not significantly associated with usability. DISCUSSION: Adopting NLP for ADRD prescreening demands careful task delegation, comprehensive training, precise translation of eligibility criteria, and increased research accessibility. The study highlights the relevance of these factors in enhancing NLP-driven tools' usability and efficacy in clinical research prescreening. CONCLUSION: User-modifiable NLP-driven prescreening tools were favorably received, with system type, evaluation sequence, and user's computer literacy influencing usability more than cognitive complexity. The study emphasizes NLP's potential in improving recruitment for clinical trials, endorsing a mixed-methods approach for future system evaluation and enhancements.

Assuntos

Doença de Alzheimer , Informática Médica , Humanos , Processamento de Linguagem Natural , Estudos de Viabilidade , Definição da Elegibilidade

10.

A Survey of Clinicians' Views of the Utility of Large Language Models.

Spotnitz, Matthew; Idnay, Betina; Gordon, Emily R; Shyu, Rebecca; Zhang, Gongbo; Liu, Cong; Cimino, James J; Weng, Chunhua.

Appl Clin Inform ; 15(2): 306-312, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-38442909

RESUMO

OBJECTIVES: Large language models (LLMs) like Generative pre-trained transformer (ChatGPT) are powerful algorithms that have been shown to produce human-like text from input data. Several potential clinical applications of this technology have been proposed and evaluated by biomedical informatics experts. However, few have surveyed health care providers for their opinions about whether the technology is fit for use. METHODS: We distributed a validated mixed-methods survey to gauge practicing clinicians' comfort with LLMs for a breadth of tasks in clinical practice, research, and education, which were selected from the literature. RESULTS: A total of 30 clinicians fully completed the survey. Of the 23 tasks, 16 were rated positively by more than 50% of the respondents. Based on our qualitative analysis, health care providers considered LLMs to have excellent synthesis skills and efficiency. However, our respondents had concerns that LLMs could generate false information and propagate training data bias.Our survey respondents were most comfortable with scenarios that allow LLMs to function in an assistive role, like a physician extender or trainee. CONCLUSION: In a mixed-methods survey of clinicians about LLM use, health care providers were encouraging of having LLMs in health care for many tasks, and especially in assistive roles. There is a need for continued human-centered development of both LLMs and artificial intelligence in general.

Assuntos

Algoritmos , Inteligência Artificial , Humanos , Instalações de Saúde , Pessoal de Saúde , Idioma

11.

The suitability of UMLS and SNOMED-CT for encoding outcome concepts.

Newbury, Abigail; Liu, Hao; Idnay, Betina; Weng, Chunhua.

J Am Med Inform Assoc ; 30(12): 1895-1903, 2023 11 17.

Artigo em Inglês | MEDLINE | ID: mdl-37615994

RESUMO

OBJECTIVE: Outcomes are important clinical study information. Despite progress in automated extraction of PICO (Population, Intervention, Comparison, and Outcome) entities from PubMed, rarely are these entities encoded by standard terminology to achieve semantic interoperability. This study aims to evaluate the suitability of the Unified Medical Language System (UMLS) and SNOMED-CT in encoding outcome concepts in randomized controlled trial (RCT) abstracts. MATERIALS AND METHODS: We iteratively developed and validated an outcome annotation guideline and manually annotated clinically significant outcome entities in the Results and Conclusions sections of 500 randomly selected RCT abstracts on PubMed. The extracted outcomes were fully, partially, or not mapped to the UMLS via MetaMap based on established heuristics. Manual UMLS browser search was performed for select unmapped outcome entities to further differentiate between UMLS and MetaMap errors. RESULTS: Only 44% of 2617 outcome concepts were fully covered in the UMLS, among which 67% were complex concepts that required the combination of 2 or more UMLS concepts to represent them. SNOMED-CT was present as a source in 61% of the fully mapped outcomes. DISCUSSION: Domains such as Metabolism and Nutrition, and Infections and Infectious Diseases need expanded outcome concept coverage in the UMLS and MetaMap. Future work is warranted to similarly assess the terminology coverage for P, I, C entities. CONCLUSION: Computational representation of clinical outcomes is important for clinical evidence extraction and appraisal and yet faces challenges from the inherent complexity and lack of coverage of these concepts in UMLS and SNOMED-CT, as demonstrated in this study.

Assuntos

Systematized Nomenclature of Medicine , Unified Medical Language System , PubMed , Ensaios Clínicos Controlados Aleatórios como Assunto

12.

Clinical research staff perceptions on a natural language processing-driven tool for eligibility prescreening: An iterative usability assessment.

Idnay, Betina; Fang, Yilu; Dreisbach, Caitlin; Marder, Karen; Weng, Chunhua; Schnall, Rebecca.

Int J Med Inform ; 171: 104985, 2023 03.

Artigo em Inglês | MEDLINE | ID: mdl-36638583

RESUMO

BACKGROUND: Participant recruitment is a barrier to successful clinical research. One strategy to improve recruitment is to conduct eligibility prescreening, a resource-intensive process where clinical research staff manually reviews electronic health records data to identify potentially eligible patients. Criteria2Query (C2Q) was developed to address this problem by capitalizing on natural language processing to generate queries to identify eligible participants from clinical databases semi-autonomously. OBJECTIVE: We examined the clinical research staff's perceived usability of C2Q for clinical research eligibility prescreening. METHODS: Twenty clinical research staff evaluated the usability of C2Q using a cognitive walkthrough with a think-aloud protocol and a Post-Study System Usability Questionnaire. On-screen activity and audio were recorded and transcribed. After every-five evaluators completed an evaluation, usability problems were rated by informatics experts and prioritized for system refinement. There were four iterations of system refinement based on the evaluation feedback. Guided by the Organizational Framework for Intuitive Human-computer Interaction, we performed a directed deductive content analysis of the verbatim transcriptions. RESULTS: Evaluators aged from 24 to 46 years old (33.8; SD: 7.32) demonstrated high computer literacy (6.36; SD:0.17); female (75 %), White (35 %), and clinical research coordinators (45 %). C2Q demonstrated high usability during the final cycle (2.26 out of 7 [lower scores are better], SD: 0.74). The number of unique usability issues decreased after each refinement. Fourteen subthemes emerged from three themes: seeking user goals, performing well-learned tasks, and determining what to do next. CONCLUSIONS: The cognitive walkthrough with a think-aloud protocol informed iterative system refinement and demonstrated the usability of C2Q by clinical research staff. Key recommendations for system development and implementation include improving system intuitiveness and overall user experience through comprehensive consideration of user needs and requirements for task completion.

Assuntos

Processamento de Linguagem Natural , Interface Usuário-Computador , Humanos , Feminino , Adulto Jovem , Adulto , Pessoa de Meia-Idade , Computadores , Registros Eletrônicos de Saúde , Registros

13.

Evaluating Large Language Models on Medical Evidence Summarization.

Tang, Liyan; Sun, Zhaoyi; Idnay, Betina; Nestor, Jordan G; Soroush, Ali; Elias, Pierre A; Xu, Ziyang; Ding, Ying; Durrett, Greg; Rousseau, Justin; Weng, Chunhua; Peng, Yifan.

medRxiv ; 2023 Apr 24.

Artigo em Inglês | MEDLINE | ID: mdl-37162998

RESUMO

Recent advances in large language models (LLMs) have demonstrated remarkable successes in zero- and few-shot performance on various downstream tasks, paving the way for applications in high-stakes domains. In this study, we systematically examine the capabilities and limitations of LLMs, specifically GPT-3.5 and ChatGPT, in performing zero-shot medical evidence summarization across six clinical domains. We conduct both automatic and human evaluations, covering several dimensions of summary quality. Our study has demonstrated that automatic metrics often do not strongly correlate with the quality of summaries. Furthermore, informed by our human evaluations, we define a terminology of error types for medical evidence summarization. Our findings reveal that LLMs could be susceptible to generating factually inconsistent summaries and making overly convincing or uncertain statements, leading to potential harm due to misinformation. Moreover, we find that models struggle to identify the salient information and are more error-prone when summarizing over longer textual contexts.

14.

Principal Investigators' Perceptions on Factors Associated with Successful Recruitment in Clinical Trials.

Idnay, Betina; Butler, Alex; Fang, Yilu; Li, Ziran; Lee, Junghwan; Ta, Casey; Liu, Cong; Ruotolo, Brenda; Yuan, Chi; Chen, Huanyao; Hripcsak, George; Larson, Elaine; Weng, Chunhua.

AMIA Jt Summits Transl Sci Proc ; 2023: 281-290, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37350899

RESUMO

Participant recruitment continues to be a challenge to the success of randomized controlled trials, resulting in increased costs, extended trial timelines and delayed treatment availability. Literature provides evidence that study design features (e.g., trial phase, study site involvement) and trial sponsor are significantly associated with recruitment success. Principal investigators oversee the conduct of clinical trials, including recruitment. Through a cross-sectional survey and a thematic analysis of free-text responses, we assessed the perceptions of sixteen principal investigators regarding success factors for participant recruitment. Study site involvement and funding source do not necessarily make recruitment easier or more challenging from the perspective of the principal investigators. The most commonly used recruitment strategies are also the most effort inefficient (e.g., in-person recruitment, reviewing the electronic medical records for prescreening). Finally, we recommended actionable steps, such as improving staff support and leveraging informatics-driven approaches, to allow clinical researchers to enhance participant recruitment.

15.

Uncovering key clinical trial features influencing recruitment.

Idnay, Betina; Fang, Yilu; Butler, Alex; Moran, Joyce; Li, Ziran; Lee, Junghwan; Ta, Casey; Liu, Cong; Yuan, Chi; Chen, Huanyao; Stanley, Edward; Hripcsak, George; Larson, Elaine; Marder, Karen; Chung, Wendy; Ruotolo, Brenda; Weng, Chunhua.

J Clin Transl Sci ; 7(1): e199, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37830010

RESUMO

Background: Randomized clinical trials (RCT) are the foundation for medical advances, but participant recruitment remains a persistent barrier to their success. This retrospective data analysis aims to (1) identify clinical trial features associated with successful participant recruitment measured by accrual percentage and (2) compare the characteristics of the RCTs by assessing the most and least successful recruitment, which are indicated by varying thresholds of accrual percentage such as ≥ 90% vs ≤ 10%, ≥ 80% vs ≤ 20%, and ≥ 70% vs ≤ 30%. Methods: Data from the internal research registry at Columbia University Irving Medical Center and Aggregated Analysis of ClinicalTrials.gov were collected for 393 randomized interventional treatment studies closed to further enrollment. We compared two regularized linear regression and six tree-based machine learning models for accrual percentage (i.e., reported accrual to date divided by the target accrual) prediction. The outperforming model and Tree SHapley Additive exPlanations were used for feature importance analysis for participant recruitment. The identified features were compared between the two subgroups. Results: CatBoost regressor outperformed the others. Key features positively associated with recruitment success, as measured by accrual percentage, include government funding and compensation. Meanwhile, cancer research and non-conventional recruitment methods (e.g., websites) are negatively associated with recruitment success. Statistically significant subgroup differences (corrected p-value < .05) were found in 15 of the top 30 most important features. Conclusion: This multi-source retrospective study highlighted key features influencing RCT participant recruitment, offering actionable steps for improvement, including flexible recruitment infrastructure and appropriate participant compensation.

16.

Evaluating large language models on medical evidence summarization.

Tang, Liyan; Sun, Zhaoyi; Idnay, Betina; Nestor, Jordan G; Soroush, Ali; Elias, Pierre A; Xu, Ziyang; Ding, Ying; Durrett, Greg; Rousseau, Justin F; Weng, Chunhua; Peng, Yifan.

NPJ Digit Med ; 6(1): 158, 2023 Aug 24.

Artigo em Inglês | MEDLINE | ID: mdl-37620423

RESUMO

Recent advances in large language models (LLMs) have demonstrated remarkable successes in zero- and few-shot performance on various downstream tasks, paving the way for applications in high-stakes domains. In this study, we systematically examine the capabilities and limitations of LLMs, specifically GPT-3.5 and ChatGPT, in performing zero-shot medical evidence summarization across six clinical domains. We conduct both automatic and human evaluations, covering several dimensions of summary quality. Our study demonstrates that automatic metrics often do not strongly correlate with the quality of summaries. Furthermore, informed by our human evaluations, we define a terminology of error types for medical evidence summarization. Our findings reveal that LLMs could be susceptible to generating factually inconsistent summaries and making overly convincing or uncertain statements, leading to potential harm due to misinformation. Moreover, we find that models struggle to identify the salient information and are more error-prone when summarizing over longer textual contexts.

17.

Combining human and machine intelligence for clinical trial eligibility querying.

Fang, Yilu; Idnay, Betina; Sun, Yingcheng; Liu, Hao; Chen, Zhehuan; Marder, Karen; Xu, Hua; Schnall, Rebecca; Weng, Chunhua.

J Am Med Inform Assoc ; 29(7): 1161-1171, 2022 06 14.

Artigo em Inglês | MEDLINE | ID: mdl-35426943

RESUMO

OBJECTIVE: To combine machine efficiency and human intelligence for converting complex clinical trial eligibility criteria text into cohort queries. MATERIALS AND METHODS: Criteria2Query (C2Q) 2.0 was developed to enable real-time user intervention for criteria selection and simplification, parsing error correction, and concept mapping. The accuracy, precision, recall, and F1 score of enhanced modules for negation scope detection, temporal and value normalization were evaluated using a previously curated gold standard, the annotated eligibility criteria of 1010 COVID-19 clinical trials. The usability and usefulness were evaluated by 10 research coordinators in a task-oriented usability evaluation using 5 Alzheimer's disease trials. Data were collected by user interaction logging, a demographic questionnaire, the Health Information Technology Usability Evaluation Scale (Health-ITUES), and a feature-specific questionnaire. RESULTS: The accuracies of negation scope detection, temporal and value normalization were 0.924, 0.916, and 0.966, respectively. C2Q 2.0 achieved a moderate usability score (3.84 out of 5) and a high learnability score (4.54 out of 5). On average, 9.9 modifications were made for a clinical study. Experienced researchers made more modifications than novice researchers. The most frequent modification was deletion (5.35 per study). Furthermore, the evaluators favored cohort queries resulting from modifications (score 4.1 out of 5) and the user engagement features (score 4.3 out of 5). DISCUSSION AND CONCLUSION: Features to engage domain experts and to overcome the limitations in automated machine output are shown to be useful and user-friendly. We concluded that human-computer collaboration is key to improving the adoption and user-friendliness of natural language processing.

Assuntos

COVID-19 , Inteligência Artificial , Definição da Elegibilidade/métodos , Humanos , Processamento de Linguagem Natural , Seleção de Pacientes

18.

A systematic review on natural language processing systems for eligibility prescreening in clinical research.

Idnay, Betina; Dreisbach, Caitlin; Weng, Chunhua; Schnall, Rebecca.

J Am Med Inform Assoc ; 29(1): 197-206, 2021 12 28.

Artigo em Inglês | MEDLINE | ID: mdl-34725689

RESUMO

OBJECTIVE: We conducted a systematic review to assess the effect of natural language processing (NLP) systems in improving the accuracy and efficiency of eligibility prescreening during the clinical research recruitment process. MATERIALS AND METHODS: Guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) standards of quality for reporting systematic reviews, a protocol for study eligibility was developed a priori and registered in the PROSPERO database. Using predetermined inclusion criteria, studies published from database inception through February 2021 were identified from 5 databases. The Joanna Briggs Institute Critical Appraisal Checklist for Quasi-experimental Studies was adapted to determine the study quality and the risk of bias of the included articles. RESULTS: Eleven studies representing 8 unique NLP systems met the inclusion criteria. These studies demonstrated moderate study quality and exhibited heterogeneity in the study design, setting, and intervention type. All 11 studies evaluated the NLP system's performance for identifying eligible participants; 7 studies evaluated the system's impact on time efficiency; 4 studies evaluated the system's impact on workload; and 2 studies evaluated the system's impact on recruitment. DISCUSSION: NLP systems in clinical research eligibility prescreening are an understudied but promising field that requires further research to assess its impact on real-world adoption. Future studies should be centered on continuing to develop and evaluate relevant NLP systems to improve enrollment into clinical studies. CONCLUSION: Understanding the role of NLP systems in improving eligibility prescreening is critical to the advancement of clinical research recruitment.

Assuntos

Definição da Elegibilidade , Processamento de Linguagem Natural , Lista de Checagem , Gerenciamento de Dados , Humanos , Projetos de Pesquisa

19.

Cognitive Function Characterization Using Electronic Health Records Notes.

Pichon, Adrienne; Idnay, Betina; Marder, Karen; Schnall, Rebecca; Weng, Chunhua.

AMIA Annu Symp Proc ; 2021: 999-1008, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-35308911

RESUMO

Cognitive impairment is a defining feature of neurological disorders such as Alzheimer's disease (AD), one of the leading causes of disability and mortality in the elderly population. Assessing cognitive impairment is important for diagnostic, clinical management, and research purposes. The Folstein Mini-Mental State Examination (MMSE) is the most common screening measure of cognitive function, yet this score is not consistently available in the electronic health records. We conducted a pilot study to extract frequently used concepts characterizing cognitive function from the clinical notes of AD patients in an Aging and Dementia clinical practice. Then we developed a model to infer the severity of cognitive impairment and created a subspecialized taxonomy for concepts associated with MMSE scores. We evaluated the taxonomy and the severity prediction model and presented example use cases of this model.

Assuntos

Doença de Alzheimer , Disfunção Cognitiva , Idoso , Doença de Alzheimer/diagnóstico , Cognição , Disfunção Cognitiva/diagnóstico , Disfunção Cognitiva/psicologia , Registros Eletrônicos de Saúde , Humanos , Testes Neuropsicológicos , Projetos Piloto

20.

Participatory Design of a Clinical Trial Eligibility Criteria Simplification Method.

Fang, Yilu; Kim, Jae Hyun; Idnay, Betina Ross; Aragon Garcia, Rebeca; Castillo, Carmen E; Sun, Yingcheng; Liu, Hao; Liu, Cong; Yuan, Chi; Weng, Chunhua.

Stud Health Technol Inform ; 281: 984-988, 2021 May 27.

Artigo em Inglês | MEDLINE | ID: mdl-34042820

RESUMO

Clinical trial eligibility criteria are important for selecting the right participants for clinical trials. However, they are often complex and not computable. This paper presents the participatory design of a human-computer collaboration method for criteria simplification that includes natural language processing followed by user-centered eligibility criteria simplification. A case study on the ARCADIA trial shows how criteria were simplified for structured database querying by clinical researchers and identifies rules for criteria simplification and concept normalization.

Assuntos

Processamento de Linguagem Natural , Pesquisadores , Bases de Dados Factuais , Definição da Elegibilidade , Humanos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA