Search | VHL Regional Portal

1.

Semi-supervised Double Deep Learning Temporal Risk Prediction (SeDDLeR) with Electronic Health Records.

Nogues, Isabelle-Emmanuella; Wen, Jun; Zhao, Yihan; Bonzel, Clara-Lea; Castro, Victor M; Lin, Yucong; Xu, Shike; Hou, Jue; Cai, Tianxi.

J Biomed Inform ; : 104685, 2024 Jul 12.

Article in English | MEDLINE | ID: mdl-39004109

ABSTRACT

BACKGROUND: Risk prediction plays a crucial role in planning for prevention, monitoring, and treatment. Electronic Health Records (EHRs) offer an expansive repository of temporal medical data encompassing both risk factors and outcome indicators essential for effective risk prediction. However, challenges emerge due to the lack of readily available gold-standard outcomes and the complex effects of various risk factors. Compounding these challenges are the false positives in diagnosis codes, and formidable task of pinpointing the onset timing in annotations. OBJECTIVE: We develop a Semi-supervised Double Deep Learning Temporal Risk Prediction (SeDDLeR) algorithm based on extensive unlabeled longitudinal Electronic Health Records (EHR) data augmented by a limited set of gold standard labels on the binary status information indicating whether the clinical event of interest occurred during the follow-up period. METHODS: The SeDDLeR algorithm calculates an individualized risk of developing future clinical events over time using each patient's baseline EHR features via the following steps: (1) construction of an initial EHR-derived surrogate as a proxy for the onset status; (2) deep learning calibration of the surrogate along gold-standard onset status; and (3) semi-supervised deep learning for risk prediction combining calibrated surrogates and gold-standard onset status. To account for missing onset time and heterogeneous follow-up, we introduce temporal kernel weighting. We devise a Gated Recurrent Units (GRUs) module to capture temporal characteristics. We subsequently assess our proposed SeDDLeR method in simulation studies and apply the method to the Massachusetts General Brigham (MGB) Biobank to predict type 2 diabetes (T2D) risk. RESULTS: SeDDLeR outperforms benchmark risk prediction methods, including Semi-parametric Transformation Model (STM) and DeepHit, with consistently best accuracy across experiments. SeDDLeR achieved the best C-statistics ( 0.815, SE 0.023; vs STM +.084, SE 0.030, P-value .004; vs DeepHit +.055, SE 0.027, P-value .024) and best average time-specific AUC (0.778, SE 0.022; vs STM + 0.059, SE 0.039, P-value .067; vs DeepHit + 0.168, SE 0.032, P-value <0.001) in the MGB T2D study. CONCLUSION: SeDDLeR can train robust risk prediction models in both real-world EHR and synthetic datasets with minimal requirements of labeling event times. It holds the potential to be incorporated for future clinical trial recruitment or clinical decision-making.

2.

Family history as the strongest predictor of aortic and peripheral aneurysms in patients with intracranial aneurysms.

Lai, Pui Man Rosalind; Akama-Garren, Elliot; Can, Anil; Tirado, Selena-Rae; Castro, Victor M; Dligach, Dmitriy; Finan, Sean; Gainer, Vivian S; Shadick, Nancy A; Savova, Guergana; Murphy, Shawn N; Cai, Tianxi; Weiss, Scott T; Du, Rose.

J Clin Neurosci ; 126: 128-134, 2024 Jun 12.

Article in English | MEDLINE | ID: mdl-38870642

ABSTRACT

OBJECTIVE: Intracranial aneurysms (IA) and aortic aneurysms (AA) are both abnormal dilations of arteries with familial predisposition and have been proposed to share co-prevalence and pathophysiology. Associations of IA and non-aortic peripheral aneurysms are less well-studied. The goal of the study was to understand the patterns of aortic and peripheral (extracranial) aneurysms in patients with IA, and risk factors associated with the development of these aneurysms. METHODS: 4701 patients were included in our retrospective analysis of all patients with intracranial aneurysms at our institution over the past 26 years. Patient demographics, comorbidities, and aneurysmal locations were analyzed. Univariate and multivariate analyses were performed to study associations with and without extracranial aneurysms. RESULTS: A total of 3.4% of patients (161 of 4701) with IA had at least one extracranial aneurysm. 2.8% had thoracic or abdominal aortic aneurysms. Age, male sex, hypertension, coronary artery disease, history of ischemic cerebral infarction, connective tissues disease, and family history of extracranial aneurysms in a 1st degree relative were associated with the presence of extracranial aneurysms and a higher number of extracranial aneurysms. In addition, family history of extracranial aneurysms in a second degree relative is associated with the presence of extracranial aneurysms and atrial fibrillation is associated with a higher number of extracranial aneurysms. CONCLUSION: Significant comorbidities are associated with extracranial aneurysms in patients with IA. Family history of extracranial aneurysms has the strongest association and suggests that IA patients with a family history of extracranial aneurysms may benefit from screening.

3.

Quantifying proportion of treatment effect by surrogate endpoint under heterogeneity.

Guo, Xinzhou; Bourgeois, Florence T; Cai, Tianxi.

Stat Methods Med Res ; : 9622802241247719, 2024 May 08.

Article in English | MEDLINE | ID: mdl-38717356

ABSTRACT

When the primary endpoints in randomized clinical trials require long term follow-up or are costly to measure, it is often desirable to assess treatment effects on surrogate instead of clinical endpoints. Prior to adopting a surrogate endpoint for such purposes, the extent of its surrogacy on the primary endpoint must be assessed. There is a rich statistical literature on assessing surrogacy in the overall population, much of which is based on quantifying the proportion of treatment effect on the primary endpoint that is explained by the treatment effect on the surrogate endpoint. However, the surrogacy of an endpoint may vary across different patient subgroups according to baseline demographic characteristics, and limited methods are currently available to assess overall surrogacy in the presence of potential surrogacy heterogeneity. In this paper, we propose methods that incorporate covariates for baseline information, such as age, to improve overall surrogacy assessment. We use flexible semi-non-parametric modeling strategies to adjust for covariate effects and derive a robust estimate for the proportion of treatment effect of the covariate-adjusted surrogate endpoint. Simulation results suggest that the adjusted surrogate endpoint has greater proportion of treatment effect compared to the unadjusted surrogate endpoint. We apply the proposed method to data from a clinical trial of infliximab and assess the adequacy of the surrogate endpoint in the presence of age heterogeneity.

4.

Assessing heterogeneity in surrogacy using censored data.

Parast, Layla; Tian, Lu; Cai, Tianxi.

Stat Med ; 43(17): 3184-3209, 2024 Jul 30.

Article in English | MEDLINE | ID: mdl-38812276

ABSTRACT

Determining whether a surrogate marker can be used to replace a primary outcome in a clinical study is complex. While many statistical methods have been developed to formally evaluate a surrogate marker, they generally do not provide a way to examine heterogeneity in the utility of a surrogate marker. Similar to treatment effect heterogeneity, where the effect of a treatment varies based on a patient characteristic, heterogeneity in surrogacy means that the strength or utility of the surrogate marker varies based on a patient characteristic. The few methods that have been recently developed to examine such heterogeneity cannot accommodate censored data. Studies with a censored outcome are typically the studies that could most benefit from a surrogate because the follow-up time is often long. In this paper, we develop a robust nonparametric approach to assess heterogeneity in the utility of a surrogate marker with respect to a baseline variable in a censored time-to-event outcome setting. In addition, we propose and evaluate a testing procedure to formally test for heterogeneity at a single time point or across multiple time points simultaneously. Finite sample performance of our estimation and testing procedure are examined in a simulation study. We use our proposed method to investigate the complex relationship between change in fasting plasma glucose, diabetes, and sex hormones using data from the diabetes prevention program study.

Subject(s)

Biomarkers , Blood Glucose , Computer Simulation , Humans , Biomarkers/blood , Blood Glucose/analysis , Female , Models, Statistical , Male , Gonadal Steroid Hormones/blood , Gonadal Steroid Hormones/therapeutic use , Statistics, Nonparametric , Data Interpretation, Statistical , Diabetes Mellitus

5.

Characterization of Post-COVID-19 Definitions and Clinical Coding Practices: Longitudinal Study.

Maripuri, Monika; Dey, Andrew; Honerlaw, Jacqueline; Hong, Chuan; Ho, Yuk-Lam; Tanukonda, Vidisha; Chen, Alicia W; Panickan, Vidul Ayakulangara; Wang, Xuan; Zhang, Harrison G; Yang, Doris; Samayamuthu, Malarkodi Jebathilagam; Morris, Michele; Visweswaran, Shyam; Beaulieu-Jones, Brendin; Ramoni, Rachel; Muralidhar, Sumitra; Gaziano, J Michael; Liao, Katherine; Xia, Zongqi; Brat, Gabriel A; Cai, Tianxi; Cho, Kelly.

Online J Public Health Inform ; 16: e53445, 2024 May 03.

Article in English | MEDLINE | ID: mdl-38700929

ABSTRACT

BACKGROUND: Post-COVID-19 condition (colloquially known as "long COVID-19") characterized as postacute sequelae of SARS-CoV-2 has no universal clinical case definition. Recent efforts have focused on understanding long COVID-19 symptoms, and electronic health record (EHR) data provide a unique resource for understanding this condition. The introduction of the International Classification of Diseases, Tenth Revision (ICD-10) code U09.9 for "Post COVID-19 condition, unspecified" to identify patients with long COVID-19 has provided a method of evaluating this condition in EHRs; however, the accuracy of this code is unclear. OBJECTIVE: This study aimed to characterize the utility and accuracy of the U09.9 code across 3 health care systems-the Veterans Health Administration, the Beth Israel Deaconess Medical Center, and the University of Pittsburgh Medical Center-against patients identified with long COVID-19 via a chart review by operationalizing the World Health Organization (WHO) and Centers for Disease Control and Prevention (CDC) definitions. METHODS: Patients who were COVID-19 positive with either a U07.1 ICD-10 code or positive polymerase chain reaction test within these health care systems were identified for chart review. Among this cohort, we sampled patients based on two approaches: (1) with a U09.9 code and (2) without a U09.9 code but with a new onset long COVID-19-related ICD-10 code, which allows us to assess the sensitivity of the U09.9 code. To operationalize the long COVID-19 definition based on health agency guidelines, symptoms were grouped into a "core" cluster of 11 commonly reported symptoms among patients with long COVID-19 and an extended cluster that captured all other symptoms by disease domain. Patients having ≥2 symptoms persisting for ≥60 days that were new onset after their COVID-19 infection, with ≥1 symptom in the core cluster, were labeled as having long COVID-19 per chart review. The code's performance was compared across 3 health care systems and across different time periods of the pandemic. RESULTS: Overall, 900 patient charts were reviewed across 3 health care systems. The prevalence of long COVID-19 among the cohort with the U09.9 ICD-10 code based on the operationalized WHO definition was between 23.2% and 62.4% across these health care systems. We also evaluated a less stringent version of the WHO definition and the CDC definition and observed an increase in the prevalence of long COVID-19 at all 3 health care systems. CONCLUSIONS: This is one of the first studies to evaluate the U09.9 code against a clinical case definition for long COVID-19, as well as the first to apply this definition to EHR data using a chart review approach on a nationwide cohort across multiple health care systems. This chart review approach can be implemented at other EHR systems to further evaluate the utility and performance of the U09.9 code.

6.

Heterogeneous associations between interleukin-6 receptor variants and phenotypes across ancestries and implications for therapy.

Wang, Xuan; Liu, Molei; Nogues, Isabelle-Emmanuella; Chen, Tony; Xiong, Xin; Bonzel, Clara-Lea; Zhang, Harrison; Hong, Chuan; Xia, Yin; Dahal, Kumar; Costa, Lauren; Cui, Jing; Gaziano, J Michael; Kim, Seoyoung C; Ho, Yuk-Lam; Cho, Kelly; Cai, Tianxi; Liao, Katherine P.

Sci Rep ; 14(1): 8021, 2024 04 05.

Article in English | MEDLINE | ID: mdl-38580710

ABSTRACT

The Phenome-Wide Association Study (PheWAS) is increasingly used to broadly screen for potential treatment effects, e.g., IL6R variant as a proxy for IL6R antagonists. This approach offers an opportunity to address the limited power in clinical trials to study differential treatment effects across patient subgroups. However, limited methods exist to efficiently test for differences across subgroups in the thousands of multiple comparisons generated as part of a PheWAS. In this study, we developed an approach that maximizes the power to test for heterogeneous genotype-phenotype associations and applied this approach to an IL6R PheWAS among individuals of African (AFR) and European (EUR) ancestries. We identified 29 traits with differences in IL6R variant-phenotype associations, including a lower risk of type 2 diabetes in AFR (OR 0.96) vs EUR (OR 1.0, p-value for heterogeneity = 8.5 × 10-3), and higher white blood cell count (p-value for heterogeneity = 8.5 × 10-131). These data suggest a more salutary effect of IL6R blockade for T2D among individuals of AFR vs EUR ancestry and provide data to inform ongoing clinical trials targeting IL6 for an expanding number of conditions. Moreover, the method to test for heterogeneity of associations can be applied broadly to other large-scale genotype-phenotype screens in diverse populations.

Subject(s)

Diabetes Mellitus, Type 2 , Humans , Diabetes Mellitus, Type 2/drug therapy , Diabetes Mellitus, Type 2/genetics , Genetic Association Studies , Phenotype , Polymorphism, Single Nucleotide , Receptors, Interleukin-6/genetics

7.

Neurological diagnoses in hospitalized COVID-19 patients associated with adverse outcomes: A multinational cohort study.

Hutch, Meghan R; Son, Jiyeon; Le, Trang T; Hong, Chuan; Wang, Xuan; Shakeri Hossein Abad, Zahra; Morris, Michele; Gutiérrez-Sacristán, Alba; Klann, Jeffrey G; Spiridou, Anastasia; Batugo, Ashley; Bellazzi, Riccardo; Benoit, Vincent; Bonzel, Clara-Lea; Bryant, William A; Chiudinelli, Lorenzo; Cho, Kelly; Das, Priyam; González González, Tomás; Hanauer, David A; Henderson, Darren W; Ho, Yuk-Lam; Loh, Ne Hooi Will; Makoudjou, Adeline; Makwana, Simran; Malovini, Alberto; Moal, Bertrand; Mowery, Danielle L; Neuraz, Antoine; Samayamuthu, Malarkodi Jebathilagam; Sanz Vidorreta, Fernando J; Schriver, Emily R; Schubert, Petra; Talbert, Jeffery; Tan, Amelia L M; Tan, Byorn W L; Tan, Bryce W Q; Tibollo, Valentina; Tippman, Patric; Verdy, Guillaume; Yuan, William; Avillach, Paul; Gehlenborg, Nils; Omenn, Gilbert S; Visweswaran, Shyam; Cai, Tianxi; Luo, Yuan; Xia, Zongqi.

PLOS Digit Health ; 3(4): e0000484, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38620037

ABSTRACT

Few studies examining the patient outcomes of concurrent neurological manifestations during acute COVID-19 leveraged multinational cohorts of adults and children or distinguished between central and peripheral nervous system (CNS vs. PNS) involvement. Using a federated multinational network in which local clinicians and informatics experts curated the electronic health records data, we evaluated the risk of prolonged hospitalization and mortality in hospitalized COVID-19 patients from 21 healthcare systems across 7 countries. For adults, we used a federated learning approach whereby we ran Cox proportional hazard models locally at each healthcare system and performed a meta-analysis on the aggregated results to estimate the overall risk of adverse outcomes across our geographically diverse populations. For children, we reported descriptive statistics separately due to their low frequency of neurological involvement and poor outcomes. Among the 106,229 hospitalized COVID-19 patients (104,031 patients ≥18 years; 2,198 patients <18 years, January 2020-October 2021), 15,101 (14%) had at least one CNS diagnosis, while 2,788 (3%) had at least one PNS diagnosis. After controlling for demographics and pre-existing conditions, adults with CNS involvement had longer hospital stay (11 versus 6 days) and greater risk of (Hazard Ratio = 1.78) and faster time to death (12 versus 24 days) than patients with no neurological condition (NNC) during acute COVID-19 hospitalization. Adults with PNS involvement also had longer hospital stay but lower risk of mortality than the NNC group. Although children had a low frequency of neurological involvement during COVID-19 hospitalization, a substantially higher proportion of children with CNS involvement died compared to those with NNC (6% vs 1%). Overall, patients with concurrent CNS manifestation during acute COVID-19 hospitalization faced greater risks for adverse clinical outcomes than patients without any neurological diagnosis. Our global informatics framework using a federated approach (versus a centralized data collection approach) has utility for clinical discovery beyond COVID-19.

8.

Centralized Interactive Phenomics Resource: an integrated online phenomics knowledgebase for health data users.

Honerlaw, Jacqueline; Ho, Yuk-Lam; Fontin, Francesca; Murray, Michael; Galloway, Ashley; Heise, David; Connatser, Keith; Davies, Laura; Gosian, Jeffrey; Maripuri, Monika; Russo, John; Sangar, Rahul; Tanukonda, Vidisha; Zielinski, Edward; Dubreuil, Maureen; Zimolzak, Andrew J; Panickan, Vidul A; Cheng, Su-Chun; Whitbourne, Stacey B; Gagnon, David R; Cai, Tianxi; Liao, Katherine P; Ramoni, Rachel B; Gaziano, J Michael; Muralidhar, Sumitra; Cho, Kelly.

J Am Med Inform Assoc ; 31(5): 1126-1134, 2024 Apr 19.

Article in English | MEDLINE | ID: mdl-38481028

ABSTRACT

OBJECTIVE: Development of clinical phenotypes from electronic health records (EHRs) can be resource intensive. Several phenotype libraries have been created to facilitate reuse of definitions. However, these platforms vary in target audience and utility. We describe the development of the Centralized Interactive Phenomics Resource (CIPHER) knowledgebase, a comprehensive public-facing phenotype library, which aims to facilitate clinical and health services research. MATERIALS AND METHODS: The platform was designed to collect and catalog EHR-based computable phenotype algorithms from any healthcare system, scale metadata management, facilitate phenotype discovery, and allow for integration of tools and user workflows. Phenomics experts were engaged in the development and testing of the site. RESULTS: The knowledgebase stores phenotype metadata using the CIPHER standard, and definitions are accessible through complex searching. Phenotypes are contributed to the knowledgebase via webform, allowing metadata validation. Data visualization tools linking to the knowledgebase enhance user interaction with content and accelerate phenotype development. DISCUSSION: The CIPHER knowledgebase was developed in the largest healthcare system in the United States and piloted with external partners. The design of the CIPHER website supports a variety of front-end tools and features to facilitate phenotype development and reuse. Health data users are encouraged to contribute their algorithms to the knowledgebase for wider dissemination to the research community, and to use the platform as a springboard for phenotyping. CONCLUSION: CIPHER is a public resource for all health data users available at https://phenomics.va.ornl.gov/ which facilitates phenotype reuse, development, and dissemination of phenotyping knowledge.

Subject(s)

Electronic Health Records , Phenomics , Phenotype , Knowledge Bases , Algorithms

9.

Semisupervised transfer learning for evaluation of model classification performance.

Wang, Linshanshan; Wang, Xuan; Liao, Katherine P; Cai, Tianxi.

Biometrics ; 80(1)2024 Jan 29.

Article in English | MEDLINE | ID: mdl-38465982

ABSTRACT

In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.

Subject(s)

Machine Learning , Supervised Machine Learning , Humans , ROC Curve , Research Design , Bias

10.

A rank-based approach to evaluate a surrogate marker in a small sample setting.

Parast, Layla; Cai, Tianxi; Tian, Lu.

Biometrics ; 80(1)2024 Jan 29.

Article in English | MEDLINE | ID: mdl-38386359

ABSTRACT

In clinical studies of chronic diseases, the effectiveness of an intervention is often assessed using "high cost" outcomes that require long-term patient follow-up and/or are invasive to obtain. While much progress has been made in the development of statistical methods to identify surrogate markers, that is, measurements that could replace such costly outcomes, they are generally not applicable to studies with a small sample size. These methods either rely on nonparametric smoothing which requires a relatively large sample size or rely on strict model assumptions that are unlikely to hold in practice and empirically difficult to verify with a small sample size. In this paper, we develop a novel rank-based nonparametric approach to evaluate a surrogate marker in a small sample size setting. The method developed in this paper is motivated by a small study of children with nonalcoholic fatty liver disease (NAFLD), a diagnosis for a range of liver conditions in individuals without significant history of alcohol intake. Specifically, we examine whether change in alanine aminotransferase (ALT; measured in blood) is a surrogate marker for change in NAFLD activity score (obtained by biopsy) in a trial, which compared Vitamin E ($n=50$) versus placebo ($n=46$) among children with NAFLD.

Subject(s)

Non-alcoholic Fatty Liver Disease , Child , Humans , Non-alcoholic Fatty Liver Disease/diagnosis , Biomarkers , Biopsy , Sample Size

11.

LATTE: Label-efficient incident phenotyping from longitudinal electronic health records.

Wen, Jun; Hou, Jue; Bonzel, Clara-Lea; Zhao, Yihan; Castro, Victor M; Gainer, Vivian S; Weisenfeld, Dana; Cai, Tianrun; Ho, Yuk-Lam; Panickan, Vidul A; Costa, Lauren; Hong, Chuan; Gaziano, J Michael; Liao, Katherine P; Lu, Junwei; Cho, Kelly; Cai, Tianxi.

Patterns (N Y) ; 5(1): 100906, 2024 Jan 12.

Article in English | MEDLINE | ID: mdl-38264714

ABSTRACT

Electronic health record (EHR) data are increasingly used to support real-world evidence studies but are limited by the lack of precise timings of clinical events. Here, we propose a label-efficient incident phenotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data. By leveraging the pre-trained semantic embeddings, LATTE selects predictive features and compresses their information into longitudinal visit embeddings through visit attention learning. LATTE models the sequential dependency between the target event and visit embeddings to derive the timings. To improve label efficiency, LATTE constructs longitudinal silver-standard labels from unlabeled patients to perform semi-supervised training. LATTE is evaluated on the onset of type 2 diabetes, heart failure, and relapses of multiple sclerosis. LATTE consistently achieves substantial improvements over benchmark methods while providing high prediction interpretability. The event timings are shown to help discover risk factors of heart failure among patients with rheumatoid arthritis.

12.

TAXN: Translate Align Extract Normalize, a Multilingual Extraction Tool for Clinical Texts.

Neuraz, Antoine; Lerner, Ivan; Birot, Olivier; Arias, Camila; Han, Larry; Bonzel, Clara Lea; Cai, Tianxi; Huynh, Kim Tam; Coulet, Adrien.

Stud Health Technol Inform ; 310: 649-653, 2024 Jan 25.

Article in English | MEDLINE | ID: mdl-38269889

ABSTRACT

Several studies have shown that about 80% of the medical information in an electronic health record is only available through unstructured data. Resources such as medical terminologies in languages other than English are limited and restrain the NLP tools. We propose here to leverage English based resources in other languages using a combination of translation, word alignment, entity extraction and term normalization (TAXN). We implement this extraction pipeline in an open-source library called "medkit". We demonstrate the interest of this approach through a specific use-case: enriching a phenotypic dictionary for post-acute sequelae in COVID-19 (PASC). TAXN proved to be efficient to propose new synonyms of UMLS terms using a corpus of 70 articles in French with 356 terms enriched with at least one validated new synonym. This study was based on freely available deep-learning models.

Subject(s)

Multilingualism , Humans , Language , Disease Progression , Electronic Health Records

13.

Statistical Methods to Evaluate Surrogate Markers.

Parast, Layla; Tian, Lu; Cai, Tianxi; Palaniappan, Latha.

Med Care ; 62(2): 102-108, 2024 Feb 01.

Article in English | MEDLINE | ID: mdl-38079232

ABSTRACT

BACKGROUND: There is tremendous interest in evaluating surrogate markers given their potential to decrease study time, costs, and patient burden. OBJECTIVES: The purpose of this statistical workshop article is to describe and illustrate how to evaluate a surrogate marker of interest using the proportion of treatment effect (PTE) explained as a measure of the quality of the surrogate marker for: (1) a setting with a general fully observed primary outcome (eg, biopsy score); and (2) a setting with a time-to-event primary outcome which may be censored due to study termination or early drop out (eg, time to diabetes). METHODS: The methods are motivated by 2 randomized trials, one among children with nonalcoholic fatty liver disease where the primary outcome was a change in biopsy score (general outcome) and another study among adults at high risk for Type 2 diabetes where the primary outcome was time to diabetes (time-to-event outcome). The methods are illustrated using the Rsurrogate package with a detailed R code provided. RESULTS: In the biopsy score outcome setting, the estimated PTE of the examined surrogate marker was 0.182 (95% confidence interval [CI]: 0.121, 0.240), that is, the surrogate explained only 18.2% of the treatment effect on the biopsy score. In the diabetes setting, the estimated PTE of the surrogate marker was 0.596 (95% CI: 0.404, 0.760), that is, the surrogate explained 59.6% of the treatment effect on diabetes incidence. CONCLUSIONS: This statistical workshop provides tools that will support future researchers in the evaluation of surrogate markers.

Subject(s)

Diabetes Mellitus, Type 2 , Child , Humans , Treatment Outcome , Biomarkers

14.

Knowledge-Driven Online Multimodal Automated Phenotyping System.

Xiong, Xin; Sweet, Sara Morini; Liu, Molei; Hong, Chuan; Bonzel, Clara-Lea; Panickan, Vidul Ayakulangara; Zhou, Doudou; Wang, Linshanshan; Costa, Lauren; Ho, Yuk-Lam; Geva, Alon; Mandl, Kenneth D; Cheng, Suchun; Xia, Zongqi; Cho, Kelly; Gaziano, J Michael; Liao, Katherine P; Cai, Tianxi; Cai, Tianrun.

medRxiv ; 2023 Oct 02.

Article in English | MEDLINE | ID: mdl-37873131

ABSTRACT

Though electronic health record (EHR) systems are a rich repository of clinical information with large potential, the use of EHR-based phenotyping algorithms is often hindered by inaccurate diagnostic records, the presence of many irrelevant features, and the requirement for a human-labeled training set. In this paper, we describe a knowledge-driven online multimodal automated phenotyping (KOMAP) system that i) generates a list of informative features by an online narrative and codified feature search engine (ONCE) and ii) enables the training of a multimodal phenotyping algorithm based on summary data. Powered by composite knowledge from multiple EHR sources, online article corpora, and a large language model, features selected by ONCE show high concordance with the state-of-the-art AI models (GPT4 and ChatGPT) and encourage large-scale phenotyping by providing a smaller but highly relevant feature set. Validation of the KOMAP system across four healthcare centers suggests that it can generate efficient phenotyping algorithms with robust performance. Compared to other methods requiring patient-level inputs and gold-standard labels, the fully online KOMAP provides a significant opportunity to enable multi-center collaboration.

15.

Can earlier biomarker measurements explain a treatment effect on diabetes incidence? A robust comparison of five surrogate markers.

Parast, Layla; Tian, Lu; Cai, Tianxi; Palaniappan, Latha P.

BMJ Open Diabetes Res Care ; 11(5)2023 10.

Article in English | MEDLINE | ID: mdl-37907279

ABSTRACT

INTRODUCTION: We measured and compared five individual surrogate markers-change from baseline to 1 year after randomization in hemoglobin A1c (HbA1c), fasting glucose, 2-hour postchallenge glucose, triglyceride-glucose index (TyG) index, and homeostatic model assessment of insulin resistance (HOMA-IR)-in terms of their ability to explain a treatment effect on reducing the risk of type 2 diabetes mellitus at 2, 3, and 4 years after treatment initiation. RESEARCH DESIGN AND METHODS: Study participants were from the Diabetes Prevention Program study, randomly assigned to either a lifestyle intervention (n=1023) or placebo (n=1030). The surrogate markers were measured at baseline and 1 year, and diabetes incidence was examined at 2, 3, and 4 years postrandomization. Surrogacy was evaluated using a robust model-free estimate of the proportion of treatment effect explained (PTE) by the surrogate marker. RESULTS: Across all time points, change in fasting glucose and HOMA-IR explained higher proportions of the treatment effect than 2-hour glucose, TyG index, or HbA1c. For example, at 2 years, glucose explained the highest (80.1%) proportion of the treatment effect, followed by HOMA-IR (77.7%), 2-hour glucose (76.2%), and HbA1c (74.6%); the TyG index explained the smallest (70.3%) proportion. CONCLUSIONS: These data suggest that, of the five examined surrogate markers, glucose and HOMA-IR were the superior surrogate markers in terms of PTE, compared with 2-hour glucose, HbA1c, and TyG index.

Subject(s)

Diabetes Mellitus, Type 2 , Insulin Resistance , Humans , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/prevention & control , Blood Glucose , Glycated Hemoglobin , Incidence , Biomarkers , Glucose

16.

Clinical phenotypes and outcomes in children with multisystem inflammatory syndrome across SARS-CoV-2 variant eras: a multinational study from the 4CE consortium.

Sperotto, Francesca; Gutiérrez-Sacristán, Alba; Makwana, Simran; Li, Xiudi; Rofeberg, Valerie N; Cai, Tianxi; Bourgeois, Florence T; Omenn, Gilbert S; Hanauer, David A; Sáez, Carlos; Bonzel, Clara-Lea; Bucholz, Emily; Dionne, Audrey; Elias, Matthew D; García-Barrio, Noelia; González, Tomás González; Issitt, Richard W; Kernan, Kate F; Laird-Gion, Jessica; Maidlow, Sarah E; Mandl, Kenneth D; Ahooyi, Taha Mohseni; Moraleda, Cinta; Morris, Michele; Moshal, Karyn L; Pedrera-Jiménez, Miguel; Shah, Mohsin A; South, Andrew M; Spiridou, Anastasia; Taylor, Deanne M; Verdy, Guillaume; Visweswaran, Shyam; Wang, Xuan; Xia, Zongqi; Zachariasse, Joany M; Newburger, Jane W; Avillach, Paul.

EClinicalMedicine ; 64: 102212, 2023 Oct.

Article in English | MEDLINE | ID: mdl-37745025

ABSTRACT

Background: Multisystem inflammatory syndrome in children (MIS-C) is a severe complication of SARS-CoV-2 infection. It remains unclear how MIS-C phenotypes vary across SARS-CoV-2 variants. We aimed to investigate clinical characteristics and outcomes of MIS-C across SARS-CoV-2 eras. Methods: We performed a multicentre observational retrospective study including seven paediatric hospitals in four countries (France, Spain, U.K., and U.S.). All consecutive confirmed patients with MIS-C hospitalised between February 1st, 2020, and May 31st, 2022, were included. Electronic Health Records (EHR) data were used to calculate pooled risk differences (RD) and effect sizes (ES) at site level, using Alpha as reference. Meta-analysis was used to pool data across sites. Findings: Of 598 patients with MIS-C (61% male, 39% female; mean age 9.7 years [SD 4.5]), 383 (64%) were admitted in the Alpha era, 111 (19%) in the Delta era, and 104 (17%) in the Omicron era. Compared with patients admitted in the Alpha era, those admitted in the Delta era were younger (ES -1.18 years [95% CI -2.05, -0.32]), had fewer respiratory symptoms (RD -0.15 [95% CI -0.33, -0.04]), less frequent non-cardiogenic shock or systemic inflammatory response syndrome (SIRS) (RD -0.35 [95% CI -0.64, -0.07]), lower lymphocyte count (ES -0.16 × 109/uL [95% CI -0.30, -0.01]), lower C-reactive protein (ES -28.5 mg/L [95% CI -46.3, -10.7]), and lower troponin (ES -0.14 ng/mL [95% CI -0.26, -0.03]). Patients admitted in the Omicron versus Alpha eras were younger (ES -1.6 years [95% CI -2.5, -0.8]), had less frequent SIRS (RD -0.18 [95% CI -0.30, -0.05]), lower lymphocyte count (ES -0.39 × 109/uL [95% CI -0.52, -0.25]), lower troponin (ES -0.16 ng/mL [95% CI -0.30, -0.01]) and less frequently received anticoagulation therapy (RD -0.19 [95% CI -0.37, -0.04]). Length of hospitalization was shorter in the Delta versus Alpha eras (-1.3 days [95% CI -2.3, -0.4]). Interpretation: Our study suggested that MIS-C clinical phenotypes varied across SARS-CoV-2 eras, with patients in Delta and Omicron eras being younger and less sick. EHR data can be effectively leveraged to identify rare complications of pandemic diseases and their variation over time. Funding: None.

17.

Endovascular Aneurysm Repair Devices as a Use Case for Postmarketing Surveillance of Medical Devices.

Wang, Xuan; Ayakulangara Panickan, Vidul; Cai, Tianrun; Xiong, Xin; Cho, Kelly; Cai, Tianxi; Bourgeois, Florence T.

JAMA Intern Med ; 183(10): 1090-1097, 2023 10 01.

Article in English | MEDLINE | ID: mdl-37603326

ABSTRACT

Importance: The US Food and Drug Administration (FDA) is building a national postmarketing surveillance system for medical devices, moving to a "total product life cycle" approach whereby more limited premarketing data are balanced with postmarketing surveillance to capture rare adverse events and long-term safety issues. Objective: To assess the methodological requirements and feasibility of postmarketing device surveillance using endovascular aneurysm repair devices (EVARs), which have been the subject of safety concerns, using clinical data from a large health care system. Design, Setting, and Participants: This retrospective cohort study included patients with electronic health record (EHR) data in the Veterans Affairs Corporate Data Warehouse. Exposure: Implantation of an AFX Endovascular AAA System (AFX) device (any of 3 iterations) or a non-AFX comparator EVAR device from January 1, 2011, to December 21, 2021. Main Outcomes and Measures: The primary outcomes were rates of type III endoleaks and all-cause mortality; and rates of these outcomes associated with AFX devices compared with non-AFX devices, assessed using Cox proportional hazards regression models and doubly robust causal modeling. Information on type III endoleaks was available only as free-text mentions in clinical notes, while all-cause mortality data could be extracted using structured data. Device-specific information required by the FDA is ascertained using unique device identifiers (UDIs), which include factors such as model numbers, catalog numbers, and manufacturer-specific product codes. The availability of UDIs in EHRs was assessed. Results: In total, 13â¯941 patients (mean [SD] age, 71.8 [7.4] years) received 1 of the devices of interest (AFX with Strata [AFX-S]: 718 patients [5.2%]; AFX with Duraply [AFX-D]: 404 patients [2.9%]; or AFX2: 682 patients [4.9%]), and 12 137 (87.1%) received non-AFX devices. The UDIs were not recorded in the EHR for any patient with an AFX device, and partial UDIs were available for 19 patients (0.1%) with a non-AFX device. This necessitated the development of advanced natural language processing tools to define the cohort of patients for analysis. The study identified a significantly higher risk of type III endoleaks at 5 years among patients receiving any of the AFX device iterations, including the most recent version, AFX2 (11.6%; 95% CI, 8.1%-15.1%) compared with that among patients with non-AFX devices (5.7%; 95% CI, 2.2%-9.2%; absolute risk difference, 5.9%; 95% CI, 2.3%-9.4%). However, there was no significantly higher all-cause mortality for any of the AFX device iterations, including for AFX2 (19.0%; 95% CI, 16.0%-22.0%) compared with non-AFX devices (18.0%; 95% CI, 15.0%-21.0%; absolute risk difference, 1.0%; 95% CI, -2.1% to 4.1%). Conclusions and Relevance: The findings of this cohort study suggest that clinical data can be used for the postmarketing device surveillance required by the FDA. The study also highlights ongoing challenges to performing larger-scale surveillance, including lack of consistent use of UDIs and insufficient relevant structured data to efficiently capture certain outcomes of interest.

Subject(s)

Aortic Aneurysm, Abdominal , Blood Vessel Prosthesis Implantation , Endovascular Procedures , Humans , Aged , Blood Vessel Prosthesis , Endoleak/etiology , Endovascular Aneurysm Repair , Aortic Aneurysm, Abdominal/etiology , Aortic Aneurysm, Abdominal/mortality , Aortic Aneurysm, Abdominal/surgery , Retrospective Studies , Cohort Studies , Treatment Outcome , Endovascular Procedures/adverse effects , Endovascular Procedures/instrumentation

18.

Semi-supervised calibration of noisy event risk (SCANER) with electronic health records.

Hong, Chuan; Liang, Liang; Yuan, Qianyu; Cho, Kelly; Liao, Katherine P; Pencina, Michael J; Christiani, David C; Cai, Tianxi.

J Biomed Inform ; 144: 104425, 2023 08.

Article in English | MEDLINE | ID: mdl-37331495

ABSTRACT

OBJECTIVE: Electronic health records (EHR), containing detailed longitudinal clinical information on a large number of patients and covering broad patient populations, open opportunities for comprehensive predictive modeling of disease progression and treatment response. However, since EHRs were originally constructed for administrative purposes not for research, in the EHR-linked studies, it is often not feasible to capture reliable information for analytical variables, especially in the survival setting, when both accurate event status and event times are needed for model building. For example, progression-free survival (PFS), a commonly used survival outcome for cancer patients, often involves complex information embedded in free-text clinical notes and cannot be extracted reliably. Proxies of PFS time such as time to the first mention of progression in the notes are at best good approximations to the true event time. This leads to difficulty in efficiently estimating event rates for an EHR patient cohort. Estimating survival rates based on error-prone outcome definitions can lead to biased results and hamper the power in the downstream analysis. On the other hand, extracting accurate event time information via manual annotation is time and resource intensive. The objective of this study is to develop a calibrated survival rate estimator using noisy outcomes from EHR data. MATERIALS AND METHODS: In this paper, we propose a two-stage semi-supervised calibration of noisy event rate (SCANER) estimator that can effectively overcome censoring induced dependency and attains more robust performance (i.e., not sensitive to misspecification of the imputation model) by fully utilizing both a small-labeled set of gold-standard survival outcomes annotated via manual chart review and a set of proxy features automatically captured via EHR in the unlabeled set. We validate the SCANER estimator by estimating the PFS rates for a virtual cohort of lung cancer patients from one large tertiary care center and the ICU-free survival rates for COVID patients from two large tertiary care centers. RESULTS: In terms of survival rate estimates, the SCANER had very similar point estimates compared to the complete-case Kaplan Meier estimator. On the other hand, other benchmark methods for comparison, which fail to account for the induced dependency between event time and the censoring time conditioning on surrogate outcomes, produced biased results across all three case studies. In terms of standard errors, the SCANER estimator was more efficient than the KM estimator, with up to 50% efficiency gain. CONCLUSION: The SCANER estimator achieves more efficient, robust, and accurate survival rate estimates compared to existing approaches. This promising new approach can also improve the resolution (i.e., granularity of event time) by using labels conditioning on multiple surrogates, particularly among less common or poorly coded conditions.

Subject(s)

COVID-19 , Lung Neoplasms , Humans , Electronic Health Records , Calibration , Survival Analysis

19.

Multimodal learning on graphs for disease relation extraction.

Lin, Yucong; Lu, Keming; Yu, Sheng; Cai, Tianxi; Zitnik, Marinka.

J Biomed Inform ; 143: 104415, 2023 07.

Article in English | MEDLINE | ID: mdl-37276949

ABSTRACT

Disease knowledge graphs have emerged as a powerful tool for artificial intelligence to connect, organize, and access diverse information about diseases. Relations between disease concepts are often distributed across multiple datasets, including unstructured plain text datasets and incomplete disease knowledge graphs. Extracting disease relations from multimodal data sources is thus crucial for constructing accurate and comprehensive disease knowledge graphs. We introduce REMAP, a multimodal approach for disease relation extraction. The REMAP machine learning approach jointly embeds a partial, incomplete knowledge graph and a medical language dataset into a compact latent vector space, aligning the multimodal embeddings for optimal disease relation extraction. Additionally, REMAP utilizes a decoupled model structure to enable inference in single-modal data, which can be applied under missing modality scenarios. We apply the REMAP approach to a disease knowledge graph with 96,913 relations and a text dataset of 1.24 million sentences. On a dataset annotated by human experts, REMAP improves language-based disease relation extraction by 10.0% (accuracy) and 17.2% (F1-score) by fusing disease knowledge graphs with language information. Furthermore, REMAP leverages text information to recommend new relationships in the knowledge graph, outperforming graph-based methods by 8.4% (accuracy) and 10.4% (F1-score). REMAP is a flexible multimodal approach for extracting disease relations by fusing structured knowledge and language information. This approach provides a powerful model to easily find, access, and evaluate relations between disease concepts.

Subject(s)

Artificial Intelligence , Machine Learning , Humans , Unified Medical Language System , Language , Natural Language Processing

20.

Utilizing biologic disease-modifying anti-rheumatic treatment sequences to subphenotype rheumatoid arthritis.

Das, Priyam; Weisenfeld, Dana; Dahal, Kumar; De, Debsurya; Feathers, Vivi; Coblyn, Jonathan S; Weinblatt, Michael E; Shadick, Nancy A; Cai, Tianxi; Liao, Katherine P.

Arthritis Res Ther ; 25(1): 93, 2023 06 02.

Article in English | MEDLINE | ID: mdl-37269020

ABSTRACT

BACKGROUND: Many patients with rheumatoid arthritis (RA) require a trial of multiple biologic disease-modifying anti-rheumatic drugs (bDMARDs) to control their disease. With the availability of several bDMARD options, the history of bDMARDs may provide an alternative approach to understanding subphenotypes of RA. The objective of this study was to determine whether there exist distinct clusters of RA patients based on bDMARD prescription history to subphenotype RA. METHODS: We studied patients from a validated electronic health record-based RA cohort with data from January 1, 2008, through July 31, 2019; all subjects prescribed ≥ 1 bDMARD or targeted synthetic (ts) DMARD were included. To determine whether subjects had similar b/tsDMARD sequences, the sequences were considered as a Markov chain over the state-space of 5 classes of b/tsDMARDs. The maximum likelihood estimator (MLE)-based approach was used to estimate the Markov chain parameters to determine the clusters. The EHR data of study subjects were further linked with a registry containing prospectively collected data for RA disease activity, i.e., clinical disease activity index (CDAI). As a proof of concept, we tested whether the clusters derived from b/tsDMARD sequences correlated with clinical measures, specifically differing trajectories of CDAI. RESULTS: We studied 2172 RA subjects, mean age 52 years, RA duration 3.4 years, and 62% seropositive. We observed 550 unique b/tsDMARD sequences and identified 4 main clusters: (1) TNFi persisters (65.7%), (2) TNFi and abatacept therapy (8.0%), (3) on rituximab or multiple b/tsDMARDs (12.7%), (4) prescribed multiple therapies with tocilizumab predominant (13.6%). Compared to the other groups, TNFi persisters had the most favorable trajectory of CDAI over time. CONCLUSION: We observed that RA subjects can be clustered based on the sequence of b/tsDMARD prescriptions over time and that the clusters were correlated with differing trajectories of disease activity over time. This study highlights an alternative approach to consider subphenotyping of patients with RA for studies aimed at understanding treatment response.

Subject(s)

Antirheumatic Agents , Arthritis, Rheumatoid , Biological Products , Humans , Middle Aged , Arthritis, Rheumatoid/drug therapy , Antirheumatic Agents/therapeutic use , Rituximab/therapeutic use , Abatacept/therapeutic use , Biological Products/therapeutic use

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL