Búsqueda | BVS CLAP/SMR-OPS/OMS

1.

Interactive network-based clustering and investigation of multimorbidity association matrices with associationSubgraphs.

Strayer, Nick; Zhang, Siwei; Yao, Lydia; Vessels, Tess; Bejan, Cosmin A; Hsi, Ryan S; Shirey-Rice, Jana K; Balko, Justin M; Johnson, Douglas B; Phillips, Elizabeth J; Bick, Alex; Edwards, Todd L; Velez Edwards, Digna R; Pulley, Jill M; Wells, Quinn S; Savona, Michael R; Cox, Nancy J; Roden, Dan M; Ruderfer, Douglas M; Xu, Yaomin.

Bioinformatics ; 39(1)2023 01 01.

Artículo en Inglés | MEDLINE | ID: mdl-36472455

RESUMEN

MOTIVATION: Making sense of networked multivariate association patterns is vitally important to many areas of high-dimensional analysis. Unfortunately, as the data-space dimensions grow, the number of association pairs increases in O(n2); this means that traditional visualizations such as heatmaps quickly become too complicated to parse effectively. RESULTS: Here, we present associationSubgraphs: a new interactive visualization method to quickly and intuitively explore high-dimensional association datasets using network percolation and clustering. The goal is to provide an efficient investigation of association subgraphs, each containing a subset of variables with stronger and more frequent associations among themselves than the remaining variables outside the subset, by showing the entire clustering dynamics and providing subgraphs under all possible cutoff values at once. Particularly, we apply associationSubgraphs to a phenome-wide multimorbidity association matrix generated from an electronic health record and provide an online, interactive demonstration for exploring multimorbidity subgraphs. AVAILABILITY AND IMPLEMENTATION: An R package implementing both the algorithm and visualization components of associationSubgraphs is available at https://github.com/tbilab/associationsubgraphs. Online documentation is available at https://prod.tbilab.org/associationsubgraphs_info/. A demo using a multimorbidity association matrix is available at https://prod.tbilab.org/associationsubgraphs-example/.

Asunto(s)

Multimorbilidad , Programas Informáticos , Algoritmos , Análisis por Conglomerados , Fenómica

2.

Improving Methods of Identifying Anaphylaxis for Medical Product Safety Surveillance Using Natural Language Processing and Machine Learning.

Carrell, David S; Gruber, Susan; Floyd, James S; Bann, Maralyssa A; Cushing-Haugen, Kara L; Johnson, Ron L; Graham, Vina; Cronkite, David J; Hazlehurst, Brian L; Felcher, Andrew H; Bejan, Cosmin A; Kennedy, Adee; Shinde, Mayura U; Karami, Sara; Ma, Yong; Stojanovic, Danijela; Zhao, Yueqin; Ball, Robert; Nelson, Jennifer C.

Am J Epidemiol ; 192(2): 283-295, 2023 02 01.

Artículo en Inglés | MEDLINE | ID: mdl-36331289

RESUMEN

We sought to determine whether machine learning and natural language processing (NLP) applied to electronic medical records could improve performance of automated health-care claims-based algorithms to identify anaphylaxis events using data on 516 patients with outpatient, emergency department, or inpatient anaphylaxis diagnosis codes during 2015-2019 in 2 integrated health-care institutions in the Northwest United States. We used one site's manually reviewed gold-standard outcomes data for model development and the other's for external validation based on cross-validated area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), and sensitivity. In the development site 154 (64%) of 239 potential events met adjudication criteria for anaphylaxis compared with 180 (65%) of 277 in the validation site. Logistic regression models using only structured claims data achieved a cross-validated AUC of 0.58 (95% CI: 0.54, 0.63). Machine learning improved cross-validated AUC to 0.62 (0.58, 0.66); incorporating NLP-derived covariates further increased cross-validated AUCs to 0.70 (0.66, 0.75) in development and 0.67 (0.63, 0.71) in external validation data. A classification threshold with cross-validated PPV of 79% and cross-validated sensitivity of 66% in development data had cross-validated PPV of 78% and cross-validated sensitivity of 56% in external data. Machine learning and NLP-derived data improved identification of validated anaphylaxis events.

Asunto(s)

Anafilaxia , Procesamiento de Lenguaje Natural , Humanos , Anafilaxia/diagnóstico , Anafilaxia/epidemiología , Aprendizaje Automático , Algoritmos , Servicio de Urgencia en Hospital , Registros Electrónicos de Salud

3.

Contraceptive exposure associates with urinary tract infection risk in a cohort of reproductive-age women: a case control study.

Lo, Claire; Abraham, Abin; Bejan, Cosmin A; Reasoner, Seth A; Davidson, Mario; Lipworth, Loren; Aronoff, David M.

Eur J Contracept Reprod Health Care ; 28(1): 17-22, 2023 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-36537554

RESUMEN

PURPOSE: Although non-barrier contraception is commonly prescribed, the risk of urinary tract infections (UTI) with contraceptive exposure is unclear. MATERIALS AND METHODS: Using data from Vanderbilt University Medical Centre's deidentified electronic health record (EHR), women ages 18-52 were randomly sampled and matched based on age and length of EHR. This case-control analysis tested for association between contraception exposure and outcome using UTI-positive (UTI+) as cases and upper respiratory infection+ (URI+) as controls. RESULTS: 24,563 UTI + cases (mean EHR: 64.2 months; mean age: 31.2 years) and 48,649 UTI-/URI + controls (mean EHR: 63.2 months; mean age: 31.9 years) were analysed. In the primary analysis, UTI risk was statistically significantly increased for the oral contraceptive pill (OCP; OR = 1.10 [95%CI = 1.02-1.11], p ≤ 0.05), intrauterine device (IUD; OR = 1.13 [95%CI = 1.04-1.23], p ≤ 0.05), etonogestrel implant (Nexplanon®; OR = 1.56 [95% CI = 1.24-1.96], p ≤ 0.05), and medroxyprogesterone acetate injectable (Depo-Provera®; OR = 2.16 [95%CI = 1.99-2.33], p ≤ 0.05) use compared to women not prescribed contraception. A secondary analysis that included any non-IUD contraception, which could serve as a proxy for sexual activity, demonstrated a small attenuation for the association between UTI and IUD (OR = 1.09 [95%CI = 0.98-1.21], p = 0.13). CONCLUSION: This study notes potential for a small increase in UTIs with contraceptive use. Prospective studies are required before this information is applied in clinical settings. CONDENSATION: Although non-barrier contraception is commonly prescribed, the risk of urinary tract infections (UTI) with contraceptive exposure is poorly understood. This large-cohort, case-control study notes potential for a small increase in UTIs with contraceptive use.

Asunto(s)

Anticonceptivos Femeninos , Infecciones Urinarias , Femenino , Humanos , Adulto , Adolescente , Adulto Joven , Persona de Mediana Edad , Estudios de Casos y Controles , Acetato de Medroxiprogesterona , Anticonceptivos Orales , Anticoncepción/efectos adversos , Infecciones Urinarias/epidemiología , Infecciones Urinarias/etiología , Anticonceptivos Femeninos/efectos adversos

4.

Dense phenotyping from electronic health records enables machine learning-based prediction of preterm birth.

Abraham, Abin; Le, Brian; Kosti, Idit; Straub, Peter; Velez-Edwards, Digna R; Davis, Lea K; Newton, J M; Muglia, Louis J; Rokas, Antonis; Bejan, Cosmin A; Sirota, Marina; Capra, John A.

BMC Med ; 20(1): 333, 2022 09 28.

Artículo en Inglés | MEDLINE | ID: mdl-36167547

RESUMEN

BACKGROUND: Identifying pregnancies at risk for preterm birth, one of the leading causes of worldwide infant mortality, has the potential to improve prenatal care. However, we lack broadly applicable methods to accurately predict preterm birth risk. The dense longitudinal information present in electronic health records (EHRs) is enabling scalable and cost-efficient risk modeling of many diseases, but EHR resources have been largely untapped in the study of pregnancy. METHODS: Here, we apply machine learning to diverse data from EHRs with 35,282 deliveries to predict singleton preterm birth. RESULTS: We find that machine learning models based on billing codes alone can predict preterm birth risk at various gestational ages (e.g., ROC-AUC = 0.75, PR-AUC = 0.40 at 28 weeks of gestation) and outperform comparable models trained using known risk factors (e.g., ROC-AUC = 0.65, PR-AUC = 0.25 at 28 weeks). Examining the patterns learned by the model reveals it stratifies deliveries into interpretable groups, including high-risk preterm birth subtypes enriched for distinct comorbidities. Our machine learning approach also predicts preterm birth subtypes (spontaneous vs. indicated), mode of delivery, and recurrent preterm birth. Finally, we demonstrate the portability of our approach by showing that the prediction models maintain their accuracy on a large, independent cohort (5978 deliveries) from a different healthcare system. CONCLUSIONS: By leveraging rich phenotypic and genetic features derived from EHRs, we suggest that machine learning algorithms have great potential to improve medical care during pregnancy. However, further work is needed before these models can be applied in clinical settings.

Asunto(s)

Nacimiento Prematuro , Algoritmos , Registros Electrónicos de Salud , Femenino , Edad Gestacional , Humanos , Recién Nacido , Aprendizaje Automático , Embarazo , Nacimiento Prematuro/diagnóstico , Nacimiento Prematuro/epidemiología

5.

Prediction of Future Health Care Utilization Through Note-extracted Psychosocial Factors.

Dorr, David A; Quiñones, Ana R; King, Taylor; Wei, Melissa Y; White, Kellee; Bejan, Cosmin A.

Med Care ; 60(8): 570-578, 2022 08 01.

Artículo en Inglés | MEDLINE | ID: mdl-35658116

RESUMEN

BACKGROUND: Persons with multimorbidity (≥2 chronic conditions) face an increased risk of poor health outcomes, especially as they age. Psychosocial factors such as social isolation, chronic stress, housing insecurity, and financial insecurity have been shown to exacerbate these outcomes, but are not routinely assessed during the clinical encounter. Our objective was to extract these concepts from chart notes using natural language processing and predict their impact on health care utilization for patients with multimorbidity. METHODS: A cohort study to predict the 1-year likelihood of hospitalizations and emergency department visits for patients 65+ with multimorbidity with and without psychosocial factors. Psychosocial factors were extracted from narrative notes; all other covariates were extracted from electronic health record data from a large academic medical center using validated algorithms and concept sets. Logistic regression was performed to predict the likelihood of hospitalization and emergency department visit in the next year. RESULTS: In all, 76,479 patients were eligible; the majority were White (89%), 54% were female, with mean age 73. Those with psychosocial factors were older, had higher baseline utilization, and more chronic illnesses. The 4 psychosocial factors all independently predicted future utilization (odds ratio=1.27-2.77, C -statistic=0.63). Accounting for demographics, specific conditions, and previous utilization, 3 of 4 of the extracted factors remained predictive (odds ratio=1.13-1.86) for future utilization. Compared with models with no psychosocial factors, they had improved discrimination. Individual predictions were mixed, with social isolation predicting depression and morbidity; stress predicting atherosclerotic cardiovascular disease onset; and housing insecurity predicting substance use disorder morbidity. DISCUSSION: Psychosocial factors are known to have adverse health impacts, but are rarely measured; using natural language processing, we extracted factors that identified a higher risk segment of older adults with multimorbidity. Combining these extraction techniques with other measures of social determinants may help catalyze population health efforts to address psychosocial factors to mitigate their health impacts.

Asunto(s)

Hospitalización , Aceptación de la Atención de Salud , Anciano , Enfermedad Crónica , Estudios de Cohortes , Servicio de Urgencia en Hospital , Femenino , Humanos , Masculino , Multimorbilidad , Aceptación de la Atención de Salud/psicología

6.

Phenotyping coronavirus disease 2019 during a global health pandemic: Lessons learned from the characterization of an early cohort.

DeLozier, Sarah; Bland, Harris T; McPheeters, Melissa; Wells, Quinn; Farber-Eger, Eric; Bejan, Cosmin A; Fabbri, Daniel; Rosenbloom, Trent; Roden, Dan; Johnson, Kevin B; Wei, Wei-Qi; Peterson, Josh; Bastarache, Lisa.

J Biomed Inform ; 117: 103777, 2021 05.

Artículo en Inglés | MEDLINE | ID: mdl-33838341

RESUMEN

From the start of the coronavirus disease 2019 (COVID-19) pandemic, researchers have looked to electronic health record (EHR) data as a way to study possible risk factors and outcomes. To ensure the validity and accuracy of research using these data, investigators need to be confident that the phenotypes they construct are reliable and accurate, reflecting the healthcare settings from which they are ascertained. We developed a COVID-19 registry at a single academic medical center and used data from March 1 to June 5, 2020 to assess differences in population-level characteristics in pandemic and non-pandemic years respectively. Median EHR length, previously shown to impact phenotype performance in type 2 diabetes, was significantly shorter in the SARS-CoV-2 positive group relative to a 2019 influenza tested group (median 3.1 years vs 8.7; Wilcoxon rank sum P = 1.3e-52). Using three phenotyping methods of increasing complexity (billing codes alone and domain-specific algorithms provided by an EHR vendor and clinical experts), common medical comorbidities were abstracted from COVID-19 EHRs, defined by the presence of a positive laboratory test (positive predictive value 100%, recall 93%). After combining performance data across phenotyping methods, we observed significantly lower false negative rates for those records billed for a comprehensive care visit (p = 4e-11) and those with complete demographics data recorded (p = 7e-5). In an early COVID-19 cohort, we found that phenotyping performance of nine common comorbidities was influenced by median EHR length, consistent with previous studies, as well as by data density, which can be measured using portable metrics including CPT codes. Here we present those challenges and potential solutions to creating deeply phenotyped, acute COVID-19 cohorts.

Asunto(s)

COVID-19/diagnóstico , Registros Electrónicos de Salud , Fenotipo , Comorbilidad , Diabetes Mellitus Tipo 2 , Salud Global , Humanos , Gripe Humana , Funciones de Verosimilitud , Pandemias

7.

HLA-A*32:01 is strongly associated with vancomycin-induced drug reaction with eosinophilia and systemic symptoms.

Konvinse, Katherine C; Trubiano, Jason A; Pavlos, Rebecca; James, Ian; Shaffer, Christian M; Bejan, Cosmin A; Schutte, Ryan J; Ostrov, David A; Pilkinton, Mark A; Rosenbach, Misha; Zwerner, Jeffrey P; Williams, Kristina B; Bourke, Jack; Martinez, Patricia; Rwandamuriye, Francois; Chopra, Abha; Watson, Mark; Redwood, Alec J; White, Katie D; Mallal, Simon A; Phillips, Elizabeth J.

J Allergy Clin Immunol ; 144(1): 183-192, 2019 07.

Artículo en Inglés | MEDLINE | ID: mdl-30776417

RESUMEN

BACKGROUND: Vancomycin is a prevalent cause of the severe hypersensitivity syndrome drug reaction with eosinophilia and systemic symptoms (DRESS), which leads to significant morbidity and mortality and commonly occurs in the setting of combination antibiotic therapy, affecting future treatment choices. Variations in HLA class I in particular have been associated with serious T cell-mediated adverse drug reactions, which has led to preventive screening strategies for some drugs. OBJECTIVE: We sought to determine whether variation in the HLA region is associated with vancomycin-induced DRESS. METHODS: Probable vancomycin-induced DRESS cases were matched 1:2 with tolerant control subjects based on sex, race, and age by using BioVU, Vanderbilt's deidentified electronic health record database. Associations between DRESS and carriage of HLA class I and II alleles were assessed by means of conditional logistic regression. An extended sample set from BioVU was used to conduct a time-to-event analysis of those exposed to vancomycin with and without the identified HLA risk allele. RESULTS: Twenty-three subjects met the inclusion criteria for vancomycin-associated DRESS. Nineteen (82.6%) of 23 cases carried HLA-A*32:01 compared with 0 (0%) of 46 of the matched vancomycin-tolerant control subjects (P = 1 × 10-8) and 6.3% of the BioVU population (n = 54,249, P = 2 × 10-16). Time-to-event analysis of DRESS development during vancomycin treatment among the HLA-A*32:01-positive group indicated that 19.2% had DRESS and did so within 4 weeks. CONCLUSIONS: HLA-A*32:01 is strongly associated with vancomycin-induced DRESS in a population of predominantly European ancestry. HLA-A*32:01 testing could improve antibiotic safety, help implicate vancomycin as the causal drug, and preserve future treatment options with coadministered antibiotics.

Asunto(s)

Antibacterianos/efectos adversos , Síndrome de Hipersensibilidad a Medicamentos/inmunología , Antígenos HLA-A/inmunología , Vancomicina/efectos adversos , Adolescente , Adulto , Anciano , Antibacterianos/química , Síndrome de Hipersensibilidad a Medicamentos/etiología , Femenino , Antígenos HLA-A/química , Humanos , Masculino , Persona de Mediana Edad , Simulación del Acoplamiento Molecular , Vancomicina/química , Adulto Joven

8.

Building bridges across electronic health record systems through inferred phenotypic topics.

Chen, You; Ghosh, Joydeep; Bejan, Cosmin Adrian; Gunter, Carl A; Gupta, Siddharth; Kho, Abel; Liebovitz, David; Sun, Jimeng; Denny, Joshua; Malin, Bradley.

J Biomed Inform ; 55: 82-93, 2015 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-25841328

RESUMEN

OBJECTIVE: Data in electronic health records (EHRs) is being increasingly leveraged for secondary uses, ranging from biomedical association studies to comparative effectiveness. To perform studies at scale and transfer knowledge from one institution to another in a meaningful way, we need to harmonize the phenotypes in such systems. Traditionally, this has been accomplished through expert specification of phenotypes via standardized terminologies, such as billing codes. However, this approach may be biased by the experience and expectations of the experts, as well as the vocabulary used to describe such patients. The goal of this work is to develop a data-driven strategy to (1) infer phenotypic topics within patient populations and (2) assess the degree to which such topics facilitate a mapping across populations in disparate healthcare systems. METHODS: We adapt a generative topic modeling strategy, based on latent Dirichlet allocation, to infer phenotypic topics. We utilize a variance analysis to assess the projection of a patient population from one healthcare system onto the topics learned from another system. The consistency of learned phenotypic topics was evaluated using (1) the similarity of topics, (2) the stability of a patient population across topics, and (3) the transferability of a topic across sites. We evaluated our approaches using four months of inpatient data from two geographically distinct healthcare systems: (1) Northwestern Memorial Hospital (NMH) and (2) Vanderbilt University Medical Center (VUMC). RESULTS: The method learned 25 phenotypic topics from each healthcare system. The average cosine similarity between matched topics across the two sites was 0.39, a remarkably high value given the very high dimensionality of the feature space. The average stability of VUMC and NMH patients across the topics of two sites was 0.988 and 0.812, respectively, as measured by the Pearson correlation coefficient. Also the VUMC and NMH topics have smaller variance of characterizing patient population of two sites than standard clinical terminologies (e.g., ICD9), suggesting they may be more reliably transferred across hospital systems. CONCLUSIONS: Phenotypic topics learned from EHR data can be more stable and transferable than billing codes for characterizing the general status of a patient population. This suggests that EHR-based research may be able to leverage such phenotypic topics as variables when pooling patient populations in predictive models.

Asunto(s)

Registros Electrónicos de Salud/organización & administración , Almacenamiento y Recuperación de la Información/métodos , Aprendizaje Automático , Registro Médico Coordinado/métodos , Vocabulario Controlado , Registros Electrónicos de Salud/clasificación , Procesamiento de Lenguaje Natural , Fenotipo , Estados Unidos

9.

Evaluation of Genetic Associations with Clinical Phenotypes of Kidney Stone Disease.

Hsi, Ryan S; Zhang, Siwei; Triozzi, Jefferson L; Hung, Adriana M; Xu, Yaomin; Bejan, Cosmin A.

Eur Urol Open Sci ; 67: 38-44, 2024 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-39156495

RESUMEN

Background and objective: Previous studies have reported a strong genetic contribution to kidney stone risk. This study aims to identify genetic associations of kidney stone disease within a large-scale electronic health record system. Methods: We performed genome-wide association studies (GWASs) for nephrolithiasis from genotyped samples of 5571 cases and 83 692 controls. This analysis included a primary GWAS focused on nephrolithiasis and subsequent subgroup GWASs stratified by stone composition types. For significant risk variants, we performed association analyses with stone composition and first-time 24-h urine parameters. To assess disease severity, we investigated the associations with age at first stone diagnosis, age at first stone-related procedure, and time between first and second stone-related procedures. Key findings and limitations: The primary GWAS analysis identified ten significant loci, all located on chromosome 16 within coding regions of the UMOD gene. The strongest signal was rs28544423 (odds ratio 1.17, 95% confidence interval 1.11-1.23, p = 2.7 × 10-9). In subgroup GWASs stratified by six kidney stone composition subtypes, 19 significant loci were identified including two loci in coding regions (brushite; NXPH1, rs79970906 and rs4725104). The UMOD single nucleotide polymorphism rs28544423 was associated with differences in 24-h excretion of urinary analytes, and the minor allele was positively associated with calcium oxalate dihydrate stone composition (p < 0.05). No associations were found between UMOD variants and disease severity. Limitations include an omitted variable bias and a misclassification bias. Conclusions and clinical implications: We replicated germline variants associated with kidney stone disease risk at UMOD and reported novel variants associated with stone composition. Genetic variants of UMOD are associated with differences in 24-h urine parameters and stone composition, but not disease severity. Patient summary: We identify genetic variants linked to kidney stone disease within an electronic health record (EHR) system. These findings suggest a role for the EHR to enable a precision-medicine approach for stone disease.

10.

Evaluation of genetic associations with clinical phenotypes of kidney stone disease.

Hsi, Ryan S; Zhang, Siwei; Triozzi, Jefferson L; Hung, Adriana M; Xu, Yaomin; Bejan, Cosmin A.

medRxiv ; 2024 Jan 22.

Artículo en Inglés | MEDLINE | ID: mdl-38343797

RESUMEN

Introduction and Objective: We sought to replicate and discover genetic associations of kidney stone disease within a large-scale electronic health record (EHR) system. Methods: We performed genome-wide association studies (GWASs) for nephrolithiasis from genotyped samples of 5,571 cases and 83,692 controls. Among the significant risk variants, we performed association analyses of stone composition and first-time 24-hour urine parameters. To assess disease severity, we investigated the associations of risk variants with age at first stone diagnosis, age at first procedure, and time from first to second procedure. Results: The main GWAS analysis identified 10 significant loci, each located on chromosome 16 within coding regions of the UMOD gene, which codes for uromodulin, a urine protein with inhibitory activity for calcium crystallization. The strongest signal was from SNP 16:20359633-C-T (odds ratio [OR] 1.17, 95% CI 1.11-1.23), with the remaining significant SNPs having similar effect sizes. In subgroup GWASs by stone composition, 19 significant loci were identified, of which two loci were located in coding regions (brushite; NXPH1 , rs79970906 and rs4725104). The UMOD SNP 16:20359633-C-T was associated with differences in 24-hour excretion of urinary calcium, uric acid, phosphorus, sulfate; and the minor allele was positively associated with calcium oxalate dihydrate stone composition (p<0.05). No associations were found between UMOD variants and disease severity. Conclusions: We replicated germline variants associated with kidney stone disease risk at UMOD and reported novel variants associated with stone composition. Genetic variants of UMOD are associated with differences in 24-hour urine parameters and stone composition, but not disease severity.

11.

Driver mutation zygosity is a critical factor in predicting clonal hematopoiesis transformation risk.

Kishtagari, Ashwin; Khan, M A Wasay; Li, Yajing; Vlasschaert, Caitlyn; Marneni, Naimisha; Silver, Alexander J; von Beck, Kelly; Spaulding, Travis; Stockton, Shannon; Snider, Christina; Sochacki, Andrew; Dorand, Dixon; Mack, Taralynn M; Ferrell, P Brent; Xu, Yaomin; Bejan, Cosmin A; Savona, Michael R; Bick, Alexander G.

Blood Cancer J ; 14(1): 6, 2024 01 15.

Artículo en Inglés | MEDLINE | ID: mdl-38225345

RESUMEN

Clonal hematopoiesis (CH) can be caused by either single gene mutations (eg point mutations in JAK2 causing CHIP) or mosaic chromosomal alterations (e.g., loss of heterozygosity at chromosome 9p). CH is associated with a significantly increased risk of hematologic malignancies. However, the absolute rate of transformation on an annualized basis is low. Improved prognostication of transformation risk is urgently needed for routine clinical practice. We hypothesized that the co-occurrence of CHIP and mCAs at the same locus (e.g., transforming a heterozygous JAK2 CHIP mutation into a homozygous mutation through concomitant loss of heterozygosity at chromosome 9) might have important prognostic implications for malignancy transformation risk. We tested this hypothesis using our discovery cohort, the UK Biobank (n = 451,180), and subsequently validated it in the BioVU cohort (n = 91,335). We find that individuals with a concurrent somatic mutation and mCA were at significantly increased risk of hematologic malignancy (for example, In BioVU cohort incidence of hematologic malignancies is higher in individuals with co-occurring JAK2 V617F and 9p CN-LOH; HR = 54.76, 95% CI = 33.92-88.41, P < 0.001 vs. JAK2 V617F alone; HR = 44.05, 95% CI = 35.06-55.35, P < 0.001). Currently, the 'zygosity' of the CHIP mutation is not routinely reported in clinical assays or considered in prognosticating CHIP transformation risk. Based on these observations, we propose that clinical reports should include 'zygosity' status of CHIP mutations and that future prognostication systems should take mutation 'zygosity' into account.

Asunto(s)

Hematopoyesis Clonal , Neoplasias Hematológicas , Humanos , Mutación , Mutación Puntual , Aberraciones Cromosómicas , Neoplasias Hematológicas/genética

12.

Cost-Effective and Scalable Clonal Hematopoiesis Assay Provides Insight into Clonal Dynamics.

Mack, Taralynn; Vlasschaert, Caitlyn; von Beck, Kelly; Silver, Alexander J; Heimlich, J Brett; Poisner, Hannah; Condon, Henry R; Ulloa, Jessica; Sochacki, Andrew L; Spaulding, Travis P; Kishtagari, Ashwin; Bejan, Cosmin A; Xu, Yaomin; Savona, Michael R; Jones, Angela; Bick, Alexander G.

J Mol Diagn ; 26(7): 563-573, 2024 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-38588769

RESUMEN

Clonal hematopoiesis of indeterminate potential (CHIP) is a common age-related phenomenon in which hematopoietic stem cells acquire mutations in a select set of genes commonly mutated in myeloid neoplasia which then expand clonally. Current sequencing assays to detect CHIP mutations are not optimized for the detection of these variants and can be cost-prohibitive when applied to large cohorts or to serial sequencing. In this study, an affordable (approximately US $8 per sample), accurate, and scalable sequencing assay for CHIP is introduced and validated. The efficacy of the assay was demonstrated by identifying CHIP mutations in a cohort of 456 individuals with DNA collected at multiple time points in Vanderbilt University's biobank and quantifying clonal expansion rates over time. A total of 101 individuals with CHIP/clonal cytopenia of undetermined significance were identified, and individual-level clonal expansion rate was calculated using the variant allele fraction at both time points. Differences in clonal expansion rate by driver gene were observed, but there was also significant individual-level heterogeneity, emphasizing the multifactorial nature of clonal expansion. Additionally, mutation co-occurrence and clonal competition between multiple driver mutations were explored.

Asunto(s)

Hematopoyesis Clonal , Mutación , Humanos , Hematopoyesis Clonal/genética , Masculino , Femenino , Anciano , Persona de Mediana Edad , Adulto , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/economía , Análisis Costo-Beneficio , Células Madre Hematopoyéticas/metabolismo , Células Madre Hematopoyéticas/citología , Evolución Clonal/genética , Anciano de 80 o más Años , Hematopoyesis/genética

13.

Overcome the Limitation of Phenome-Wide Association Studies (PheWAS): Extension of PheWAS to Efficient and Robust Large-Scale ICD Codes Analysis.

Lin, Ya-Chen; Zhang, Siwei; Vessels, Tess; Bastarache, Lisa; Bejan, Cosmin Adrian; Hsie, Ryan S; Philips, Elizabeth J; Ruderfer, Doug M; Pulley, Jill M; Edwards, Todd L; Wells, Quinn S; Warner, Jeremy L; Denny, Joshua C; Roden, Dan M; Kang, Hakmook; Xu, Yaomin.

medRxiv ; 2024 Apr 19.

Artículo en Inglés | MEDLINE | ID: mdl-38699370

RESUMEN

The Phenome-wide association studies (PheWAS) have become widely used for efficient, high-throughput evaluation of relationship between a genetic factor and a large number of disease phenotypes, typically extracted from a DNA biobank linked with electronic medical records (EMR). Phecodes, billing code-derived disease case-control status, are usually used as outcome variables in PheWAS and logistic regression has been the standard choice of analysis method. Since the clinical diagnoses in EMR are often inaccurate with errors which can lead to biases in the odds ratio estimates, much effort has been put to accurately define the cases and controls to ensure an accurate analysis. Specifically in order to correctly classify controls in the population, an exclusion criteria list for each Phecode was manually compiled to obtain unbiased odds ratios. However, the accuracy of the list cannot be guaranteed without extensive data curation process. The costly curation process limits the efficiency of large-scale analyses that take full advantage of all structured phenotypic information available in EMR. Here, we proposed to estimate relative risks (RR) instead. We first demonstrated the desired nature of RR that overcomes the inaccuracy in the controls via theoretical formula. With simulation and real data application, we further confirmed that RR is unbiased without compiling exclusion criteria lists. With RR as estimates, we are able to efficiently extend PheWAS to a larger-scale, phenome construction agnostic analysis of phenotypes, using ICD 9/10 codes, which preserve much more disease-related clinical information than Phecodes.

14.

Polygenic risk score for ulcerative colitis predicts immune checkpoint inhibitor-mediated colitis.

Middha, Pooja; Thummalapalli, Rohit; Betti, Michael J; Yao, Lydia; Quandt, Zoe; Balaratnam, Karmugi; Bejan, Cosmin A; Cardenas, Eduardo; Falcon, Christina J; Faleck, David M; Gubens, Matthew A; Huntsman, Scott; Johnson, Douglas B; Kachuri, Linda; Khan, Khaleeq; Li, Min; Lovly, Christine M; Murray, Megan H; Patel, Devalben; Werking, Kristin; Xu, Yaomin; Zhan, Luna Jia; Balko, Justin M; Liu, Geoffrey; Aldrich, Melinda C; Schoenfeld, Adam J; Ziv, Elad.

Nat Commun ; 15(1): 2568, 2024 Mar 26.

Artículo en Inglés | MEDLINE | ID: mdl-38531883

RESUMEN

Immune checkpoint inhibitor-mediated colitis (IMC) is a common adverse event of treatment with immune checkpoint inhibitors (ICI). We hypothesize that genetic susceptibility to Crohn's disease (CD) and ulcerative colitis (UC) predisposes to IMC. In this study, we first develop a polygenic risk scores for CD (PRSCD) and UC (PRSUC) in cancer-free individuals and then test these PRSs on IMC in a cohort of 1316 patients with ICI-treated non-small cell lung cancer and perform a replication in 873 ICI-treated pan-cancer patients. In a meta-analysis, the PRSUC predicts all-grade IMC (ORmeta=1.35 per standard deviation [SD], 95% CI = 1.12-1.64, P = 2×10-03) and severe IMC (ORmeta=1.49 per SD, 95% CI = 1.18-1.88, P = 9×10-04). PRSCD is not associated with IMC. Furthermore, PRSUC predicts severe IMC among patients treated with combination ICIs (ORmeta=2.20 per SD, 95% CI = 1.07-4.53, P = 0.03). Overall, PRSUC can identify patients receiving ICI at risk of developing IMC and may be useful to monitor patients and improve patient outcomes.

Asunto(s)

Carcinoma de Pulmón de Células no Pequeñas , Colitis Ulcerosa , Colitis , Enfermedad de Crohn , Neoplasias Pulmonares , Humanos , Colitis Ulcerosa/genética , Inhibidores de Puntos de Control Inmunológico , Puntuación de Riesgo Genético , Enfermedad de Crohn/genética

15.

PheMIME: an interactive web app and knowledge base for phenome-wide, multi-institutional multimorbidity analysis.

Zhang, Siwei; Strayer, Nick; Vessels, Tess; Choi, Karmel; Wang, Geoffrey W; Li, Yajing; Bejan, Cosmin A; Hsi, Ryan S; Bick, Alexander G; Velez Edwards, Digna R; Savona, Michael R; Phillips, Elizabeth J; Pulley, Jill M; Self, Wesley H; Hopkins, Wilkins Consuelo; Roden, Dan M; Smoller, Jordan W; Ruderfer, Douglas M; Xu, Yaomin.

J Am Med Inform Assoc ; 2024 Aug 10.

Artículo en Inglés | MEDLINE | ID: mdl-39127052

RESUMEN

OBJECTIVES: To address the need for interactive visualization tools and databases in characterizing multimorbidity patterns across different populations, we developed the Phenome-wide Multi-Institutional Multimorbidity Explorer (PheMIME). This tool leverages three large-scale EHR systems to facilitate efficient analysis and visualization of disease multimorbidity, aiming to reveal both robust and novel disease associations that are consistent across different systems and to provide insight for enhancing personalized healthcare strategies. MATERIALS AND METHODS: PheMIME integrates summary statistics from phenome-wide analyses of disease multimorbidities, utilizing data from Vanderbilt University Medical Center, Mass General Brigham, and the UK Biobank. It offers interactive and multifaceted visualizations for exploring multimorbidity. Incorporating an enhanced version of associationSubgraphs, PheMIME also enables dynamic analysis and inference of disease clusters, promoting the discovery of complex multimorbidity patterns. A case study on schizophrenia demonstrates its capability for generating interactive visualizations of multimorbidity networks within and across multiple systems. Additionally, PheMIME supports diverse multimorbidity-based discoveries, detailed further in online case studies. RESULTS: The PheMIME is accessible at https://prod.tbilab.org/PheMIME/. A comprehensive tutorial and multiple case studies for demonstration are available at https://prod.tbilab.org/PheMIME_supplementary_materials/. The source code can be downloaded from https://github.com/tbilab/PheMIME. DISCUSSION: PheMIME represents a significant advancement in medical informatics, offering an efficient solution for accessing, analyzing, and interpreting the complex and noisy real-world patient data in electronic health records. CONCLUSION: PheMIME provides an extensive multimorbidity knowledge base that consolidates data from three EHR systems, and it is a novel interactive tool designed to analyze and visualize multimorbidities across multiple EHR datasets. It stands out as the first of its kind to offer extensive multimorbidity knowledge integration with substantial support for efficient online analysis and interactive visualization.

16.

Interoperability of phenome-wide multimorbidity patterns: a comparative study of two large-scale EHR systems.

Strayer, Nick; Vessels, Tess; Choi, Karmel; Zhang, Siwei; Li, Yajing; Han, Lide; Sharber, Brian; Hsi, Ryan S; Bejan, Cosmin A; Bick, Alexander G; Balko, Justin M; Johnson, Douglas B; Wheless, Lee E; Wells, Quinn S; Philips, Elizabeth J; Pulley, Jill M; Self, Wesley H; Chen, Qingxia; Hartert, Tina; Wilkins, Consuelo H; Savona, Michael R; Shyr, Yu; Roden, Dan M; Smoller, Jordan W; Ruderfer, Douglas M; Xu, Yaomin.

medRxiv ; 2024 May 27.

Artículo en Inglés | MEDLINE | ID: mdl-38585743

RESUMEN

Background: Electronic health records (EHR) are increasingly used for studying multimorbidities. However, concerns about accuracy, completeness, and EHRs being primarily designed for billing and administrative purposes raise questions about the consistency and reproducibility of EHR-based multimorbidity research. Methods: Utilizing phecodes to represent the disease phenome, we analyzed pairwise comorbidity strengths using a dual logistic regression approach and constructed multimorbidity as an undirected weighted graph. We assessed the consistency of the multimorbidity networks within and between two major EHR systems at local (nodes and edges), meso (neighboring patterns), and global (network statistics) scales. We present case studies to identify disease clusters and uncover clinically interpretable disease relationships. We provide an interactive web tool and a knowledge base combining data from multiple sources for online multimorbidity analysis. Findings: Analyzing data from 500,000 patients across Vanderbilt University Medical Center and Mass General Brigham health systems, we observed a strong correlation in disease frequencies (Kendall's τ = 0.643) and comorbidity strengths (Pearson ρ = 0.79). Consistent network statistics across EHRs suggest similar structures of multimorbidity networks at various scales. Comorbidity strengths and similarities of multimorbidity connection patterns align with the disease genetic correlations. Graph-theoretic analyses revealed a consistent core-periphery structure, implying efficient network clustering through threshold graph construction. Using hydronephrosis as a case study, we demonstrated the network's ability to uncover clinically relevant disease relationships and provide novel insights. Interpretation: Our findings demonstrate the robustness of large-scale EHR data for studying phenome-wide multimorbidities. The alignment of multimorbidity patterns with genetic data suggests the potential utility for uncovering shared biology of diseases. The consistent core-periphery structure offers analytical insights to discover complex disease interactions. This work also sets the stage for advanced disease modeling, with implications for precision medicine. Funding: VUMC Biostatistics Development Award, the National Institutes of Health, and the VA CSRD.

17.

Defining Suicidal Thought and Behavior Phenotypes for Genetic Studies.

Monson, Eric T; Colbert, Sarah M C; Andreassen, Ole A; Ayinde, Olatunde O; Bejan, Cosmin A; Ceja, Zuriel; Coon, Hilary; DiBlasi, Emily; Izotova, Anastasia; Kaufman, Erin A; Koromina, Maria; Myung, Woojae; Nurnberger, John I; Serretti, Alessandro; Smoller, Jordan W; Stein, Murray B; Zai, Clement C; Aslan, Mihaela; Barr, Peter B; Bigdeli, Tim B; Harvey, Philip D; Kimbrel, Nathan A; Patel, Pujan R; Ruderfer, Douglas; Docherty, Anna R; Mullins, Niamh; Mann, J John.

medRxiv ; 2024 Jul 29.

Artículo en Inglés | MEDLINE | ID: mdl-39132474

RESUMEN

Background: Standardized definitions of suicidality phenotypes, including suicidal ideation (SI), attempt (SA), and death (SD) are a critical step towards improving understanding and comparison of results in suicide research. The complexity of suicidality contributes to heterogeneity in phenotype definitions, impeding evaluation of clinical and genetic risk factors across studies and efforts to combine samples within consortia. Here, we present expert and data-supported recommendations for defining suicidality and control phenotypes to facilitate merging current/legacy samples with definition variability and aid future sample creation. Methods: A subgroup of clinician researchers and experts from the Suicide Workgroup of the Psychiatric Genomics Consortium (PGC) reviewed existing PGC definitions for SI, SA, SD, and control groups and generated preliminary consensus guidelines for instrument-derived and international classification of disease (ICD) data. ICD lists were validated in two independent datasets (N = 9,151 and 12,394). Results: Recommendations are provided for evaluated instruments for SA and SI, emphasizing selection of lifetime measures phenotype-specific wording. Recommendations are also provided for defining SI and SD from ICD data. As the SA ICD definition is complex, SA code list recommendations were validated against instrument results with sensitivity (range = 15.4% to 80.6%), specificity (range = 67.6% to 97.4%), and positive predictive values (range = 0.59-0.93) reported. Conclusions: Best-practice guidelines are presented for the use of existing information to define SI/SA/SD in consortia research. These proposed definitions are expected to facilitate more homogeneous data aggregation for genetic and multisite studies. Future research should involve refinement, improved generalizability, and validation in diverse populations.

18.

Assertion modeling and its role in clinical phenotype identification.

Bejan, Cosmin Adrian; Vanderwende, Lucy; Xia, Fei; Yetisgen-Yildiz, Meliha.

J Biomed Inform ; 46(1): 68-74, 2013 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-23000479

RESUMEN

This paper describes an approach to assertion classification and an empirical study on the impact this task has on phenotype identification, a real world application in the clinical domain. The task of assertion classification is to assign to each medical concept mentioned in a clinical report (e.g., pneumonia, chest pain) a specific assertion category (e.g., present, absent, and possible). To improve the classification of medical assertions, we propose several new features that capture the semantic properties of special cue words highly indicative of a specific assertion category. The results obtained outperform the current state-of-the-art results for this task. Furthermore, we confirm the intuition that assertion classification contributes in significantly improving the results of phenotype identification from free-text clinical records.

Asunto(s)

Modelos Teóricos , Neumonía/fisiopatología , Humanos , Fenotipo

19.

Kidney Stone Prevalence Based on Self-Report and Electronic Health Records: Insight into the Prevalence of Active Medical Care for Kidney Stones.

Forbes, Connor M; Nimmagadda, Naren; Kavoussi, Nicholas L; Xu, Yaomin; Bejan, Cosmin A; Miller, Nicole L; Hsi, Ryan S.

Urology ; 173: 55-60, 2023 03.

Artículo en Inglés | MEDLINE | ID: mdl-36435346

RESUMEN

OBJECTIVE: To compare rates of patient-reported kidney stone disease to Electronic Health Records (EHR) kidney stone diagnosis using a common dataset to evaluate for socio-demographic differences, including between those with and without active care. METHODS: From the All of Us research database, we identified 21,687 adult participants with both patient-reported and EHR data. We compared differences in age, sex, race, education, employment status and healthcare access between patients with self-reported kidney stone history without EHR data to those with EHR-based diagnoses. RESULTS: In this population, the self-reported prevalence of kidney stones was 8.6% overall (n = 1877), including 4.6% (n = 1004) who had self-reported diagnoses but no EHR data. Among those with self-reported kidney stone diagnoses only, the median age was 66. The EHR-based prevalence of kidney stones was 5.7% (n = 1231), median age 67. No differences were observed in age, sex, education, employment status, rural/urban status, or ability to afford healthcare between groups with EHR diagnosis or self-reported diagnosis only. Of patients who had a self-reported history of kidney stones, 24% reported actively seeing a provider for kidney stones. CONCLUSION: Kidney stone prevalence by self-report is higher than EHR-based prevalence in this national dataset. Using either method alone to estimate kidney stone prevalence may exclude some patients with the condition, although the demographic profile of both groups is similar. Approximately 1 in 4 patients report actively seeing a provider for stone disease.

Asunto(s)

Cálculos Renales , Humanos , Cálculos Renales/diagnóstico , Cálculos Renales/epidemiología , Cálculos Renales/terapia , Masculino , Femenino , Adulto , Persona de Mediana Edad , Anciano , Registros Electrónicos de Salud , Prevalencia , Salud Poblacional

20.

Global trends of monkeypox-related articles: A bibliometric analysis over the last five decades (1964 - July 14, 2022).

Kamal, Manar A; Farahat, Ramadan A; Awad, Ahmed K; Tabassum, Shehroze; Labieb, Fatma; Bejan, Cosmin A; Al-Tawfiq, Jaffar A; Dhama, Kuldeep; Dergaa, Ismail.

J Infect Public Health ; 16(9): 1333-1340, 2023 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-37429097

RESUMEN

BACKGROUND: The first human monkeypox (MPX) case was identified in the Democratic Republic of Congo (DRC) in 1970 with an outbreak in 2010 and the first human MPX case in the UK in 2022. In this study, we conducted a bibliometric analysis of the literature on monkeypox based on the Web of Science Core Collection (WOSCC) of the Institute for Scientific Information (ISI) to identify relevant topics and trends in monkeypox research. METHODS: We searched the Web of Science from 1964 until July 14, 2022, for all publications using the keywords "Monkeypox" and "Monkeypox virus." Results were compared using numerous bibliometric methodologies and stratified by journal, author, year, institution, and country-specific metrics. RESULTS: Out of 1170 publications initially selected, 1163 entered our analysis, with 65.26 % (n = 759) being original research articles and 9.37 % (n = 109) being review articles. Most MPX publications were in 2010, with 6.02 % (n = 70), followed by 2009 and 2022 at 5.67 % (n = 66) each. The USA was the country with the highest number of publications, with n = 662 (56.92 %) of total publications, followed by Germany with n = 82 (7.05 %), the UK with n = 74 (6.36 %), and Congo with n = 65 (5.59 %). Journal of Virology published the highest number of MPX publications, followed by Virology Journal and Emerging Infectious Diseases with n = 52 (9.25 %), n = 43 (7.65 %), and n = 32 (5.69 %) publications, respectively. The top contributing institutions were the Centers for Disease Control and Prevention (CDC), the US Army Medical Research Institute of Infectious Diseases, and the National Institutes of Health (NIH)National Institute of Allergy and Infectious Diseases (NIAID). CONCLUSION: Our analysis provides an objective and robust overview of the current literature on MPX and its global trends; this information could serve as a reference guide for those aiming to conduct further MPX-related research and as a source for those seeking information about MPX.

Asunto(s)

Mpox , Humanos , Bibliometría , Brotes de Enfermedades , Alemania , Mpox/epidemiología , Monkeypox virus

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA