RESUMEN
Coronary artery disease (CAD) exists on a spectrum of disease represented by a combination of risk factors and pathogenic processes. An in silico score for CAD built using machine learning and clinical data in electronic health records captures disease progression, severity and underdiagnosis on this spectrum and could enhance genetic discovery efforts for CAD. Here we tested associations of rare and ultrarare coding variants with the in silico score for CAD in the UK Biobank, All of Us Research Program and BioMe Biobank. We identified associations in 17 genes; of these, 14 show at least moderate levels of prior genetic, biological and/or clinical support for CAD. We also observed an excess of ultrarare coding variants in 321 aggregated CAD genes, suggesting more ultrarare variant associations await discovery. These results expand our understanding of the genetic etiology of CAD and illustrate how digital markers can enhance genetic association investigations for complex diseases.
Asunto(s)
Enfermedad de la Arteria Coronaria , Predisposición Genética a la Enfermedad , Aprendizaje Automático , Enfermedad de la Arteria Coronaria/genética , Humanos , Exoma/genética , Secuenciación del Exoma/métodos , Variación Genética , Estudio de Asociación del Genoma Completo/métodos , Femenino , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Diet is a key modifiable risk factor of coronary artery disease (CAD). However, the causal effects of specific dietary traits on CAD risk remain unclear. With the expansion of dietary data in population biobanks, Mendelian randomization (MR) could help enable the efficient estimation of causality in diet-disease associations. OBJECTIVES: The primary goal was to test causality for 13 common dietary traits on CAD risk using a systematic 2-sample MR framework. A secondary goal was to identify plasma metabolites mediating diet-CAD associations suspected to be causal. METHODS: Cross-sectional genetic and dietary data on up to 420,531 UK Biobank and 184,305 CARDIoGRAMplusC4D individuals of European ancestry were used in 2-sample MR. The primary analysis used fixed effect inverse-variance weighted regression, while sensitivity analyses used weighted median estimation, MR-Egger regression, and MR-Pleiotropy Residual Sum and Outlier. RESULTS: Genetic variants serving as proxies for muesli intake were negatively associated with CAD risk (OR: 0.74; 95% CI: 0.65-0.84; P = 5.385 × 10-4). Sensitivity analyses using weighted median estimation supported this with a significant association in the same direction. Additionally, we identified higher plasma acetate levels as a potential mediator (OR: 0.03; 95% CI: 0.01-0.12; P = 1.15 × 10-4). CONCLUSIONS: Muesli, a mixture of oats, seeds, nuts, dried fruit, and milk, may causally reduce CAD risk. Circulating levels of acetate, a gut microbiota-derived short-chain fatty acid, could be mediating its cardioprotective effects. These findings highlight the role of gut flora in cardiovascular health and help prioritize randomized trials on dietary interventions for CAD.
RESUMEN
Population-based genomic screening may help diagnose individuals with disease-risk variants. Here, we perform a genome-first evaluation for nine disorders in 29,039 participants with linked exome sequences and electronic health records (EHRs). We identify 614 individuals with 303 pathogenic/likely pathogenic or predicted loss-of-function (P/LP/LoF) variants, yielding 644 observations; 487 observations (76%) lack a corresponding clinical diagnosis in the EHR. Upon further investigation, 75 clinically undiagnosed observations (15%) have evidence of symptomatic untreated disease, including familial hypercholesterolemia (3 of 6 [50%] undiagnosed observations with disease evidence) and breast cancer (23 of 106 [22%]). These genetic findings enable targeted phenotyping that reveals new diagnoses in previously undiagnosed individuals. Disease yield is greater with variants in penetrant genes for which disease is observed in carriers in an independent cohort. The prevalence of P/LP/LoF variants exceeds that of clinical diagnoses, and some clinically undiagnosed carriers are discovered to have disease. These results highlight the potential of population-based genomic screening.
Asunto(s)
Secuenciación del Exoma , Exoma , Humanos , Femenino , Masculino , Exoma/genética , Secuenciación del Exoma/métodos , Persona de Mediana Edad , Adulto , Enfermedades Genéticas Congénitas/genética , Enfermedades Genéticas Congénitas/diagnóstico , Enfermedades Genéticas Congénitas/epidemiología , Predisposición Genética a la Enfermedad , Registros Electrónicos de Salud , Pruebas Genéticas/métodos , Genoma Humano , Anciano , Atención a la Salud , Adolescente , Genómica/métodos , Adulto JovenRESUMEN
Studies have shown that drug targets with human genetic support are more likely to succeed in clinical trials. Hence, a tool integrating genetic evidence to prioritize drug target genes is beneficial for drug discovery. We built a genetic priority score (GPS) by integrating eight genetic features with drug indications from the Open Targets and SIDER databases. The top 0.83%, 0.28% and 0.19% of the GPS conferred a 5.3-, 9.9- and 11.0-fold increased effect of having an indication, respectively. In addition, we observed that targets in the top 0.28% of the score were 1.7-, 3.7- and 8.8-fold more likely to advance from phase I to phases II, III and IV, respectively. Complementary to the GPS, we incorporated the direction of genetic effect and drug mechanism into a directional version of the score called the GPS with direction of effect. We applied our method to 19,365 protein-coding genes and 399 drug indications and made all results available through a web portal.
Asunto(s)
Genética Humana , Farmacogenética , Humanos , Descubrimiento de DrogasRESUMEN
Kidney disease affects 50% of all diabetic patients; however, prediction of disease progression has been challenging due to inherent disease heterogeneity. We use deep learning to identify novel genetic signatures prognostically associated with outcomes. Using autoencoders and unsupervised clustering of electronic health record data on 1,372 diabetic kidney disease patients, we establish two clusters with differential prevalence of end-stage kidney disease. Exome-wide associations identify a novel variant in ARHGEF18, a Rho guanine exchange factor specifically expressed in glomeruli. Overexpression of ARHGEF18 in human podocytes leads to impairments in focal adhesion architecture, cytoskeletal dynamics, cellular motility, and RhoA/Rac1 activation. Mutant GEF18 is resistant to ubiquitin mediated degradation leading to pathologically increased protein levels. Our findings uncover the first known disease-causing genetic variant that affects protein stability of a cytoskeletal regulator through impaired degradation, a potentially novel class of expression quantitative trait loci that can be therapeutically targeted.
RESUMEN
BACKGROUND: Lyme disease is the most prevalent vector-borne disease in the US, yet its host factors are poorly understood and diagnostic tests are limited. We evaluated patients in a large health system to uncover cholesterol's role in the susceptibility, severity, and machine learning-based diagnosis of Lyme disease. METHODS: A longitudinal health system cohort comprised 1 019 175 individuals with electronic health record data and 50 329 with linked genetic data. Associations of blood cholesterol level, cholesterol genetic scores comprising common genetic variants, and burden of rare loss-of-function (LoF) variants in cholesterol metabolism genes with Lyme disease were investigated. A portable machine learning model was constructed and tested to predict Lyme disease using routine lipid and clinical measurements. RESULTS: There were 3832 cases of Lyme disease. Increasing cholesterol was associated with greater risk of Lyme disease and hypercholesterolemia was more prevalent in Lyme disease cases than in controls. Cholesterol genetic scores and rare LoF variants in CD36 and LDLR were associated with Lyme disease risk. Serological profiling of cases revealed parallel trajectories of rising cholesterol and immunoglobulin levels over the disease course, including marked increases in individuals with LoF variants and high cholesterol genetic scores. The machine learning model predicted Lyme disease solely using routine lipid panel, blood count, and metabolic measurements. CONCLUSIONS: These results demonstrate the value of large-scale genetic and clinical data to reveal host factors underlying infectious disease biology, risk, and prognosis and the potential for their clinical translation to machine learning diagnostics that do not need specialized assays.
Asunto(s)
Hipercolesterolemia , Enfermedad de Lyme , Humanos , Enfermedad de Lyme/diagnóstico , Enfermedad de Lyme/epidemiología , Colesterol , Pronóstico , Aprendizaje AutomáticoRESUMEN
Systemic autoimmune rheumatic diseases (SARDs) can lead to irreversible damage if left untreated, yet these patients often endure long diagnostic journeys before being diagnosed and treated. Machine learning may help overcome the challenges of diagnosing SARDs and inform clinical decision-making. Here, we developed and tested a machine learning model to identify patients who should receive rheumatological evaluation for SARDs using longitudinal electronic health records of 161,584 individuals from two institutions. The model demonstrated high performance for predicting cases of autoantibody-tested individuals in a validation set, an external test set, and an independent cohort with a broader case definition. This approach identified more individuals for autoantibody testing compared with current clinical standards and a greater proportion of autoantibody carriers among those tested. Diagnoses of SARDs and other autoimmune conditions increased with higher model probabilities. The model detected a need for autoantibody testing and rheumatology encounters up to five years before the test date and assessment date, respectively. Altogether, these findings illustrate that the clinical manifestations of a diverse array of autoimmune conditions are detectable in electronic health records using machine learning, which may help systematize and accelerate autoimmune testing.
Asunto(s)
Enfermedades Autoinmunes , Registros Electrónicos de Salud , Humanos , Enfermedades Autoinmunes/diagnóstico , Pacientes , Autoanticuerpos , Aprendizaje AutomáticoRESUMEN
Background: Causality between plasma triglyceride (TG) levels and atherosclerotic cardiovascular disease (ASCVD) risk remains controversial despite more than four decades of study and two recent landmark trials, STRENGTH, and REDUCE-IT. Further unclear is the association between TG levels and non-atherosclerotic diseases across organ systems. Methods: Here, we conducted a phenome-wide, two-sample Mendelian randomization (MR) analysis using inverse-variance weighted (IVW) regression to systematically infer the causal effects of plasma TG levels on 2600 disease traits in the European ancestry population of UK Biobank. For replication, we externally tested 221 nominally significant associations (p<0.05) in an independent cohort from FinnGen. To account for potential horizontal pleiotropy and the influence of invalid instrumental variables, we performed sensitivity analyses using MR-Egger regression, weighted median estimator, and MR-PRESSO. Finally, we used multivariable MR (MVMR) controlling for correlated lipid fractions to distinguish the independent effect of plasma TG levels. Results: Our results identified seven disease traits reaching Bonferroni-corrected significance in both the discovery (p<1.92 × 10-5) and replication analyses (p<2.26 × 10-4), suggesting a causal relationship between plasma TG levels and ASCVDs, including coronary artery disease (OR 1.33, 95% CI 1.24-1.43, p=2.47 × 10-13). We also identified 12 disease traits that were Bonferroni-significant in the discovery or replication analysis and at least nominally significant in the other analysis (p<0.05), identifying plasma TG levels as a novel potential risk factor for nine non-ASCVD diseases, including uterine leiomyoma (OR 1.19, 95% CI 1.10-1.29, p=1.17 × 10-5). Conclusions: Taking a phenome-wide, two-sample MR approach, we identified causal associations between plasma TG levels and 19 disease traits across organ systems. Our findings suggest unrealized drug repurposing opportunities or adverse effects related to approved and emerging TG-lowering agents, as well as mechanistic insights for future studies. Funding: RD is supported by the National Institute of General Medical Sciences of the National Institutes of Health (NIH) (R35-GM124836) and the National Heart, Lung, and Blood Institute of the NIH (R01-HL139865 and R01-HL155915).
Asunto(s)
Aterosclerosis , Enfermedad de la Arteria Coronaria , Humanos , Análisis de la Aleatorización Mendeliana , Fenotipo , Enfermedad de la Arteria Coronaria/genética , Triglicéridos , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Binary diagnosis of coronary artery disease does not preserve the complexity of disease or quantify its severity or its associated risk with death; hence, a quantitative marker of coronary artery disease is warranted. We evaluated a quantitative marker of coronary artery disease derived from probabilities of a machine learning model. METHODS: In this cohort study, we developed and validated a coronary artery disease-predictive machine learning model using 95 935 electronic health records and assessed its probabilities as in-silico scores for coronary artery disease (ISCAD; range 0 [lowest probability] to 1 [highest probability]) in participants in two longitudinal biobank cohorts. We measured the association of ISCAD with clinical outcomes-namely, coronary artery stenosis, obstructive coronary artery disease, multivessel coronary artery disease, all-cause death, and coronary artery disease sequelae. FINDINGS: Among 95 935 participants, 35 749 were from the BioMe Biobank (median age 61 years [IQR 18]; 14 599 [41%] were male and 21 150 [59%] were female; 5130 [14%] were with diagnosed coronary artery disease) and 60 186 were from the UK Biobank (median age 62 [15] years; 25 031 [42%] male and 35 155 [58%] female; 8128 [14%] with diagnosed coronary artery disease). The model predicted coronary artery disease with an area under the receiver operating characteristic curve of 0·95 (95% CI 0·94-0·95; sensitivity of 0·94 [0·94-0·95] and specificity of 0·82 [0·81-0·83]) and 0·93 (0·92-0·93; sensitivity of 0·90 [0·89-0·90] and specificity of 0·88 [0·87-0·88]) in the BioMe validation and holdout sets, respectively, and 0·91 (0·91-0·91; sensitivity of 0·84 [0·83-0·84] and specificity of 0·83 [0·82-0·83]) in the UK Biobank external test set. ISCAD captured coronary artery disease risk from known risk factors, pooled cohort equations, and polygenic risk scores. Coronary artery stenosis increased quantitatively with ascending ISCAD quartiles (increase per quartile of 12 percentage points), including risk of obstructive coronary artery disease, multivessel coronary artery disease, and stenosis of major coronary arteries. Hazard ratios (HRs) and prevalence of all-cause death increased stepwise over ISCAD deciles (decile 1: HR 1·0 [95% CI 1·0-1·0], 0·2% prevalence; decile 6: 11 [3·9-31], 3·1% prevalence; and decile 10: 56 [20-158], 11% prevalence). A similar trend was observed for recurrent myocardial infarction. 12 (46%) undiagnosed individuals with high ISCAD (≥0·9) had clinical evidence of coronary artery disease according to the 2014 American College of Cardiology/American Heart Association Task Force guidelines. INTERPRETATION: Electronic health record-based machine learning was used to generate an in-silico marker for coronary artery disease that can non-invasively quantify atherosclerosis and risk of death on a continuous spectrum, and identify underdiagnosed individuals. FUNDING: National Institutes of Health.
Asunto(s)
Enfermedad de la Arteria Coronaria , Estenosis Coronaria , Humanos , Masculino , Femenino , Persona de Mediana Edad , Enfermedad de la Arteria Coronaria/diagnóstico , Enfermedad de la Arteria Coronaria/epidemiología , Estudios de Cohortes , Valor Predictivo de las Pruebas , Estenosis Coronaria/diagnóstico , Factores de Riesgo , Aprendizaje Automático , Angiografía CoronariaRESUMEN
Phenome-wide association studies identified numerous loci associated with traits and diseases. To help interpret these associations, we constructed a phenome-wide network map of colocalized genes and phenotypes. We generated colocalized signals using the Genotype-Tissue Expression data and genome-wide association results in UK Biobank. We identified 9151 colocalized genes for 1411 phenotypes across 48 tissues. Then, we constructed bipartite networks using the colocalized signals in each tissue, and showed that the majority of links were observed in a single tissue. We applied the biLouvain clustering algorithm in each tissue-specific network to identify co-clusters of genes and phenotypes. We observed significant enrichments of these co-clusters with known biological and functional gene classes. Overall, the phenome-wide map provides links between genes, phenotypes and tissues, and can yield biological and clinical discoveries.
Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Bancos de Muestras Biológicas , Fenotipo , Reino UnidoRESUMEN
Genetic risk for coronary artery disease (CAD) is commonly measured with polygenic risk scores (PRS); yet, the relationship of atherosclerotic burden with PRS in healthy individuals not at high clinical risk for CAD (ie, without a high pooled cohort equations [PCE] score) is unknown. Here, we implemented a novel recall-by-PRS strategy to measure coronary artery calcium (CAC) scores prospectively in 53 healthy individuals with extreme high PRS (median [IQR] PRS = 94% [83-98]) and low PRS (median [IQR] PRS = 3.6% [1.2-10]). The high PRS group was associated with a 2.8-fold greater CAC than the low PRS group, adjusted for age, sex, BMI, smoking, and statin use, and had a 6.7-fold greater proportion of individuals with CAC exceeding 300 HU. These findings reveal that extreme PRS tracks with CAD risk even in those without high clinical risk and demonstrate proof of principle for recall-by-PRS approaches that should be assessed prospectively in larger trials.
Asunto(s)
Calcio , Enfermedad de la Arteria Coronaria , Calcio de la Dieta , Estudios de Cohortes , Enfermedad de la Arteria Coronaria/genética , Humanos , Medición de Riesgo , Factores de RiesgoRESUMEN
BACKGROUND: Clinical features from electronic health records (EHRs) can be used to build a complementary tool to predict coronary artery disease (CAD) susceptibility. OBJECTIVES: The purpose of this study was to determine whether an EHR score can improve CAD prediction and reclassification 1 year before diagnosis, beyond conventional clinical guidelines as determined by the pooled cohort equations (PCE) and a polygenic risk score for CAD. METHODS: We applied a machine learning framework using clinical features from the EHR in a multiethnic, clinical care cohort (BioMe) comprising 555 CAD cases and 6,349 control subjects and in a population-based cohort (UK Biobank) comprising 3,130 CAD cases and 378,344 control subjects for external validation. RESULTS: Compared with the PCE, the EHR score improved CAD prediction by 12% in the BioMe Biobank and by 9% in the UK Biobank. The EHR score reclassified 25.8% and 15.2% individuals in each cohort respectively, compared with the PCE score. We observed larger improvements in the EHR score over the PCE in a subgroup of individuals with low CAD risk, with 20% increased discrimination and 34.4% increased reclassification. In all models, the polygenic risk score for CAD did not improve CAD prediction, compared with the PCE or EHR score. CONCLUSIONS: The EHR score resulted in increased prediction and reclassification for CAD, demonstrating its potential use for population health monitoring of short-term CAD risk in large health systems.
Asunto(s)
Enfermedad de la Arteria Coronaria , Registros Electrónicos de Salud , Estudios de Cohortes , Enfermedad de la Arteria Coronaria/diagnóstico , Enfermedad de la Arteria Coronaria/epidemiología , Estudio de Asociación del Genoma Completo , Humanos , Medición de Riesgo/métodos , Factores de RiesgoRESUMEN
AIMS: Individuals with supranormal left ventricular ejection fraction (snLVEF; LVEF >70%) have increased mortality. However, the genetic and phenotypic profile of snLVEF remains unknown. This study aimed to determine the relationship of both snLVEF genetic risk and phenotype with survival and underdiagnosed heart failure (HF). METHODS AND RESULTS: A snLVEF genetic risk score (GRS) was applied and cases of snLVEF were identified in 486 754 individuals across two population-based cohorts (BioMe Biobank and UK Biobank). The snLVEF GRS and phenotype were evaluated for association with survival, as well as HF diagnosis, markers, symptoms, and medications. Of 486 754 participants, the median age was 58 years, 20 069 (4.1%) died, and 10 088 (2.1%) had diagnosed HF. Both snLVEF GRS (hazard ratio [HR] 1.1 for top 10% vs. bottom 10% GRS; p = 0.002) and phenotype (HR 1.4; p = 0.003) were associated with increased all-cause mortality. Both snLVEF GRS and phenotype were associated with reduced HF diagnosis (odds ratio [OR] 0.97 and OR 0.63, respectively; both p ≤0.002). However, the snLVEF GRS and phenotype were both associated with elevated brain natriuretic peptide (BNP) levels (146 and 185 pg/ml increase, respectively; p <0.001), including 268 out of 455 (59%) individuals with snLVEF phenotype who had BNP >100 pg/ml. Among 476 666 participants without HF diagnoses, snLVEF GRS and phenotype were associated with increased HF symptoms (e.g. exertional dyspnoea OR 1.4 and OR 1.3; p <0.003) and HF medications (e.g. loop diuretic OR 1.2 and OR 1.03; p <0.02). Associations were consistent in hypertensive individuals without cardiac comorbidities. CONCLUSIONS: Genetic predisposition to and presence of snLVEF are associated with decreased survival and underdiagnosed HF.
Asunto(s)
Insuficiencia Cardíaca , Humanos , Insuficiencia Cardíaca/diagnóstico , Insuficiencia Cardíaca/epidemiología , Insuficiencia Cardíaca/genética , Volumen Sistólico , Función Ventricular IzquierdaRESUMEN
Importance: Population-based assessment of disease risk associated with gene variants informs clinical decisions and risk stratification approaches. Objective: To evaluate the population-based disease risk of clinical variants in known disease predisposition genes. Design, Setting, and Participants: This cohort study included 72â¯434 individuals with 37â¯780 clinical variants who were enrolled in the BioMe Biobank from 2007 onwards with follow-up until December 2020 and the UK Biobank from 2006 to 2010 with follow-up until June 2020. Participants had linked exome and electronic health record data, were older than 20 years, and were of diverse ancestral backgrounds. Exposures: Variants previously reported as pathogenic or predicted to cause a loss of protein function by bioinformatic algorithms (pathogenic/loss-of-function variants). Main Outcomes and Measures: The primary outcome was the disease risk associated with clinical variants. The risk difference (RD) between the prevalence of disease in individuals with a variant allele (penetrance) vs in individuals with a normal allele was measured. Results: Among 72â¯434 study participants, 43â¯395 were from the UK Biobank (mean [SD] age, 57 [8.0] years; 24 065 [55%] women; 2948 [7%] non-European) and 29â¯039 were from the BioMe Biobank (mean [SD] age, 56 [16] years; 17 355 [60%] women; 19 663 [68%] non-European). Of 5360 pathogenic/loss-of-function variants, 4795 (89%) were associated with an RD less than or equal to 0.05. Mean penetrance was 6.9% (95% CI, 6.0%-7.8%) for pathogenic variants and 0.85% (95% CI, 0.76%-0.95%) for benign variants reported in ClinVar (difference, 6.0 [95% CI, 5.6-6.4] percentage points), with a median of 0% for both groups due to large numbers of nonpenetrant variants. Penetrance of pathogenic/loss-of-function variants for late-onset diseases was modified by age: mean penetrance was 10.3% (95% CI, 9.0%-11.6%) in individuals 70 years or older and 8.5% (95% CI, 7.9%-9.1%) in individuals 20 years or older (difference, 1.8 [95% CI, 0.40-3.3] percentage points). Penetrance of pathogenic/loss-of-function variants was heterogeneous even in known disease predisposition genes, including BRCA1 (mean [range], 38% [0%-100%]), BRCA2 (mean [range], 38% [0%-100%]), and PALB2 (mean [range], 26% [0%-100%]). Conclusions and Relevance: In 2 large biobank cohorts, the estimated penetrance of pathogenic/loss-of-function variants was variable but generally low. Further research of population-based penetrance is needed to refine variant interpretation and clinical evaluation of individuals with these variant alleles.
Asunto(s)
Predisposición Genética a la Enfermedad , Variación Genética , Mutación con Pérdida de Función , Penetrancia , Anciano , Bancos de Muestras Biológicas , Estudios de Cohortes , Femenino , Humanos , Masculino , Mutación , Reino UnidoRESUMEN
Background Despite advances in cardiovascular disease and risk factor management, mortality from ischemic heart failure (HF) in patients with coronary artery disease (CAD) remains high. Given the partial role of genetics in HF and lack of reliable risk stratification tools, we developed and validated a polygenic risk score for HF in patients with CAD, which we term HF-PRS. Methods and Results Using summary statistics from a recent genome-wide association study for HF, we developed candidate PRSs in the Mount Sinai BioMe CAD patient cohort (N=6274) by using the pruning and thresholding method and LDPred. We validated the best score in the Penn Medicine BioBank (N=7250) and performed a subgroup analysis in a high-risk cohort who had undergone coronary catheterization. We observed a significant association between HF-PRS score and ischemic HF even after adjusting for evidence of obstructive CAD in patients of European ancestry in both BioMe (odds ratio [OR], 1.14 per SD; 95% CI, 1.05-1.24; P=0.003) and Penn Medicine BioBank (OR, 1.07 per SD; 95% CI, 1.01-1.13; P=0.016). In European patients with CAD in Penn Medicine BioBank who had undergone coronary catheterization, individuals in the top 10th percentile of PRS had a 2-fold increased odds of ischemic HF (OR, 2.0; 95% CI, 1.1-3.7; P=0.02) compared with the bottom 10th percentile. Conclusions A PRS for HF enables risk stratification in patients with CAD. Future prospective studies aimed at demonstrating clinical utility are warranted for adoption in the patient setting.
Asunto(s)
Insuficiencia Cardíaca , Enfermedad de la Arteria Coronaria/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Insuficiencia Cardíaca/diagnóstico , Insuficiencia Cardíaca/genética , Humanos , Herencia Multifactorial , Estudios Prospectivos , Factores de RiesgoRESUMEN
PURPOSE: Limited mechanical ventilators (MV) during the Coronavirus disease (COVID-19) pandemic have led to the use of non-invasive ventilation (NIV) in hypoxemic patients, which has not been studied well. We aimed to assess the association of NIV versus MV with mortality and morbidity during respiratory intervention among hypoxemic patients admitted with COVID-19. METHODS: We performed a retrospective multi-center cohort study across 5 hospitals during March-April 2020. Outcomes included mortality, severe COVID-19-related symptoms, time to discharge, and final oxygen saturation (SpO2) at the conclusion of the respiratory intervention. Multivariable regression of outcomes was conducted in all hypoxemic participants, 4 subgroups, and propensity-matched analysis. RESULTS: Of 2381 participants with laboratory-confirmed SARS-CoV-2, 688 were included in the study who were hypoxemic upon initiation of respiratory intervention. During the study period, 299 participants died (43%), 163 were admitted to the ICU (24%), and 121 experienced severe COVID-19-related symptoms (18%). Participants on MV had increased mortality than those on NIV (128/154 [83%] versus 171/534 [32%], OR = 30, 95% CI 16-60) with a mean survival of 6 versus 15 days, respectively. The MV group experienced more severe COVID-19-related symptoms [55/154 (36%) versus 66/534 (12%), OR = 4.3, 95% CI 2.7-6.8], longer time to discharge (mean 17 versus 7.1 days), and lower final SpO2 (92 versus 94%). Across all subgroups and propensity-matched analysis, MV was associated with a greater OR of death than NIV. CONCLUSIONS: NIV was associated with lower respiratory intervention mortality and morbidity than MV. However, findings may be liable to unmeasured confounding and further study from randomized controlled trials is needed to definitively determine the role of NIV in hypoxemic patients with COVID-19.
Asunto(s)
COVID-19 , Ventilación no Invasiva , Estudios de Cohortes , Humanos , Respiración Artificial , Estudios Retrospectivos , SARS-CoV-2RESUMEN
Biobanks with exomes linked to electronic health records (EHRs) enable the study of genetic pleiotropy between rare variants and seemingly disparate diseases. We performed robust clinical phenotyping of rare, putatively deleterious variants (loss-of-function [LoF] and deleterious missense variants) in ERCC6, a gene implicated in inherited retinal disease. We analyzed 213,084 exomes, along with a targeted set of retinal, cardiac, and immune phenotypes from two large-scale EHR-linked biobanks. In the primary analysis, a burden of deleterious variants in ERCC6 was strongly associated with (1) retinal disorders; (2) cardiac and electrocardiogram perturbations; and (3) immunodeficiency and decreased immunoglobulin levels. Meta-analysis of results from the BioMe Biobank and UK Biobank showed a significant association of deleterious ERCC6 burden with retinal dystrophy (odds ratio [OR] = 2.6, 95% confidence interval [CI]: 1.5-4.6; p = 8.7 × 10-4 ), atypical atrial flutter (OR = 3.5, 95% CI: 1.9-6.5; p = 6.2 × 10-5 ), arrhythmia (OR = 1.5, 95% CI: 1.2-2.0; p = 2.7 × 10-3 ), and lymphocyte immunodeficiency (OR = 3.8, 95% CI: 2.1-6.8; p = 5.0 × 10-6 ). Carriers of ERCC6 LoF variants who lacked a diagnosis of these conditions exhibited increased symptoms, indicating underdiagnosis. These results reveal a unique genetic link among retinal, cardiac, and immune disorders and underscore the value of EHR-linked biobanks in assessing the full clinical profile of carriers of rare variants.
Asunto(s)
Pleiotropía Genética , Distrofias Retinianas , Arritmias Cardíacas , ADN Helicasas , Enzimas Reparadoras del ADN , Exoma , Humanos , Proteínas de Unión a Poli-ADP-Ribosa , Distrofias Retinianas/genética , Secuenciación del Exoma/métodosRESUMEN
Diabetic retinopathy (DR) is a common consequence in type 2 diabetes (T2D) and a leading cause of blindness in working-age adults. Yet, its genetic predisposition is largely unknown. Here, we examined the polygenic architecture underlying DR by deriving and assessing a genome-wide polygenic risk score (PRS) for DR. We evaluated the PRS in 6079 individuals with T2D of European, Hispanic, African and other ancestries from a large-scale multi-ethnic biobank. Main outcomes were PRS association with DR diagnosis, symptoms and complications, and time to diagnosis, and transferability to non-European ancestries. We observed that PRS was significantly associated with DR. A standard deviation increase in PRS was accompanied by an adjusted odds ratio (OR) of 1.12 [95% confidence interval (CI) 1.04-1.20; P = 0.001] for DR diagnosis. When stratified by ancestry, PRS was associated with the highest OR in European ancestry (OR = 1.22, 95% CI 1.02-1.41; P = 0.049), followed by African (OR = 1.15, 95% CI 1.03-1.28; P = 0.028) and Hispanic ancestries (OR = 1.10, 95% CI 1.00-1.10; P = 0.050). Individuals in the top PRS decile had a 1.8-fold elevated risk for DR versus the bottom decile (P = 0.002). Among individuals without DR diagnosis, the top PRS decile had more DR symptoms than the bottom decile (P = 0.008). The PRS was associated with retinal hemorrhage (OR = 1.44, 95% CI 1.03-2.02; P = 0.03) and earlier DR presentation (10% probability of DR by 4 years in the top PRS decile versus 8 years in the bottom decile). These results establish the significant polygenic underpinnings of DR and indicate the need for more diverse ancestries in biobanks to develop multi-ancestral PRS.