Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Lancet ; 401(10372): 215-225, 2023 Jan 21.
Artículo en Inglés | MEDLINE | ID: mdl-36563696

RESUMEN

BACKGROUND: Binary diagnosis of coronary artery disease does not preserve the complexity of disease or quantify its severity or its associated risk with death; hence, a quantitative marker of coronary artery disease is warranted. We evaluated a quantitative marker of coronary artery disease derived from probabilities of a machine learning model. METHODS: In this cohort study, we developed and validated a coronary artery disease-predictive machine learning model using 95 935 electronic health records and assessed its probabilities as in-silico scores for coronary artery disease (ISCAD; range 0 [lowest probability] to 1 [highest probability]) in participants in two longitudinal biobank cohorts. We measured the association of ISCAD with clinical outcomes-namely, coronary artery stenosis, obstructive coronary artery disease, multivessel coronary artery disease, all-cause death, and coronary artery disease sequelae. FINDINGS: Among 95 935 participants, 35 749 were from the BioMe Biobank (median age 61 years [IQR 18]; 14 599 [41%] were male and 21 150 [59%] were female; 5130 [14%] were with diagnosed coronary artery disease) and 60 186 were from the UK Biobank (median age 62 [15] years; 25 031 [42%] male and 35 155 [58%] female; 8128 [14%] with diagnosed coronary artery disease). The model predicted coronary artery disease with an area under the receiver operating characteristic curve of 0·95 (95% CI 0·94-0·95; sensitivity of 0·94 [0·94-0·95] and specificity of 0·82 [0·81-0·83]) and 0·93 (0·92-0·93; sensitivity of 0·90 [0·89-0·90] and specificity of 0·88 [0·87-0·88]) in the BioMe validation and holdout sets, respectively, and 0·91 (0·91-0·91; sensitivity of 0·84 [0·83-0·84] and specificity of 0·83 [0·82-0·83]) in the UK Biobank external test set. ISCAD captured coronary artery disease risk from known risk factors, pooled cohort equations, and polygenic risk scores. Coronary artery stenosis increased quantitatively with ascending ISCAD quartiles (increase per quartile of 12 percentage points), including risk of obstructive coronary artery disease, multivessel coronary artery disease, and stenosis of major coronary arteries. Hazard ratios (HRs) and prevalence of all-cause death increased stepwise over ISCAD deciles (decile 1: HR 1·0 [95% CI 1·0-1·0], 0·2% prevalence; decile 6: 11 [3·9-31], 3·1% prevalence; and decile 10: 56 [20-158], 11% prevalence). A similar trend was observed for recurrent myocardial infarction. 12 (46%) undiagnosed individuals with high ISCAD (≥0·9) had clinical evidence of coronary artery disease according to the 2014 American College of Cardiology/American Heart Association Task Force guidelines. INTERPRETATION: Electronic health record-based machine learning was used to generate an in-silico marker for coronary artery disease that can non-invasively quantify atherosclerosis and risk of death on a continuous spectrum, and identify underdiagnosed individuals. FUNDING: National Institutes of Health.


Asunto(s)
Enfermedad de la Arteria Coronaria , Estenosis Coronaria , Humanos , Masculino , Femenino , Persona de Mediana Edad , Enfermedad de la Arteria Coronaria/diagnóstico , Enfermedad de la Arteria Coronaria/epidemiología , Estudios de Cohortes , Valor Predictivo de las Pruebas , Estenosis Coronaria/diagnóstico , Factores de Riesgo , Aprendizaje Automático , Angiografía Coronaria
2.
Hum Mol Genet ; 30(10): 952-960, 2021 05 29.
Artículo en Inglés | MEDLINE | ID: mdl-33704450

RESUMEN

Diabetic retinopathy (DR) is a common consequence in type 2 diabetes (T2D) and a leading cause of blindness in working-age adults. Yet, its genetic predisposition is largely unknown. Here, we examined the polygenic architecture underlying DR by deriving and assessing a genome-wide polygenic risk score (PRS) for DR. We evaluated the PRS in 6079 individuals with T2D of European, Hispanic, African and other ancestries from a large-scale multi-ethnic biobank. Main outcomes were PRS association with DR diagnosis, symptoms and complications, and time to diagnosis, and transferability to non-European ancestries. We observed that PRS was significantly associated with DR. A standard deviation increase in PRS was accompanied by an adjusted odds ratio (OR) of 1.12 [95% confidence interval (CI) 1.04-1.20; P = 0.001] for DR diagnosis. When stratified by ancestry, PRS was associated with the highest OR in European ancestry (OR = 1.22, 95% CI 1.02-1.41; P = 0.049), followed by African (OR = 1.15, 95% CI 1.03-1.28; P = 0.028) and Hispanic ancestries (OR = 1.10, 95% CI 1.00-1.10; P = 0.050). Individuals in the top PRS decile had a 1.8-fold elevated risk for DR versus the bottom decile (P = 0.002). Among individuals without DR diagnosis, the top PRS decile had more DR symptoms than the bottom decile (P = 0.008). The PRS was associated with retinal hemorrhage (OR = 1.44, 95% CI 1.03-2.02; P = 0.03) and earlier DR presentation (10% probability of DR by 4 years in the top PRS decile versus 8 years in the bottom decile). These results establish the significant polygenic underpinnings of DR and indicate the need for more diverse ancestries in biobanks to develop multi-ancestral PRS.


Asunto(s)
Diabetes Mellitus Tipo 2/epidemiología , Retinopatía Diabética/epidemiología , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Adulto , Anciano , Población Negra/genética , Diabetes Mellitus Tipo 2/complicaciones , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/patología , Retinopatía Diabética/complicaciones , Retinopatía Diabética/genética , Retinopatía Diabética/patología , Hispánicos o Latinos/genética , Humanos , Persona de Mediana Edad , Herencia Multifactorial/genética , Medición de Riesgo , Factores de Riesgo , Población Blanca/genética
3.
Am Heart J ; 250: 29-33, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35526571

RESUMEN

Genetic risk for coronary artery disease (CAD) is commonly measured with polygenic risk scores (PRS); yet, the relationship of atherosclerotic burden with PRS in healthy individuals not at high clinical risk for CAD (ie, without a high pooled cohort equations [PCE] score) is unknown. Here, we implemented a novel recall-by-PRS strategy to measure coronary artery calcium (CAC) scores prospectively in 53 healthy individuals with extreme high PRS (median [IQR] PRS = 94% [83-98]) and low PRS (median [IQR] PRS = 3.6% [1.2-10]). The high PRS group was associated with a 2.8-fold greater CAC than the low PRS group, adjusted for age, sex, BMI, smoking, and statin use, and had a 6.7-fold greater proportion of individuals with CAC exceeding 300 HU. These findings reveal that extreme PRS tracks with CAD risk even in those without high clinical risk and demonstrate proof of principle for recall-by-PRS approaches that should be assessed prospectively in larger trials.


Asunto(s)
Calcio , Enfermedad de la Arteria Coronaria , Calcio de la Dieta , Estudios de Cohortes , Enfermedad de la Arteria Coronaria/genética , Humanos , Medición de Riesgo , Factores de Riesgo
4.
JAMA ; 327(4): 350-359, 2022 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-35076666

RESUMEN

Importance: Population-based assessment of disease risk associated with gene variants informs clinical decisions and risk stratification approaches. Objective: To evaluate the population-based disease risk of clinical variants in known disease predisposition genes. Design, Setting, and Participants: This cohort study included 72 434 individuals with 37 780 clinical variants who were enrolled in the BioMe Biobank from 2007 onwards with follow-up until December 2020 and the UK Biobank from 2006 to 2010 with follow-up until June 2020. Participants had linked exome and electronic health record data, were older than 20 years, and were of diverse ancestral backgrounds. Exposures: Variants previously reported as pathogenic or predicted to cause a loss of protein function by bioinformatic algorithms (pathogenic/loss-of-function variants). Main Outcomes and Measures: The primary outcome was the disease risk associated with clinical variants. The risk difference (RD) between the prevalence of disease in individuals with a variant allele (penetrance) vs in individuals with a normal allele was measured. Results: Among 72 434 study participants, 43 395 were from the UK Biobank (mean [SD] age, 57 [8.0] years; 24 065 [55%] women; 2948 [7%] non-European) and 29 039 were from the BioMe Biobank (mean [SD] age, 56 [16] years; 17 355 [60%] women; 19 663 [68%] non-European). Of 5360 pathogenic/loss-of-function variants, 4795 (89%) were associated with an RD less than or equal to 0.05. Mean penetrance was 6.9% (95% CI, 6.0%-7.8%) for pathogenic variants and 0.85% (95% CI, 0.76%-0.95%) for benign variants reported in ClinVar (difference, 6.0 [95% CI, 5.6-6.4] percentage points), with a median of 0% for both groups due to large numbers of nonpenetrant variants. Penetrance of pathogenic/loss-of-function variants for late-onset diseases was modified by age: mean penetrance was 10.3% (95% CI, 9.0%-11.6%) in individuals 70 years or older and 8.5% (95% CI, 7.9%-9.1%) in individuals 20 years or older (difference, 1.8 [95% CI, 0.40-3.3] percentage points). Penetrance of pathogenic/loss-of-function variants was heterogeneous even in known disease predisposition genes, including BRCA1 (mean [range], 38% [0%-100%]), BRCA2 (mean [range], 38% [0%-100%]), and PALB2 (mean [range], 26% [0%-100%]). Conclusions and Relevance: In 2 large biobank cohorts, the estimated penetrance of pathogenic/loss-of-function variants was variable but generally low. Further research of population-based penetrance is needed to refine variant interpretation and clinical evaluation of individuals with these variant alleles.


Asunto(s)
Predisposición Genética a la Enfermedad , Variación Genética , Mutación con Pérdida de Función , Penetrancia , Anciano , Bancos de Muestras Biológicas , Estudios de Cohortes , Femenino , Humanos , Masculino , Mutación , Reino Unido
5.
Hum Mutat ; 42(8): 969-977, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-34005834

RESUMEN

Biobanks with exomes linked to electronic health records (EHRs) enable the study of genetic pleiotropy between rare variants and seemingly disparate diseases. We performed robust clinical phenotyping of rare, putatively deleterious variants (loss-of-function [LoF] and deleterious missense variants) in ERCC6, a gene implicated in inherited retinal disease. We analyzed 213,084 exomes, along with a targeted set of retinal, cardiac, and immune phenotypes from two large-scale EHR-linked biobanks. In the primary analysis, a burden of deleterious variants in ERCC6 was strongly associated with (1) retinal disorders; (2) cardiac and electrocardiogram perturbations; and (3) immunodeficiency and decreased immunoglobulin levels. Meta-analysis of results from the BioMe Biobank and UK Biobank showed a significant association of deleterious ERCC6 burden with retinal dystrophy (odds ratio [OR] = 2.6, 95% confidence interval [CI]: 1.5-4.6; p = 8.7 × 10-4 ), atypical atrial flutter (OR = 3.5, 95% CI: 1.9-6.5; p = 6.2 × 10-5 ), arrhythmia (OR = 1.5, 95% CI: 1.2-2.0; p = 2.7 × 10-3 ), and lymphocyte immunodeficiency (OR = 3.8, 95% CI: 2.1-6.8; p = 5.0 × 10-6 ). Carriers of ERCC6 LoF variants who lacked a diagnosis of these conditions exhibited increased symptoms, indicating underdiagnosis. These results reveal a unique genetic link among retinal, cardiac, and immune disorders and underscore the value of EHR-linked biobanks in assessing the full clinical profile of carriers of rare variants.


Asunto(s)
Pleiotropía Genética , Distrofias Retinianas , Arritmias Cardíacas , ADN Helicasas , Enzimas Reparadoras del ADN , Exoma , Humanos , Proteínas de Unión a Poli-ADP-Ribosa , Distrofias Retinianas/genética , Secuenciación del Exoma/métodos
6.
Ann Hum Genet ; 84(3): 280-290, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-31834638

RESUMEN

Most genome-wide association studies used genetic-model-based tests assuming an additive mode of inheritance, leading to underpowered association tests in case of departure from additivity. The general regression model (GRM) association test proposed by Fisher and Wilson in 1980 makes no assumption on the genetic model. Interestingly, it also allows formal testing of the underlying genetic model. We conducted a simulation study of quantitative traits to compare the power of the GRM test to the classical linear regression tests, the maximum of the three statistics (MAX), and the allele-based (allelic) tests. Simulations were performed on two samples sizes, using a large panel of genetic models, varying genetic models, minor allele frequencies, and the percentage of explained variance. In case of departure from additivity, the GRM was more powerful than the additive regression tests (power gain reaching 80%) and had similar power when the true model is additive. GRM was also as or more powerful than the MAX or allelic tests. The true simulated model was mostly retained by the GRM test. Application of GRM to HbA1c illustrates its gain in power. To conclude, GRM increases power to detect association for quantitative traits, allows determining the genetic model and is easily applicable.


Asunto(s)
Estudio de Asociación del Genoma Completo , Modelos Genéticos , Alelos , Simulación por Computador , Frecuencia de los Genes , Hemoglobina Glucada/genética , Humanos , Modelos Lineales , Sitios de Carácter Cuantitativo
7.
Bioinformatics ; 34(16): 2773-2780, 2018 08 15.
Artículo en Inglés | MEDLINE | ID: mdl-29547902

RESUMEN

Motivation: Large scale genome-wide association studies (GWAS) are tools of choice for discovering associations between genotypes and phenotypes. To date, many studies rely on univariate statistical tests for association between the phenotype and each assayed single nucleotide polymorphism (SNP). However, interaction between SNPs, namely epistasis, must be considered when tackling the complexity of underlying biological mechanisms. Epistasis analysis at large scale entails a prohibitive computational burden when addressing the detection of more than two interacting SNPs. In this paper, we introduce a stochastic causal graph-based method, SMMB, to analyze epistatic patterns in GWAS data. Results: We present Stochastic Multiple Markov Blanket algorithm (SMMB), which combines both ensemble stochastic strategy inspired from random forests and Bayesian Markov blanket-based methods. We compared SMMB with three other recent algorithms using both simulated and real datasets. Our method outperforms the other compared methods for a majority of simulated cases of 2-way and 3-way epistasis patterns (especially in scenarii where minor allele frequencies of causal SNPs are low). Our approach performs similarly as two other compared methods for large real datasets, in terms of power, and runs faster. Availability and implementation: Parallel version available on https://ls2n.fr/listelogicielsequipe/DUKe/128/. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Epistasis Genética , Estudio de Asociación del Genoma Completo/métodos , Polimorfismo de Nucleótido Simple , Teorema de Bayes , Humanos
8.
Diabetologia ; 57(8): 1601-10, 2014 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-24893864

RESUMEN

AIMS/HYPOTHESIS: Genome-wide association studies have firmly established 65 independent European-derived loci associated with type 2 diabetes and 36 loci contributing to variations in fasting plasma glucose (FPG). Using individual data from the Data from an Epidemiological Study on the Insulin Resistance Syndrome (DESIR) prospective study, we evaluated the contribution of three genetic risk scores (GRS) to variations in metabolic traits, and to the incidence and prevalence of impaired fasting glycaemia (IFG) and type 2 diabetes. METHODS: Three GRS (GRS-1, 65 type 2 diabetes-associated single nucleotide polymorphisms [SNPs]; GRS-2, GRS-1 combined with 24 FPG-raising SNPs; and GRS-3, FPG-raising SNPs alone) were analysed in 4,075 DESIR study participants. GRS-mediated effects on longitudinal variations in quantitative traits were assessed in 3,927 nondiabetic individuals using multivariate linear mixed models, and on the incidence and prevalence of hyperglycaemia at 9 years using Cox and logistic regression models. The contribution of each GRS to risk prediction was evaluated using the C-statistic and net reclassification improvement (NRI) analysis. RESULTS: The two most inclusive GRS were significantly associated with increased FPG (ß = 0.0011 mmol/l per year per risk allele, p GRS-1 = 8.2 × 10(-5) and p GRS-2 = 6.0 × 10(-6)), increased incidence of IFG and type 2 diabetes (per allele: HR GRS-1 1.03, p = 4.3 × 10(-9) and HR GRS-2 1.04, p = 1.0 × 10(-16)), and the 9 year prevalence (OR GRS-1 1.13 [95% CI 1.10, 1.17], p = 1.9 × 10(-14) for type 2 diabetes only; OR GRS-2 1.07 [95% CI 1.05, 1.08], p = 7.8 × 10(-25), for IFG and type 2 diabetes). No significant interaction was found between GRS-1 or GRS-2 and potential confounding factors. Each GRS yielded a modest, but significant, improvement in overall reclassification rates (NRI GRS-1 17.3%, p = 6.6 × 10(-7); NRI GRS-2 17.6%, p = 4.2 × 10(-7); NRI GRS-3 13.1%, p = 1.7 × 10(-4)). CONCLUSIONS/INTERPRETATION: Polygenic scores based on combined genetic information from type 2 diabetes risk and FPG variation contribute to discriminating middle-aged individuals at risk of developing type 2 diabetes in a general population.


Asunto(s)
Glucemia/análisis , Diabetes Mellitus Tipo 2/genética , Predisposición Genética a la Enfermedad , Homeostasis/genética , Polimorfismo de Nucleótido Simple , Adulto , Anciano , Alelos , Diabetes Mellitus Tipo 2/sangre , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Estudios Prospectivos , Población Blanca/genética
9.
Hepatol Commun ; 8(9)2024 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-39185915

RESUMEN

BACKGROUND: Liver fibrosis is a critical public health concern, necessitating early detection to prevent progression. This study evaluates the recently developed LiverRisk score and steatosis-associated Fibrosis Estimator (SAFE) score against established indices for prognostication and/or fibrosis prediction in 4diverse cohorts, including participants with metabolic dysfunction-associated steatotic liver disease (MASLD). METHODS: We used data from the Mount Sinai Data Warehouse (32,828 participants without liver disease diagnoses), the Mount Sinai MASLD/MASH Longitudinal Registry (422 participants with MASLD), and National Health and Nutrition Examination Survey 2017-2020 (4133 participants representing the general population) to compare LiverRisk score, FIB-4 index, APRI, and SAFE score. Analyses included Cox proportional hazards regressions, Kaplan-Meier estimates, and classification metrics to evaluate performance in prognostication and fibrosis prediction. RESULTS: In Mount Sinai Data Warehouse, LiverRisk score was significantly associated with future liver-related outcomes but did not significantly outperform FIB-4 or APRI for predicting any of the outcomes. In the general population, LiverRisk score and SAFE score outperformed FIB-4 and APRI in identifying fibrosis, but LiverRisk score underperformed among participants who were non-White or had type 2 diabetes. Among participants with MASLD, SAFE score outperformed FIB-4 and APRI in 1 of 2 cohorts, but there were generally few significant performance differences between all 4 scores. CONCLUSIONS: LiverRisk score does not consistently outperform existing predictors in diverse populations, and further validation is needed before adoption in settings with significant differences from the original derivation cohorts. It remains necessary to replicate the ability of these scores to predict liver-specific mortality, as well as to develop diagnostic tools for liver fibrosis that are accessible and substantially better than current scores, especially among patients with MASLD and other chronic liver conditions.


Asunto(s)
Cirrosis Hepática , Encuestas Nutricionales , Sistema de Registros , Humanos , Cirrosis Hepática/sangre , Cirrosis Hepática/patología , Masculino , Femenino , Persona de Mediana Edad , Adulto , Pronóstico , Índice de Severidad de la Enfermedad , Anciano , Estados Unidos/epidemiología , Biomarcadores/sangre , Hígado Graso/patología
10.
JACC Adv ; 3(4)2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38737007

RESUMEN

BACKGROUND: Diet is a key modifiable risk factor of coronary artery disease (CAD). However, the causal effects of specific dietary traits on CAD risk remain unclear. With the expansion of dietary data in population biobanks, Mendelian randomization (MR) could help enable the efficient estimation of causality in diet-disease associations. OBJECTIVES: The primary goal was to test causality for 13 common dietary traits on CAD risk using a systematic 2-sample MR framework. A secondary goal was to identify plasma metabolites mediating diet-CAD associations suspected to be causal. METHODS: Cross-sectional genetic and dietary data on up to 420,531 UK Biobank and 184,305 CARDIoGRAMplusC4D individuals of European ancestry were used in 2-sample MR. The primary analysis used fixed effect inverse-variance weighted regression, while sensitivity analyses used weighted median estimation, MR-Egger regression, and MR-Pleiotropy Residual Sum and Outlier. RESULTS: Genetic variants serving as proxies for muesli intake were negatively associated with CAD risk (OR: 0.74; 95% CI: 0.65-0.84; P = 5.385 × 10-4). Sensitivity analyses using weighted median estimation supported this with a significant association in the same direction. Additionally, we identified higher plasma acetate levels as a potential mediator (OR: 0.03; 95% CI: 0.01-0.12; P = 1.15 × 10-4). CONCLUSIONS: Muesli, a mixture of oats, seeds, nuts, dried fruit, and milk, may causally reduce CAD risk. Circulating levels of acetate, a gut microbiota-derived short-chain fatty acid, could be mediating its cardioprotective effects. These findings highlight the role of gut flora in cardiovascular health and help prioritize randomized trials on dietary interventions for CAD.

11.
Nat Commun ; 15(1): 8891, 2024 Oct 15.
Artículo en Inglés | MEDLINE | ID: mdl-39406732

RESUMEN

Identifying genetic drivers of chronic diseases is necessary for drug discovery. Here, we develop a machine learning-assisted genetic priority score, which we call ML-GPS, that incorporates genetic associations with predicted disease phenotypes to enhance target discovery. First, we construct gradient boosting models to predict 112 chronic disease phecodes in the UK Biobank and analyze associations of predicted and observed phenotypes with common, rare, and ultra-rare variants to model the allelic series. We integrate these associations with existing evidence using gradient boosting with continuous feature encoding to construct ML-GPS, training it to predict drug indications in Open Targets and externally testing it in SIDER. We then generate ML-GPS predictions for 2,362,636 gene-phecode pairs. We find that the use of predicted phenotypes, which identify substantially more genetic associations than observed phenotypes across the allele frequency spectrum, significantly improves the performance of ML-GPS. ML-GPS increases coverage of drug targets, with the top 1% of all scores providing support for 15,077 gene-phecode pairs that previously had no support. ML-GPS can also identify well-known target-disease relationships, promising targets without indicated drugs, and targets for several drugs in clinical trials, including LRRK2 inhibitors for Parkinson's disease and olpasiran for cardiovascular disease.


Asunto(s)
Descubrimiento de Drogas , Aprendizaje Automático , Fenotipo , Humanos , Enfermedad Crónica/tratamiento farmacológico , Descubrimiento de Drogas/métodos , Frecuencia de los Genes , Predisposición Genética a la Enfermedad
12.
Nat Genet ; 56(7): 1412-1419, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38862854

RESUMEN

Coronary artery disease (CAD) exists on a spectrum of disease represented by a combination of risk factors and pathogenic processes. An in silico score for CAD built using machine learning and clinical data in electronic health records captures disease progression, severity and underdiagnosis on this spectrum and could enhance genetic discovery efforts for CAD. Here we tested associations of rare and ultrarare coding variants with the in silico score for CAD in the UK Biobank, All of Us Research Program and BioMe Biobank. We identified associations in 17 genes; of these, 14 show at least moderate levels of prior genetic, biological and/or clinical support for CAD. We also observed an excess of ultrarare coding variants in 321 aggregated CAD genes, suggesting more ultrarare variant associations await discovery. These results expand our understanding of the genetic etiology of CAD and illustrate how digital markers can enhance genetic association investigations for complex diseases.


Asunto(s)
Enfermedad de la Arteria Coronaria , Predisposición Genética a la Enfermedad , Aprendizaje Automático , Enfermedad de la Arteria Coronaria/genética , Humanos , Exoma/genética , Secuenciación del Exoma/métodos , Variación Genética , Estudio de Asociación del Genoma Completo/métodos , Femenino , Polimorfismo de Nucleótido Simple
13.
Am J Ophthalmol ; 267: 204-212, 2024 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-38906208

RESUMEN

PURPOSE: Polygenic risk scores (PRSs) likely predict risk and prognosis of glaucoma. We compared the PRS performance for primary open-angle glaucoma (POAG), defined using International Classification of Diseases (ICD) codes vs manual medical record review. DESIGN: Retrospective cohort study. METHODS: We identified POAG cases in the Mount Sinai BioMe and Mass General Brigham (MGB) biobanks using ICD codes. We confirmed POAG based on optical coherence tomograms and visual fields. In a separate 5% sample, the absence of POAG was confirmed with intraocular pressure and cup-disc ratio criteria. We used genotype data and either self-reported glaucoma diagnoses or ICD-10 codes for glaucoma diagnoses from the UK Biobank and the lassosum method to compute a genome-wide POAG PRS. We compared the area under the curve (AUC) for POAG prediction based on ICD codes vs medical records. RESULTS: We reviewed 804 of 996 BioMe and 367 of 1006 MGB ICD-identified cases. In BioMe and MGB, respectively, positive predictive value was 53% and 55%; negative predictive value was 96% and 97%; sensitivity was 97% and 97%; and specificity was 44% and 53%. Adjusted PRS AUCs for POAG using ICD codes vs manual record review in BioMe were not statistically different (P ≥.21) by ancestry: 0.77 vs 0.75 for African, 0.80 vs 0.80 for Hispanic, and 0.81 vs 0.81 for European. Results were similar in MGB (P ≥.18): 0.72 vs 0.80 for African, 0.83 vs 0.86 for Hispanic, and 0.74 vs 0.73 for European. CONCLUSIONS: A POAG PRS performed similarly using either manual review or ICD codes in 2 electronic health record-linked biobanks; manual assessment of glaucoma status might not be necessary for some PRS studies. However, caution should be exercised when using ICD codes for glaucoma diagnosis given their low specificity (44%-53%) for manually confirmed cases of glaucoma.


Asunto(s)
Registros Electrónicos de Salud , Glaucoma de Ángulo Abierto , Presión Intraocular , Humanos , Glaucoma de Ángulo Abierto/genética , Glaucoma de Ángulo Abierto/diagnóstico , Estudios Retrospectivos , Masculino , Femenino , Presión Intraocular/fisiología , Anciano , Persona de Mediana Edad , Bancos de Muestras Biológicas , Factores de Riesgo , Clasificación Internacional de Enfermedades , Campos Visuales/fisiología , Herencia Multifactorial , Área Bajo la Curva , Tomografía de Coherencia Óptica , Estudio de Asociación del Genoma Completo , Medición de Riesgo/métodos , Curva ROC , Valor Predictivo de las Pruebas , Puntuación de Riesgo Genético
14.
Nat Genet ; 56(1): 51-59, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38172303

RESUMEN

Studies have shown that drug targets with human genetic support are more likely to succeed in clinical trials. Hence, a tool integrating genetic evidence to prioritize drug target genes is beneficial for drug discovery. We built a genetic priority score (GPS) by integrating eight genetic features with drug indications from the Open Targets and SIDER databases. The top 0.83%, 0.28% and 0.19% of the GPS conferred a 5.3-, 9.9- and 11.0-fold increased effect of having an indication, respectively. In addition, we observed that targets in the top 0.28% of the score were 1.7-, 3.7- and 8.8-fold more likely to advance from phase I to phases II, III and IV, respectively. Complementary to the GPS, we incorporated the direction of genetic effect and drug mechanism into a directional version of the score called the GPS with direction of effect. We applied our method to 19,365 protein-coding genes and 399 drug indications and made all results available through a web portal.


Asunto(s)
Genética Humana , Farmacogenética , Humanos , Descubrimiento de Drogas
15.
Nat Commun ; 15(1): 8741, 2024 Oct 09.
Artículo en Inglés | MEDLINE | ID: mdl-39384761

RESUMEN

Whole genome sequences (WGS) enable discovery of rare variants which may contribute to missing heritability of coronary artery disease (CAD). To measure their contribution, we apply the GREML-LDMS-I approach to WGS of 4949 cases and 17,494 controls of European ancestry from the NHLBI TOPMed program. We estimate CAD heritability at 34.3% assuming a prevalence of 8.2%. Ultra-rare (minor allele frequency ≤ 0.1%) variants with low linkage disequilibrium (LD) score contribute ~50% of the heritability. We also investigate CAD heritability enrichment using a diverse set of functional annotations: i) constraint; ii) predicted protein-altering impact; iii) cis-regulatory elements from a cell-specific chromatin atlas of the human coronary; and iv) annotation principal components representing a wide range of functional processes. We observe marked enrichment of CAD heritability for most functional annotations. These results reveal the predominant role of ultra-rare variants in low LD on the heritability of CAD. Moreover, they highlight several functional processes including cell type-specific regulatory mechanisms as key drivers of CAD genetic risk.


Asunto(s)
Enfermedad de la Arteria Coronaria , Predisposición Genética a la Enfermedad , Desequilibrio de Ligamiento , Polimorfismo de Nucleótido Simple , Humanos , Enfermedad de la Arteria Coronaria/genética , Masculino , Femenino , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Población Blanca/genética , Estudios de Casos y Controles , Secuenciación Completa del Genoma , Variación Genética , Persona de Mediana Edad
16.
Neuroendocrinology ; 97(2): 146-59, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-22538389

RESUMEN

Dietary interventions involving caloric restriction represent a powerful strategy to prevent or delay age-related deteriorations and diseases. Their beneficial effects have been observed in several tissues and species. This microarray study investigated the effects of aging, long-term moderate caloric restriction (LTMCR) and long-term dietary soy on the regulation of gene expression in the anterior pituitary and hypothalamus of 20-month-old Sprague-Dawley rats. In both tissues, aging regulated genes mainly involved in cell defense and repair mechanisms related to apoptosis, DNA repair, cellular stress, inflammatory and immune response. In the aging pituitary, the highest upregulated gene was the regenerating islet-derived 3ß (5.77-fold), coding for a secretory protein involved in acute stress and inflammation. A protective effect of LTMCR on age-related change of gene expression was observed for 35 pituitary genes. In addition, beneficial effects of LTMCR in the pituitary were observed on new regulated genes mainly involved in cell death and cell stress response. In the hypothalamus, the effects of LTMCR on age-related changes were modest. Finally, changing the quality of dietary protein (20% casein for soy) had a low impact on the regulation of mRNA levels in both tissues. Genes associated with the somatotroph function were also differentially expressed in the aging pituitary. Interestingly, LTMCR prevented the effect of aging on insulin-like growth factor-binding protein-3 gene. Altogether, this study proposes novel pituitary and hypothalamic molecular targets and signaling pathways to help in understanding the mechanisms involved in aging processes and LTMCR.


Asunto(s)
Envejecimiento/fisiología , Dieta , Hipotálamo/metabolismo , Adenohipófisis/metabolismo , Transcriptoma , Envejecimiento/sangre , Envejecimiento/genética , Envejecimiento/metabolismo , Animales , Restricción Calórica , Perfilación de la Expresión Génica , Hormonas/sangre , Hipotálamo/química , Masculino , Análisis por Micromatrices , Adenohipófisis/química , Ratas , Ratas Sprague-Dawley , Alimentos de Soja , Transcriptoma/genética , Transcriptoma/fisiología
17.
Nature ; 445(7130): 881-5, 2007 Feb 22.
Artículo en Inglés | MEDLINE | ID: mdl-17293876

RESUMEN

Type 2 diabetes mellitus results from the interaction of environmental factors with a combination of genetic variants, most of which were hitherto unknown. A systematic search for these variants was recently made possible by the development of high-density arrays that permit the genotyping of hundreds of thousands of polymorphisms. We tested 392,935 single-nucleotide polymorphisms in a French case-control cohort. Markers with the most significant difference in genotype frequencies between cases of type 2 diabetes and controls were fast-tracked for testing in a second cohort. This identified four loci containing variants that confer type 2 diabetes risk, in addition to confirming the known association with the TCF7L2 gene. These loci include a non-synonymous polymorphism in the zinc transporter SLC30A8, which is expressed exclusively in insulin-producing beta-cells, and two linkage disequilibrium blocks that contain genes potentially involved in beta-cell development or function (IDE-KIF11-HHEX and EXT2-ALX4). These associations explain a substantial portion of disease risk and constitute proof of principle for the genome-wide approach to the elucidation of complex genetic traits.


Asunto(s)
Diabetes Mellitus Tipo 2/genética , Predisposición Genética a la Enfermedad/genética , Genoma Humano , Estudios de Casos y Controles , Proteínas de Transporte de Catión/genética , Cromosomas Humanos Par 10/genética , Cromosomas Humanos Par 8/genética , Francia , Humanos , Desequilibrio de Ligamiento , Transportador 8 de Zinc
18.
medRxiv ; 2023 Oct 25.
Artículo en Inglés | MEDLINE | ID: mdl-37961657

RESUMEN

Metabolic dysfunction-associated steatotic liver disease (MASLD) affects 30% of the global population but is often underdiagnosed. To fill this diagnostic gap, we developed a digital score reflecting presence and severity of MASLD. We fitted a machine learning model to electronic health records from 37,212 UK Biobank participants with proton density fat fraction measurements and/or a MASLD diagnosis to generate a "MASLD score". In holdout testing, our model achieved areas under the receiver-operating curve of 0.83-0.84 for MASLD diagnosis and 0.90-0.91 for identifying MASLD-associated advanced fibrosis. MASLD score was significantly associated with MASLD risk factors, progression to cirrhosis, and mortality. External testing in 252,725 diverse American participants demonstrated consistent results, and hepatologist chart review showed MASLD score identified probable MASLD underdiagnosis. The MASLD score could improve early diagnosis and intervention of chronic liver disease by providing a non-invasive, low-cost method for population-wide screening of MASLD.

19.
Elife ; 122023 03 29.
Artículo en Inglés | MEDLINE | ID: mdl-36988189

RESUMEN

Background: Causality between plasma triglyceride (TG) levels and atherosclerotic cardiovascular disease (ASCVD) risk remains controversial despite more than four decades of study and two recent landmark trials, STRENGTH, and REDUCE-IT. Further unclear is the association between TG levels and non-atherosclerotic diseases across organ systems. Methods: Here, we conducted a phenome-wide, two-sample Mendelian randomization (MR) analysis using inverse-variance weighted (IVW) regression to systematically infer the causal effects of plasma TG levels on 2600 disease traits in the European ancestry population of UK Biobank. For replication, we externally tested 221 nominally significant associations (p<0.05) in an independent cohort from FinnGen. To account for potential horizontal pleiotropy and the influence of invalid instrumental variables, we performed sensitivity analyses using MR-Egger regression, weighted median estimator, and MR-PRESSO. Finally, we used multivariable MR (MVMR) controlling for correlated lipid fractions to distinguish the independent effect of plasma TG levels. Results: Our results identified seven disease traits reaching Bonferroni-corrected significance in both the discovery (p<1.92 × 10-5) and replication analyses (p<2.26 × 10-4), suggesting a causal relationship between plasma TG levels and ASCVDs, including coronary artery disease (OR 1.33, 95% CI 1.24-1.43, p=2.47 × 10-13). We also identified 12 disease traits that were Bonferroni-significant in the discovery or replication analysis and at least nominally significant in the other analysis (p<0.05), identifying plasma TG levels as a novel potential risk factor for nine non-ASCVD diseases, including uterine leiomyoma (OR 1.19, 95% CI 1.10-1.29, p=1.17 × 10-5). Conclusions: Taking a phenome-wide, two-sample MR approach, we identified causal associations between plasma TG levels and 19 disease traits across organ systems. Our findings suggest unrealized drug repurposing opportunities or adverse effects related to approved and emerging TG-lowering agents, as well as mechanistic insights for future studies. Funding: RD is supported by the National Institute of General Medical Sciences of the National Institutes of Health (NIH) (R35-GM124836) and the National Heart, Lung, and Blood Institute of the NIH (R01-HL139865 and R01-HL155915).


Asunto(s)
Aterosclerosis , Enfermedad de la Arteria Coronaria , Humanos , Análisis de la Aleatorización Mendeliana , Fenotipo , Enfermedad de la Arteria Coronaria/genética , Triglicéridos , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple
20.
Nat Commun ; 14(1): 2385, 2023 04 25.
Artículo en Inglés | MEDLINE | ID: mdl-37169741

RESUMEN

Systemic autoimmune rheumatic diseases (SARDs) can lead to irreversible damage if left untreated, yet these patients often endure long diagnostic journeys before being diagnosed and treated. Machine learning may help overcome the challenges of diagnosing SARDs and inform clinical decision-making. Here, we developed and tested a machine learning model to identify patients who should receive rheumatological evaluation for SARDs using longitudinal electronic health records of 161,584 individuals from two institutions. The model demonstrated high performance for predicting cases of autoantibody-tested individuals in a validation set, an external test set, and an independent cohort with a broader case definition. This approach identified more individuals for autoantibody testing compared with current clinical standards and a greater proportion of autoantibody carriers among those tested. Diagnoses of SARDs and other autoimmune conditions increased with higher model probabilities. The model detected a need for autoantibody testing and rheumatology encounters up to five years before the test date and assessment date, respectively. Altogether, these findings illustrate that the clinical manifestations of a diverse array of autoimmune conditions are detectable in electronic health records using machine learning, which may help systematize and accelerate autoimmune testing.


Asunto(s)
Enfermedades Autoinmunes , Registros Electrónicos de Salud , Humanos , Enfermedades Autoinmunes/diagnóstico , Pacientes , Autoanticuerpos , Aprendizaje Automático
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA