RESUMEN
Myeloproliferative neoplasms (MPNs) are blood cancers that are characterized by the excessive production of mature myeloid cells and arise from the acquisition of somatic driver mutations in haematopoietic stem cells (HSCs). Epidemiological studies indicate a substantial heritable component of MPNs that is among the highest known for cancers1. However, only a limited number of genetic risk loci have been identified, and the underlying biological mechanisms that lead to the acquisition of MPNs remain unclear. Here, by conducting a large-scale genome-wide association study (3,797 cases and 1,152,977 controls), we identify 17 MPN risk loci (P < 5.0 × 10-8), 7 of which have not been previously reported. We find that there is a shared genetic architecture between MPN risk and several haematopoietic traits from distinct lineages; that there is an enrichment for MPN risk variants within accessible chromatin of HSCs; and that increased MPN risk is associated with longer telomere length in leukocytes and other clonal haematopoietic states-collectively suggesting that MPN risk is associated with the function and self-renewal of HSCs. We use gene mapping to identify modulators of HSC biology linked to MPN risk, and show through targeted variant-to-function assays that CHEK2 and GFI1B have roles in altering the function of HSCs to confer disease risk. Overall, our results reveal a previously unappreciated mechanism for inherited MPN risk through the modulation of HSC function.
Asunto(s)
Predisposición Genética a la Enfermedad/genética , Células Madre Hematopoyéticas/patología , Trastornos Mieloproliferativos/genética , Trastornos Mieloproliferativos/patología , Neoplasias/genética , Neoplasias/patología , Linaje de la Célula/genética , Autorrenovación de las Células , Quinasa de Punto de Control 2/genética , Femenino , Humanos , Leucocitos/patología , Masculino , Proteínas Proto-Oncogénicas/genética , Proteínas Represoras/genética , Riesgo , Homeostasis del TelómeroRESUMEN
BACKGROUND: High circulating levels of Lp(a) (lipoprotein[a]) increase the risk of atherosclerosis and calcific aortic valve disease, affecting millions of patients worldwide. Although atherosclerosis is commonly treated with low-density lipoprotein-targeting therapies, these do not reduce Lp(a) or risk of calcific aortic valve disease, which has no available drug therapies. Targeting Lp(a) production and catabolism may provide therapeutic benefit, but little is known about Lp(a) cellular uptake. METHODS: Here, unbiased ligand-receptor capture mass spectrometry was used to identify MFSD5 (major facilitator superfamily domain containing 5) as a novel receptor/cofactor involved in Lp(a) uptake. RESULTS: Reducing MFSD5 expression by a computationally identified small molecule or small interfering RNA suppressed Lp(a) uptake and calcification in primary human valvular endothelial and interstitial cells. MFSD5 variants were associated with aortic stenosis (P=0.027 after multiple hypothesis testing) with evidence suggestive of an interaction with plasma Lp(a) levels. CONCLUSIONS: MFSD5 knockdown suppressing human valvular cell Lp(a) uptake and calcification, along with meta-analysis of MFSD5 variants associating with aortic stenosis, supports further preclinical assessment of MFSD5 in cardiovascular diseases, the leading cause of death worldwide.
Asunto(s)
Enfermedad de la Válvula Aórtica , Estenosis de la Válvula Aórtica , Aterosclerosis , Calcinosis , Enfermedades de las Válvulas Cardíacas , Humanos , Válvula Aórtica/metabolismo , Enfermedad de la Válvula Aórtica/metabolismo , Estenosis de la Válvula Aórtica/tratamiento farmacológico , Estenosis de la Válvula Aórtica/genética , Aterosclerosis/metabolismo , Enfermedades de las Válvulas Cardíacas/tratamiento farmacológico , Enfermedades de las Válvulas Cardíacas/genética , Enfermedades de las Válvulas Cardíacas/complicaciones , Lipoproteína(a) , Factores de RiesgoRESUMEN
Polygenic risk scores (PRSs) quantify the contribution of multiple genetic loci to an individual's likelihood of a complex trait or disease. However, existing PRSs estimate this likelihood with common genetic variants, excluding the impact of rare variants. Here, we report on a method to identify rare variants associated with outlier gene expression and integrate their impact into PRS predictions for body mass index (BMI), obesity, and bariatric surgery. Between the top and bottom 10%, we observed a 20.8% increase in risk for obesity (p = 3 × 10-14), 62.3% increase in risk for severe obesity (p = 1 × 10-6), and median 5.29 years earlier onset for bariatric surgery (p = 0.008), as a function of expression outlier-associated rare variant burden when controlling for common variant PRS. We show that these predictions were more significant than integrating the effects of rare protein-truncating variants (PTVs), observing a mean 19% increase in phenotypic variance explained with expression outlier-associated rare variants when compared with PTVs (p = 2 × 10-15). We replicated these findings by using data from the Million Veteran Program and demonstrated that PRSs across multiple traits and diseases can benefit from the inclusion of expression outlier-associated rare variants identified through population-scale transcriptome sequencing.
Asunto(s)
Herencia Multifactorial , Obesidad , Índice de Masa Corporal , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Herencia Multifactorial/genética , Obesidad/genética , Fenotipo , Factores de RiesgoRESUMEN
BACKGROUND: Height has been associated with many clinical traits but whether such associations are causal versus secondary to confounding remains unclear in many cases. To systematically examine this question, we performed a Mendelian Randomization-Phenome-wide association study (MR-PheWAS) using clinical and genetic data from a national healthcare system biobank. METHODS AND FINDINGS: Analyses were performed using data from the US Veterans Affairs (VA) Million Veteran Program in non-Hispanic White (EA, n = 222,300) and non-Hispanic Black (AA, n = 58,151) adults in the US. We estimated height genetic risk based on 3290 height-associated variants from a recent European-ancestry genome-wide meta-analysis. We compared associations of measured and genetically-predicted height with phenome-wide traits derived from the VA electronic health record, adjusting for age, sex, and genetic principal components. We found 345 clinical traits associated with measured height in EA and an additional 17 in AA. Of these, 127 were associated with genetically-predicted height at phenome-wide significance in EA and 2 in AA. These associations were largely independent from body mass index. We confirmed several previously described MR associations between height and cardiovascular disease traits such as hypertension, hyperlipidemia, coronary heart disease (CHD), and atrial fibrillation, and further uncovered MR associations with venous circulatory disorders and peripheral neuropathy in the presence and absence of diabetes. As a number of traits associated with genetically-predicted height frequently co-occur with CHD, we evaluated effect modification by CHD status of genetically-predicted height associations with risk factors for and complications of CHD. We found modification of effects of MR associations by CHD status for atrial fibrillation/flutter but not for hypertension, hyperlipidemia, or venous circulatory disorders. CONCLUSIONS: We conclude that height may be an unrecognized but biologically plausible risk factor for several common conditions in adults. However, more studies are needed to reliably exclude horizontal pleiotropy as a driving force behind at least some of the MR associations observed in this study.
Asunto(s)
Fibrilación Atrial , Hipertensión , Veteranos , Adulto , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Hipertensión/epidemiología , Hipertensión/genética , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
BACKGROUND: Calcific aortic stenosis (CAS) is the most common valvular heart disease in older adults and has no effective preventive therapies. Genome-wide association studies (GWAS) can identify genes influencing disease and may help prioritize therapeutic targets for CAS. METHODS: We performed a GWAS and gene association study of 14 451 patients with CAS and 398 544 controls in the Million Veteran Program. Replication was performed in the Million Veteran Program, Penn Medicine Biobank, Mass General Brigham Biobank, BioVU, and BioMe, totaling 12 889 cases and 348 094 controls. Causal genes were prioritized from genome-wide significant variants using polygenic priority score gene localization, expression quantitative trait locus colocalization, and nearest gene methods. CAS genetic architecture was compared with that of atherosclerotic cardiovascular disease. Causal inference for cardiometabolic biomarkers in CAS was performed using Mendelian randomization and genome-wide significant loci were characterized further through phenome-wide association study. RESULTS: We identified 23 genome-wide significant lead variants in our GWAS representing 17 unique genomic regions. Of the 23 lead variants, 14 were significant in replication, representing 11 unique genomic regions. Five replicated genomic regions were previously known risk loci for CAS (PALMD, TEX41, IL6, LPA, FADS) and 6 were novel (CEP85L, FTO, SLMAP, CELSR2, MECOM, CDAN1). Two novel lead variants were associated in non-White individuals (P<0.05): rs12740374 (CELSR2) in Black and Hispanic individuals and rs1522387 (SLMAP) in Black individuals. Of the 14 replicated lead variants, only 2 (rs10455872 [LPA], rs12740374 [CELSR2]) were also significant in atherosclerotic cardiovascular disease GWAS. In Mendelian randomization, lipoprotein(a) and low-density lipoprotein cholesterol were both associated with CAS, but the association between low-density lipoprotein cholesterol and CAS was attenuated when adjusting for lipoprotein(a). Phenome-wide association study highlighted varying degrees of pleiotropy, including between CAS and obesity at the FTO locus. However, the FTO locus remained associated with CAS after adjusting for body mass index and maintained a significant independent effect on CAS in mediation analysis. CONCLUSIONS: We performed a multiancestry GWAS in CAS and identified 6 novel genomic regions in the disease. Secondary analyses highlighted the roles of lipid metabolism, inflammation, cellular senescence, and adiposity in the pathobiology of CAS and clarified the shared and differential genetic architectures of CAS with atherosclerotic cardiovascular diseases.
Asunto(s)
Estenosis de la Válvula Aórtica , Veteranos , Humanos , Anciano , Estudio de Asociación del Genoma Completo/métodos , Predisposición Genética a la Enfermedad , Estenosis de la Válvula Aórtica/genética , Obesidad/genética , Factores de Transcripción/genética , Lipoproteína(a)/genética , Lipoproteínas LDL , Colesterol , Polimorfismo de Nucleótido Simple , Glicoproteínas/genética , Proteínas Nucleares/genéticaRESUMEN
Plasma levels of fibrinogen, coagulation factors VII and VIII and von Willebrand factor (vWF) are four intermediate phenotypes that are heritable and have been associated with the risk of clinical thrombotic events. To identify rare and low-frequency variants associated with these hemostatic factors, we conducted whole-exome sequencing in 10 860 individuals of European ancestry (EA) and 3529 African Americans (AAs) from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and the National Heart, Lung and Blood Institute's Exome Sequencing Project. Gene-based tests demonstrated significant associations with rare variation (minor allele frequency < 5%) in fibrinogen gamma chain (FGG) (with fibrinogen, P = 9.1 × 10-13), coagulation factor VII (F7) (with factor VII, P = 1.3 × 10-72; seven novel variants) and VWF (with factor VIII and vWF; P = 3.2 × 10-14; one novel variant). These eight novel rare variant associations were independent of the known common variants at these loci and tended to have much larger effect sizes. In addition, one of the rare novel variants in F7 was significantly associated with an increased risk of venous thromboembolism in AAs (Ile200Ser; rs141219108; P = 4.2 × 10-5). After restricting gene-based analyses to only loss-of-function variants, a novel significant association was detected and replicated between factor VIII levels and a stop-gain mutation exclusive to AAs (rs3211938) in CD36 molecule (CD36). This variant has previously been linked to dyslipidemia but not with the levels of a hemostatic factor. These efforts represent the largest integration of whole-exome sequence data from two national projects to identify genetic variation associated with plasma hemostatic factors.
Asunto(s)
Factor VIII , Hemostáticos , Factor VII/genética , Factor VIII/genética , Fibrinógeno/genética , Humanos , Polimorfismo de Nucleótido Simple/genética , Secuenciación del Exoma , Factor de von Willebrand/análisis , Factor de von Willebrand/genéticaRESUMEN
Carotid intima media thickness (cIMT) is a biomarker of subclinical atherosclerosis and a predictor of future cardiovascular events. Identifying associations between gene expression levels and cIMT may provide insight to atherosclerosis etiology. Here, we use two approaches to identify associations between mRNA levels and cIMT: differential gene expression analysis in whole blood and S-PrediXcan. We used microarrays to measure genome-wide whole blood mRNA levels of 5647 European individuals from four studies. We examined the association of mRNA levels with cIMT adjusted for various potential confounders. Significant associations were tested for replication in three studies totaling 3943 participants. Next, we applied S-PrediXcan to summary statistics from a cIMT genome-wide association study (GWAS) of 71 128 individuals to estimate the association between genetically determined mRNA levels and cIMT and replicated these analyses using S-PrediXcan on an independent GWAS on cIMT that included 22 179 individuals from the UK Biobank. mRNA levels of TNFAIP3, CEBPD and METRNL were inversely associated with cIMT, but these associations were not significant in the replication analysis. S-PrediXcan identified associations between cIMT and genetically determined mRNA levels for 36 genes, of which six were significant in the replication analysis, including TLN2, which had not been previously reported for cIMT. There was weak correlation between our results using differential gene expression analysis and S-PrediXcan. Differential expression analysis and S-PrediXcan represent complementary approaches for the discovery of associations between phenotypes and gene expression. Using these approaches, we prioritize TNFAIP3, CEBPD, METRNL and TLN2 as new candidate genes whose differential expression might modulate cIMT.
Asunto(s)
Aterosclerosis , Grosor Intima-Media Carotídeo , Expresión Génica , Estudio de Asociación del Genoma Completo , Humanos , Factores de RiesgoRESUMEN
Progressive dilation of the infrarenal aortic diameter is a consequence of the ageing process and is considered the main determinant of abdominal aortic aneurysm (AAA). We aimed to investigate the genetic and clinical determinants of abdominal aortic diameter (AAD). We conducted a meta-analysis of genome-wide association studies in 10 cohorts (n = 13 542) imputed to the 1000 Genome Project reference panel including 12 815 subjects in the discovery phase and 727 subjects [Partners Biobank cohort 1 (PBIO)] as replication. Maximum anterior-posterior diameter of the infrarenal aorta was used as AAD. We also included exome array data (n = 14 480) from seven epidemiologic studies. Single-variant and gene-based associations were done using SeqMeta package. A Mendelian randomization analysis was applied to investigate the causal effect of a number of clinical risk factors on AAD. In genome-wide association study (GWAS) on AAD, rs74448815 in the intronic region of LDLRAD4 reached genome-wide significance (beta = -0.02, SE = 0.004, P-value = 2.10 × 10-8). The association replicated in the PBIO1 cohort (P-value = 8.19 × 10-4). In exome-array single-variant analysis (P-value threshold = 9 × 10-7), the lowest P-value was found for rs239259 located in SLC22A20 (beta = 0.007, P-value = 1.2 × 10-5). In the gene-based analysis (P-value threshold = 1.85 × 10-6), PCSK5 showed an association with AAD (P-value = 8.03 × 10-7). Furthermore, in Mendelian randomization analyses, we found evidence for genetic association of pulse pressure (beta = -0.003, P-value = 0.02), triglycerides (beta = -0.16, P-value = 0.008) and height (beta = 0.03, P-value < 0.0001), known risk factors for AAA, consistent with a causal association with AAD. Our findings point to new biology as well as highlighting gene regions in mechanisms that have previously been implicated in the genetics of other vascular diseases.
Asunto(s)
Estudio de Asociación del Genoma Completo , Análisis de la Aleatorización Mendeliana , Exoma/genética , Humanos , Polimorfismo de Nucleótido Simple/genética , TriglicéridosRESUMEN
AIMS: Although highly heritable, the genetic etiology of calcific aortic stenosis (AS) remains incompletely understood. The aim of this study was to discover novel genetic contributors to AS and to integrate functional, expression, and cross-phenotype data to identify mechanisms of AS. METHODS AND RESULTS: A genome-wide meta-analysis of 11.6 million variants in 10 cohorts involving 653 867 European ancestry participants (13 765 cases) was performed. Seventeen loci were associated with AS at P ≤ 5 × 10-8, of which 15 replicated in an independent cohort of 90 828 participants (7111 cases), including CELSR2-SORT1, NLRP6, and SMC2. A genetic risk score comprised of the index variants was associated with AS [odds ratio (OR) per standard deviation, 1.31; 95% confidence interval (CI), 1.26-1.35; P = 2.7 × 10-51] and aortic valve calcium (OR per standard deviation, 1.22; 95% CI, 1.08-1.37; P = 1.4 × 10-3), after adjustment for known risk factors. A phenome-wide association study indicated multiple associations with coronary artery disease, apolipoprotein B, and triglycerides. Mendelian randomization supported a causal role for apolipoprotein B-containing lipoprotein particles in AS (OR per g/L of apolipoprotein B, 3.85; 95% CI, 2.90-5.12; P = 2.1 × 10-20) and replicated previous findings of causality for lipoprotein(a) (OR per natural logarithm, 1.20; 95% CI, 1.17-1.23; P = 4.8 × 10-73) and body mass index (OR per kg/m2, 1.07; 95% CI, 1.05-1.9; P = 1.9 × 10-12). Colocalization analyses using the GTEx database identified a role for differential expression of the genes LPA, SORT1, ACTR2, NOTCH4, IL6R, and FADS. CONCLUSION: Dyslipidemia, inflammation, calcification, and adiposity play important roles in the etiology of AS, implicating novel treatments and prevention strategies.
Asunto(s)
Estenosis de la Válvula Aórtica , Dislipidemias , Humanos , Estudio de Asociación del Genoma Completo/métodos , Adiposidad/genética , Predisposición Genética a la Enfermedad , Estenosis de la Válvula Aórtica/genética , Obesidad , Factores de Riesgo , Inflamación , Dislipidemias/complicaciones , Dislipidemias/genética , Apolipoproteínas/genética , Análisis de la Aleatorización Mendeliana , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
Cardiovascular disease is the leading contributor to years lost due to disability or premature death among adults. Current efforts focus on risk prediction and risk factor mitigation' which have been recognized for the past half-century. However, despite advances, risk prediction remains imprecise with persistently high rates of incident cardiovascular disease. Genetic characterization has been proposed as an approach to enable earlier and potentially tailored prevention. Rare mendelian pathogenic variants predisposing to cardiometabolic conditions have long been known to contribute to disease risk in some families. However, twin and familial aggregation studies imply that diverse cardiovascular conditions are heritable in the general population. Significant technological and methodological advances since the Human Genome Project are facilitating population-based comprehensive genetic profiling at decreasing costs. Genome-wide association studies from such endeavors continue to elucidate causal mechanisms for cardiovascular diseases. Systematic cataloging for cardiovascular risk alleles also enabled the development of polygenic risk scores. Genetic profiling is becoming widespread in large-scale research, including in health care-associated biobanks, randomized controlled trials, and direct-to-consumer profiling in tens of millions of people. Thus, individuals and their physicians are increasingly presented with polygenic risk scores for cardiovascular conditions in clinical encounters. In this scientific statement, we review the contemporary science, clinical considerations, and future challenges for polygenic risk scores for cardiovascular diseases. We selected 5 cardiometabolic diseases (coronary artery disease, hypercholesterolemia, type 2 diabetes, atrial fibrillation, and venous thromboembolic disease) and response to drug therapy and offer provisional guidance to health care professionals, researchers, policymakers, and patients.
Asunto(s)
Enfermedades Cardiovasculares , Diabetes Mellitus Tipo 2 , Adulto , American Heart Association , Enfermedades Cardiovasculares/diagnóstico , Enfermedades Cardiovasculares/epidemiología , Enfermedades Cardiovasculares/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Herencia Multifactorial/genética , Factores de RiesgoRESUMEN
BACKGROUND: Venous thromboembolism (VTE) is a life-threatening vascular event with environmental and genetic determinants. Recent VTE genome-wide association studies (GWAS) meta-analyses involved nearly 30 000 VTE cases and identified up to 40 genetic loci associated with VTE risk, including loci not previously suspected to play a role in hemostasis. The aim of our research was to expand discovery of new genetic loci associated with VTE by using cross-ancestry genomic resources. METHODS: We present new cross-ancestry meta-analyzed GWAS results involving up to 81 669 VTE cases from 30 studies, with replication of novel loci in independent populations and loci characterization through in silico genomic interrogations. RESULTS: In our genetic discovery effort that included 55 330 participants with VTE (47 822 European, 6320 African, and 1188 Hispanic ancestry), we identified 48 novel associations, of which 34 were replicated after correction for multiple testing. In our combined discovery-replication analysis (81 669 VTE participants) and ancestry-stratified meta-analyses (European, African, and Hispanic), we identified another 44 novel associations, which are new candidate VTE-associated loci requiring replication. In total, across all GWAS meta-analyses, we identified 135 independent genomic loci significantly associated with VTE risk. A genetic risk score of the significantly associated loci in Europeans identified a 6-fold increase in risk for those in the top 1% of scores compared with those with average scores. We also identified 31 novel transcript associations in transcriptome-wide association studies and 8 novel candidate genes with protein quantitative-trait locus Mendelian randomization analyses. In silico interrogations of hemostasis and hematology traits and a large phenome-wide association analysis of the 135 GWAS loci provided insights to biological pathways contributing to VTE, with some loci contributing to VTE through well-characterized coagulation pathways and others providing new data on the role of hematology traits, particularly platelet function. Many of the replicated loci are outside of known or currently hypothesized pathways to thrombosis. CONCLUSIONS: Our cross-ancestry GWAS meta-analyses identified new loci associated with VTE. These findings highlight new pathways to thrombosis and provide novel molecules that may be useful in the development of improved antithrombosis treatments.
Asunto(s)
Trombosis , Tromboembolia Venosa , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Genómica , Humanos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Trombosis/genética , Tromboembolia Venosa/diagnóstico , Tromboembolia Venosa/genéticaRESUMEN
Nonalcoholic fatty liver disease (NAFLD) is a leading cause of chronic liver disease and is highly correlated with metabolic disease. NAFLD results from environmental exposures acting on a susceptible polygenic background. This study performed the largest multiethnic investigation of exonic variation associated with NAFLD and correlated metabolic traits and diseases. An exome array meta-analysis was carried out among eight multiethnic population-based cohorts (n = 16 492) with computed tomography (CT) measured hepatic steatosis. A fixed effects meta-analysis identified five exome-wide significant loci (P < 5.30 × 10-7); including a novel signal near TOMM40/APOE. Joint analysis of TOMM40/APOE variants revealed the TOMM40 signal was attributed to APOE rs429358-T; APOE rs7412 was not associated with liver attenuation. Moreover, rs429358-T was associated with higher serum alanine aminotransferase, liver steatosis, cirrhosis, triglycerides and obesity; as well as, lower cholesterol and decreased risk of myocardial infarction and Alzheimer's disease (AD) in phenome-wide association analyses in the Michigan Genomics Initiative, United Kingdom Biobank and/or public datasets. These results implicate APOE in imaging-based identification of NAFLD. This association may or may not translate to nonalcoholic steatohepatitis; however, these results indicate a significant association with advanced liver disease and hepatic cirrhosis. These findings highlight allelic heterogeneity at the APOE locus and demonstrate an inverse link between NAFLD and AD at the exome level in the largest analysis to date.
Asunto(s)
Apolipoproteínas E/genética , Enfermedad del Hígado Graso no Alcohólico/genética , Obesidad/genética , Alanina Transaminasa , Alelos , Enfermedad de Alzheimer/genética , Apolipoproteínas E/metabolismo , Bases de Datos Genéticas , Exoma/genética , Frecuencia de los Genes/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Hígado , Cirrosis Hepática/genética , Infarto del Miocardio/genética , Enfermedad del Hígado Graso no Alcohólico/metabolismo , Obesidad/metabolismo , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Pronóstico , Factores de Riesgo , TriglicéridosRESUMEN
The Million Veteran Program (MVP), initiated by the Department of Veterans Affairs (VA), aims to collect biosamples with consent from at least one million veterans. Presently, blood samples have been collected from over 800,000 enrolled participants. The size and diversity of the MVP cohort, as well as the availability of extensive VA electronic health records, make it a promising resource for precision medicine. MVP is conducting array-based genotyping to provide a genome-wide scan of the entire cohort, in parallel with whole-genome sequencing, methylation, and other 'omics assays. Here, we present the design and performance of the MVP 1.0 custom Axiom array, which was designed and developed as a single assay to be used across the multi-ethnic MVP cohort. A unified genetic quality-control analysis was developed and conducted on an initial tranche of 485,856 individuals, leading to a high-quality dataset of 459,777 unique individuals. 668,418 genetic markers passed quality control and showed high-quality genotypes not only on common variants but also on rare variants. We confirmed that, with non-European individuals making up nearly 30%, MVP's substantial ancestral diversity surpasses that of other large biobanks. We also demonstrated the quality of the MVP dataset by replicating established genetic associations with height in European Americans and African Americans ancestries. This current dataset has been made available to approved MVP researchers for genome-wide association studies and other downstream analyses. Further data releases will be available for analysis as recruitment at the VA continues and the cohort expands both in size and diversity.
Asunto(s)
Etnicidad/genética , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Femenino , Marcadores Genéticos/genética , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Medicina de Precisión/métodos , Control de Calidad , Veteranos , Secuenciación Completa del Genoma/métodosRESUMEN
The association between coronary artery disease (CAD) and posttraumatic stress disorder (PTSD) contributes to the high morbidity and mortality observed for these conditions. To understand the dynamics underlying PTSD-CAD comorbidity, we investigated large-scale genome-wide association (GWA) statistics from the Million Veteran Program (MVP), the UK Biobank (UKB), the Psychiatric Genomics Consortium, and the CARDIoGRAMplusC4D Consortium. We observed a genetic correlation of CAD with PTSD case-control and quantitative outcomes, ranging from 0.18 to 0.32. To investigate possible cause-effect relationships underlying these genetic correlations, we performed a two-sample Mendelian randomization (MR) analysis, observing a significant bidirectional relationship between CAD and PTSD symptom severity. Genetically-determined PCL-17 (PTSD 17-item Checklist) total score was associated with increased CAD risk (odds ratio = 1.04; 95% confidence interval, 95% CI = 1.01-1.06). Conversely, CAD genetic liability was associated with reduced PCL-17 total score (beta = -0.42; 95% CI = -0.04 to -0.81). Because of these opposite-direction associations, we conducted a pleiotropic meta-analysis to investigate loci with concordant vs. discordant effects on PCL-17 and CAD, observing that concordant-effect loci were enriched for molecular pathways related to platelet amyloid precursor protein (beta = 1.53, p = 2.97 × 10-7) and astrocyte activation regulation (beta = 1.51, p = 2.48 × 10-6) while discordant-effect loci were enriched for biological processes related to lipid metabolism (e.g., triglyceride-rich lipoprotein particle clearance, beta = 2.32, p = 1.61 × 10-10). To follow up these results, we leveraged MVP and UKB electronic health records (EHR) to assess longitudinal changes in the association between CAD and posttraumatic stress severity. This EHR-based analysis highlighted that earlier CAD diagnosis is associated with increased PCL-total score later in life, while lower PCL total score was associated with increased risk of a later CAD diagnosis (Mann-Kendall trend test: MVP tau = 0.932, p < 2 × 10-16; UKB tau = 0.376, p = 0.005). In conclusion, both our genetically-informed analyses and our EHR-based follow-up investigation highlighted a bidirectional relationship between PTSD and CAD where multiple pleiotropic mechanisms are likely to be involved.
Asunto(s)
Enfermedad de la Arteria Coronaria , Trastornos por Estrés Postraumático , Humanos , Enfermedad de la Arteria Coronaria/genética , Enfermedad de la Arteria Coronaria/diagnóstico , Enfermedad de la Arteria Coronaria/epidemiología , Estudio de Asociación del Genoma Completo/métodos , Trastornos por Estrés Postraumático/genética , Polimorfismo de Nucleótido Simple , Registros Electrónicos de Salud , Comorbilidad , Factores de Riesgo , Predisposición Genética a la Enfermedad/genéticaRESUMEN
OBJECTIVE: Arterial stiffness is a risk factor for cardiovascular disease, including heart failure with preserved ejection fraction (HFpEF). MGP (matrix Gla protein) is implicated in vascular calcification in animal models, and circulating levels of the uncarboxylated, inactive form of MGP (ucMGP) are associated with cardiovascular disease-related and all-cause mortality in human studies. However, the role of MGP in arterial stiffness is uncertain. Approach and Results: We examined the association of ucMGP levels with vascular calcification, arterial stiffness including carotid-femoral pulse wave velocity (PWV), and incident heart failure in community-dwelling adults from the Framingham Heart Study. To further investigate the link between MGP and arterial stiffness, we compared aortic PWV in age- and sex-matched young (4-month-old) and aged (10-month-old) wild-type and Mgp+/- mice. Among 7066 adults, we observed significant associations between higher levels of ucMGP and measures of arterial stiffness, including higher PWV and pulse pressure. Longitudinal analyses demonstrated an association between higher ucMGP levels and future increases in systolic blood pressure and incident HFpEF. Aortic PWV was increased in older, but not young, female Mgp+/- mice compared with wild-type mice, and this augmentation in PWV was associated with increased aortic elastin fiber fragmentation and collagen accumulation. CONCLUSIONS: This translational study demonstrates an association between ucMGP levels and arterial stiffness and future HFpEF in a large observational study, findings that are substantiated by experimental studies showing that mice with Mgp heterozygosity develop arterial stiffness. Taken together, these complementary study designs suggest a potential role of therapeutically targeting MGP in HFpEF.
Asunto(s)
Proteínas de Unión al Calcio/sangre , Proteínas de la Matriz Extracelular/sangre , Insuficiencia Cardíaca/sangre , Rigidez Vascular , Animales , Presión Sanguínea , Proteínas de Unión al Calcio/genética , Proteínas de la Matriz Extracelular/genética , Femenino , Eliminación de Gen , Insuficiencia Cardíaca/genética , Insuficiencia Cardíaca/fisiopatología , Humanos , Estudios Longitudinales , Masculino , Ratones Endogámicos C57BL , Persona de Mediana Edad , Estudios Prospectivos , Volumen Sistólico , Proteína Gla de la MatrizRESUMEN
Rationale: A common MUC5B gene polymorphism, rs35705950-T, is associated with idiopathic pulmonary fibrosis (IPF), but its role in severe acute respiratory syndrome coronavirus 2 infection and disease severity is unclear. Objectives: To assess whether rs35705950-T confers differential risk for clinical outcomes associated with coronavirus disease (COVID-19) infection among participants in the Million Veteran Program (MVP). Methods: The MUC5B rs35705950-T allele was directly genotyped among MVP participants; clinical events and comorbidities were extracted from the electronic health records. Associations between the incidence or severity of COVID-19 and rs35705950-T were analyzed within each ancestry group in the MVP followed by transancestry meta-analysis. Replication and joint meta-analysis were conducted using summary statistics from the COVID-19 Host Genetics Initiative (HGI). Sensitivity analyses with adjustment for additional covariates (body mass index, Charlson comorbidity index, smoking, asbestosis, rheumatoid arthritis with interstitial lung disease, and IPF) and associations with post-COVID-19 pneumonia were performed in MVP subjects. Measurements and Main Results: The rs35705950-T allele was associated with fewer COVID-19 hospitalizations in transancestry meta-analyses within the MVP (Ncases = 4,325; Ncontrols = 507,640; OR = 0.89 [0.82-0.97]; P = 6.86 × 10-3) and joint meta-analyses with the HGI (Ncases = 13,320; Ncontrols = 1,508,841; OR, 0.90 [0.86-0.95]; P = 8.99 × 10-5). The rs35705950-T allele was not associated with reduced COVID-19 positivity in transancestry meta-analysis within the MVP (Ncases = 19,168/Ncontrols = 492,854; OR, 0.98 [0.95-1.01]; P = 0.06) but was nominally significant (P < 0.05) in the joint meta-analysis with the HGI (Ncases = 44,820; Ncontrols = 1,775,827; OR, 0.97 [0.95-1.00]; P = 0.03). Associations were not observed with severe outcomes or mortality. Among individuals of European ancestry in the MVP, rs35705950-T was associated with fewer post-COVID-19 pneumonia events (OR, 0.82 [0.72-0.93]; P = 0.001). Conclusions: The MUC5B variant rs35705950-T may confer protection in COVID-19 hospitalizations.
Asunto(s)
COVID-19 , Fibrosis Pulmonar Idiopática , Humanos , COVID-19/epidemiología , COVID-19/genética , Mucina 5B/genética , Polimorfismo Genético , Fibrosis Pulmonar Idiopática/genética , Genotipo , Hospitalización , Predisposición Genética a la Enfermedad/genéticaRESUMEN
Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.
Asunto(s)
Lípidos/sangre , Lípidos/genética , Grupos Raciales/genética , Bases de Datos Genéticas , Femenino , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Humanos , Lípidos/análisis , Masculino , Metagenómica/métodos , Grupos Minoritarios , Herencia Multifactorial/genética , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Estados Unidos/epidemiologíaRESUMEN
Clinical observations have linked tobacco smoking with increased type 2 diabetes risk. Mendelian randomization analysis has recently suggested smoking may be a causal risk factor for type 2 diabetes. However, this association could be mediated by additional risk factors correlated with smoking behavior, which have not been investigated. We hypothesized that body mass index (BMI) could help to explain the association between smoking and diabetes risk. First, we confirmed that genetic determinants of smoking initiation increased risk for type 2 diabetes (OR 1.21, 95% CI: 1.15-1.27, P = 1 × 10-12) and coronary artery disease (CAD; OR 1.21, 95% CI: 1.16-1.26, P = 2 × 10-20). Additionally, 2-fold increased smoking risk was positively associated with increased BMI (~0.8 kg/m2, 95% CI: 0.54-0.98 kg/m2, P = 1.8 × 10-11). Multivariable Mendelian randomization analyses showed that BMI accounted for nearly all the risk smoking exerted on type 2 diabetes (OR 1.06, 95% CI: 1.01-1.11, P = 0.03). In contrast, the independent effect of smoking on increased CAD risk persisted (OR 1.12, 95% CI: 1.08-1.17, P = 3 × 10-8). Causal mediation analyses agreed with these estimates. Furthermore, analysis using individual-level data from the Million Veteran Program independently replicated the association of smoking behavior with CAD (OR 1.24, 95% CI: 1.12-1.37, P = 2 × 10-5), but not type 2 diabetes (OR 0.98, 95% CI: 0.89-1.08, P = 0.69), after controlling for BMI. Our findings support a model whereby genetic determinants of smoking increase type 2 diabetes risk indirectly through their relationship with obesity. Smokers should be advised to stop smoking to limit type 2 diabetes and CAD risk. Therapeutic efforts should consider pathophysiology relating smoking and obesity.
Asunto(s)
Índice de Masa Corporal , Enfermedad de la Arteria Coronaria/epidemiología , Diabetes Mellitus Tipo 2/epidemiología , Estudio de Asociación del Genoma Completo , Obesidad/genética , Polimorfismo de Nucleótido Simple , Fumar/efectos adversos , Enfermedad de la Arteria Coronaria/etiología , Enfermedad de la Arteria Coronaria/patología , Diabetes Mellitus Tipo 2/etiología , Diabetes Mellitus Tipo 2/patología , Predisposición Genética a la Enfermedad , Humanos , Análisis de la Aleatorización Mendeliana , Obesidad/patología , Factores de RiesgoRESUMEN
Large-scale multi-ethnic cohorts offer unprecedented opportunities to elucidate the genetic factors influencing complex traits related to health and disease among minority populations. At the same time, the genetic diversity in these cohorts presents new challenges for analysis and interpretation. We consider the utility of race and/or ethnicity categories in genome-wide association studies (GWASs) of multi-ethnic cohorts. We demonstrate that race/ethnicity information enhances the ability to understand population-specific genetic architecture. To address the practical issue that self-identified racial/ethnic information may be incomplete, we propose a machine learning algorithm that produces a surrogate variable, termed HARE. We use height as a model trait to demonstrate the utility of HARE and ethnicity-specific GWASs.
Asunto(s)
Etnicidad/genética , Estudio de Asociación del Genoma Completo , Grupos Raciales/genética , Algoritmos , Humanos , Aprendizaje Automático , Máquina de Vectores de SoporteRESUMEN
Background Arterial arteriosclerosis and atherosclerosis reflect vascular disease, the subclinical detection of which allows opportunity for cardiovascular disease (CVD) prevention. Larger cohort studies simultaneously quantifying anatomic thoracic and abdominal aortic pathologic abnormalities are lacking in the literature. Purpose To investigate the association of aortic wall area (AWA) and atherosclerotic plaque presence and burden as measured on MRI scans with incident CVD in a community sample. Materials and Methods In this prospective cohort study, participants in the Framingham Heart Study Offspring Cohort without prevalent CVD underwent 1.5-T MRI (between 2002-2005) of the descending thoracic and abdominal aorta with electrocardiogram-gated axial T2-weighted black-blood acquisitions. The wall thickness of the thoracic aorta was measured at the pulmonary bifurcation level and used to calculate the AWA as the difference between cross-sectional vessel area and lumen area. For primary or secondary analyses, multivariable Cox proportional hazards regression models were used to examine the association of aortic MRI measures with risk of first-incident CVD events or stroke and coronary heart disease, respectively. Results In 1513 study participants (mean age, 64 years ± 9 [SD]; 842 women [56%]), 223 CVD events occurred during follow-up (median, 13.1 years), of which 97 were major events (myocardial infarction, ischemic stroke, or CVD death). In multivariable analysis, thoracic AWA and prevalent thoracic plaque were associated with incident CVD (hazard ratio [HR], 1.20 per SD unit [95% CI: 1.05, 1.37] [P = .006] and HR, 1.63 [95% CI: 1.12, 2.35] [P = .01], respectively). AWA and prevalent thoracic plaque were associated with increased hazards: 1.32 (95% CI: 1.07, 1.62; P = .01) and 2.20 (95% CI: 1.28, 3.79; P = .005), for stroke and coronary heart disease, respectively. Conclusion In middle-aged community-dwelling adults, thoracic aortic wall area (AWA), plaque prevalence, and plaque volumes measured with MRI were independently associated with incident cardiovascular disease, with AWA associated in particular with stroke, and plaque associated with coronary heart disease. Clinical trial registration no. NCT00041418 © RSNA, 2022 Online supplemental material is available for this article. See also the editorial by Peshock in this issue.