RESUMO
Previous genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry1,2. Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis3, and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach4, we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry5. Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries.
Assuntos
Descoberta de Drogas , Predisposição Genética para Doença , AVC Isquêmico , Humanos , Isquemia Encefálica/genética , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , AVC Isquêmico/genética , Terapia de Alvo Molecular , Herança Multifatorial , Europa (Continente)/etnologia , Ásia Oriental/etnologia , África/etnologiaRESUMO
'Genome-first' approaches to analyzing rare variants can reveal new insights into human biology and disease. Because pathogenic variants are often rare, new discovery requires aggregating rare coding variants into 'gene burdens' for sufficient power. However, a major challenge is deciding which variants to include in gene burden tests. Pathogenic variants in MYBPC3 and MYH7 are well-known causes of hypertrophic cardiomyopathy (HCM), and focusing on these 'positive control' genes in a genome-first approach could help inform variant selection methods and gene burdening strategies for other genes and diseases. Integrating exome sequences with electronic health records among 41 759 participants in the Penn Medicine BioBank, we evaluated the performance of aggregating predicted loss-of-function (pLOF) and/or predicted deleterious missense (pDM) variants in MYBPC3 and MYH7 for gene burden phenome-wide association studies (PheWAS). The approach to grouping rare variants for these two genes produced very different results: pLOFs but not pDM variants in MYBPC3 were strongly associated with HCM, whereas the opposite was true for MYH7. Detailed review of clinical charts revealed that only 38.5% of patients with HCM diagnoses carrying an HCM-associated variant in MYBPC3 or MYH7 had a clinical genetic test result. Additionally, 26.7% of MYBPC3 pLOF carriers without HCM diagnoses had clear evidence of left atrial enlargement and/or septal/LV hypertrophy on echocardiography. Our study shows the importance of evaluating both pLOF and pDM variants for gene burden testing in future studies to uncover novel gene-disease relationships and identify new pathogenic loss-of-function variants across the human genome through genome-first analyses of healthcare-based populations.
Assuntos
Miosinas Cardíacas , Cardiomiopatia Hipertrófica , Bancos de Espécimes Biológicos , Miosinas Cardíacas/genética , Cardiomiopatia Hipertrófica/genética , Proteínas de Transporte/genética , Proteínas do Citoesqueleto/genética , Humanos , Mutação , Cadeias Pesadas de Miosina/genéticaRESUMO
BACKGROUND: Venous thromboembolism (VTE) is a life-threatening vascular event with environmental and genetic determinants. Recent VTE genome-wide association studies (GWAS) meta-analyses involved nearly 30 000 VTE cases and identified up to 40 genetic loci associated with VTE risk, including loci not previously suspected to play a role in hemostasis. The aim of our research was to expand discovery of new genetic loci associated with VTE by using cross-ancestry genomic resources. METHODS: We present new cross-ancestry meta-analyzed GWAS results involving up to 81 669 VTE cases from 30 studies, with replication of novel loci in independent populations and loci characterization through in silico genomic interrogations. RESULTS: In our genetic discovery effort that included 55 330 participants with VTE (47 822 European, 6320 African, and 1188 Hispanic ancestry), we identified 48 novel associations, of which 34 were replicated after correction for multiple testing. In our combined discovery-replication analysis (81 669 VTE participants) and ancestry-stratified meta-analyses (European, African, and Hispanic), we identified another 44 novel associations, which are new candidate VTE-associated loci requiring replication. In total, across all GWAS meta-analyses, we identified 135 independent genomic loci significantly associated with VTE risk. A genetic risk score of the significantly associated loci in Europeans identified a 6-fold increase in risk for those in the top 1% of scores compared with those with average scores. We also identified 31 novel transcript associations in transcriptome-wide association studies and 8 novel candidate genes with protein quantitative-trait locus Mendelian randomization analyses. In silico interrogations of hemostasis and hematology traits and a large phenome-wide association analysis of the 135 GWAS loci provided insights to biological pathways contributing to VTE, with some loci contributing to VTE through well-characterized coagulation pathways and others providing new data on the role of hematology traits, particularly platelet function. Many of the replicated loci are outside of known or currently hypothesized pathways to thrombosis. CONCLUSIONS: Our cross-ancestry GWAS meta-analyses identified new loci associated with VTE. These findings highlight new pathways to thrombosis and provide novel molecules that may be useful in the development of improved antithrombosis treatments.
Assuntos
Trombose , Tromboembolia Venosa , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genômica , Humanos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Trombose/genética , Tromboembolia Venosa/diagnóstico , Tromboembolia Venosa/genéticaRESUMO
BACKGROUND: Substantial data support a heritable basis for supraventricular tachycardias, but the genetic determinants and molecular mechanisms of these arrhythmias are poorly understood. We sought to identify genetic loci associated with atrioventricular nodal reentrant tachycardia (AVNRT) and atrioventricular accessory pathways or atrioventricular reciprocating tachycardia (AVAPs/AVRT). METHODS: We performed multiancestry meta-analyses of genome-wide association studies to identify genetic loci for AVNRT (4 studies) and AVAP/AVRT (7 studies). We assessed evidence supporting the potential causal effects of candidate genes by analyzing relations between associated variants and cardiac gene expression, performing transcriptome-wide analyses, and examining prior genome-wide association studies. RESULTS: Analyses comprised 2384 AVNRT cases and 106â 489 referents, and 2811 AVAP/AVRT cases and 1,483â 093 referents. We identified 2 significant loci for AVNRT, which implicate NKX2-5 and TTN as disease susceptibility genes. A transcriptome-wide association analysis supported an association between reduced predicted cardiac expression of NKX2-5 and AVNRT. We identified 3 significant loci for AVAP/AVRT, which implicate SCN5A, SCN10A, and TTN/CCDC141. Variant associations at several loci have been previously reported for cardiac phenotypes, including atrial fibrillation, stroke, Brugada syndrome, and electrocardiographic intervals. CONCLUSIONS: Our findings highlight gene regions associated with ion channel function (AVAP/AVRT), as well as cardiac development and the sarcomere (AVAP/AVRT and AVNRT) as important potential effectors of supraventricular tachycardia susceptibility.
Assuntos
Estudo de Associação Genômica Ampla , Taquicardia Supraventricular , Humanos , Taquicardia Supraventricular/genética , Predisposição Genética para Doença , Taquicardia por Reentrada no Nó Atrioventricular/genética , Polimorfismo de Nucleotídeo Único , Conectina/genética , TranscriptomaRESUMO
BACKGROUND: Obesity is a complex, multifactorial disease associated with substantial morbidity and mortality worldwide. Although it is frequently assessed using BMI, many epidemiological studies have shown links between body fat distribution and obesity-related outcomes. This study examined the relationships between body fat distribution and metabolic syndrome traits using Mendelian Randomization (MR). METHODS/FINDINGS: Genetic variants associated with visceral adipose tissue (VAT), abdominal subcutaneous adipose tissue (ASAT), and gluteofemoral adipose tissue (GFAT), as well as their relative ratios, were identified from a genome wide association study (GWAS) performed with the United Kingdom BioBank. GWAS summary statistics for traits and outcomes related to metabolic syndrome were obtained from the IEU Open GWAS Project. Two-sample MR and BMI-controlled multivariable MR (MVMR) were performed to examine relationships between each body fat measure and ratio with the outcomes. Increases in absolute GFAT were associated with a protective cardiometabolic profile, including lower low density lipoprotein cholesterol (ß: -0.19, [95% CI: -0.28, -0.10], p < 0.001), higher high density lipoprotein cholesterol (ß: 0.23, [95% CI: 0.03, 0.43], p = 0.025), lower triglycerides (ß: -0.28, [95% CI: -0.45, -0.10], p = 0.0021), and decreased systolic (ß: -1.65, [95% CI: -2.69, -0.61], p = 0.0019) and diastolic blood pressures (ß: -0.95, [95% CI: -1.65, -0.25], p = 0.0075). These relationships were largely maintained in BMI-controlled MVMR analyses. Decreases in relative GFAT were linked with a worse cardiometabolic profile, with higher levels of detrimental lipids and increases in systolic and diastolic blood pressures. CONCLUSION: A MR analysis of ASAT, GFAT, and VAT depots and their relative ratios with metabolic syndrome related traits and outcomes revealed that increased absolute and relative GFAT were associated with a favorable cardiometabolic profile independently of BMI. These associations highlight the importance of body fat distribution in obesity and more precise means to categorize obesity beyond BMI.
Assuntos
Doenças Cardiovasculares , Síndrome Metabólica , Humanos , Síndrome Metabólica/genética , Análise da Randomização Mendeliana , Estudo de Associação Genômica Ampla , Índice de Massa Corporal , Distribuição da Gordura Corporal , Obesidade/genéticaRESUMO
Heart failure (HF) is a complex trait, influenced by environmental and genetic factors, that affects over 30 million individuals worldwide. Historically, the genetics of HF have been studied in Mendelian forms of disease, where rare genetic variants have been linked to familial cardiomyopathies. More recently, genome-wide association studies (GWAS) have successfully identified common genetic variants associated with risk of HF. However, the relative importance of genetic variants across the allele-frequency spectrum remains incompletely characterized. Here, we report the results of common- and rare-variant association studies of all-cause heart failure, applying recently developed methods to quantify the heritability of HF attributable to different classes of genetic variation. We combine GWAS data across multiple populations including 207,346 individuals with HF and 2,151,210 without, identifying 176 risk loci at genome-wide significance (p < 5×10-8). Signals at newly identified common-variant loci include coding variants in Mendelian cardiomyopathy genes (MYBPC3, BAG3), as well as regulators of lipoprotein (LPL) and glucose metabolism (GIPR, GLP1R), and are enriched in cardiac, muscle, nerve, and vascular tissues, as well as myocyte and adipocyte cell types. Gene burden studies across three biobanks (PMBB, UKB, AOU) including 27,208 individuals with HF and 349,126 without uncover exome-wide significant (p < 3.15×10-6) associations for HF and rare predicted loss-of-function (pLoF) variants in TTN, MYBPC3, FLNC, and BAG3. Total burden heritability of rare coding variants (2.2%, 95% CI 0.99-3.5%) is highly concentrated in a small set of Mendelian cardiomyopathy genes, and is lower than heritability attributable to common variants (4.3%, 95% CI 3.9-4.7%) which is more diffusely spread throughout the genome. Finally, we demonstrate that common-variant background, in the form of a polygenic risk score (PRS), significantly modifies the risk of HF among carriers of pathogenic truncating variants in the Mendelian cardiomyopathy gene TTN. These findings suggest a significant polygenic component to HF exists that is not captured by current clinical genetic testing.
RESUMO
Abdominal aortic aneurysm (AAA) is a common disease with substantial heritability. In this study, we performed a genome-wide association meta-analysis from 14 discovery cohorts and uncovered 141 independent associations, including 97 previously unreported loci. A polygenic risk score derived from meta-analysis explained AAA risk beyond clinical risk factors. Genes at AAA risk loci indicate involvement of lipid metabolism, vascular development and remodeling, extracellular matrix dysregulation and inflammation as key mechanisms in AAA pathogenesis. These genes also indicate overlap between the development of AAA and other monogenic aortopathies, particularly via transforming growth factor ß signaling. Motivated by the strong evidence for the role of lipid metabolism in AAA, we used Mendelian randomization to establish the central role of nonhigh-density lipoprotein cholesterol in AAA and identified the opportunity for repurposing of proprotein convertase, subtilisin/kexin-type 9 (PCSK9) inhibitors. This was supported by a study demonstrating that PCSK9 loss of function prevented the development of AAA in a preclinical mouse model.
Assuntos
Aneurisma da Aorta Abdominal , Estudo de Associação Genômica Ampla , Humanos , Animais , Camundongos , Pró-Proteína Convertase 9/genética , Pró-Proteína Convertase 9/metabolismo , Subtilisina , Pró-Proteína Convertases , Aneurisma da Aorta Abdominal/genéticaRESUMO
The current understanding of the genetic determinants of thoracic aortic aneurysms and dissections (TAAD) has largely been informed through studies of rare, Mendelian forms of disease. Here, we conducted a genome-wide association study (GWAS) of TAAD, testing ~25 million DNA sequence variants in 8,626 participants with and 453,043 participants without TAAD in the Million Veteran Program, with replication in an independent sample of 4,459 individuals with and 512,463 without TAAD from six cohorts. We identified 21 TAAD risk loci, 17 of which have not been previously reported. We leverage multiple downstream analytic methods to identify causal TAAD risk genes and cell types and provide human genetic evidence that TAAD is a non-atherosclerotic aortic disorder distinct from other forms of vascular disease. Our results demonstrate that the genetic architecture of TAAD mirrors that of other complex traits and that it is not solely inherited through protein-altering variants of large effect size.
Assuntos
Aneurisma da Aorta Torácica , Dissecção Aórtica , Veteranos , Humanos , Estudo de Associação Genômica Ampla , Linhagem , Aneurisma da Aorta Torácica/genética , Dissecção Aórtica/genéticaRESUMO
Nonalcoholic fatty liver disease is common and highly heritable. Genetic studies of hepatic fat have not sufficiently addressed non-European and rare variants. In a medical biobank, we quantitate hepatic fat from clinical computed tomography (CT) scans via deep learning in 10,283 participants with whole-exome sequences available. We conduct exome-wide associations of single variants and rare predicted loss-of-function (pLOF) variants with CT-based hepatic fat and perform cross-modality replication in the UK Biobank (UKB) by linking whole-exome sequences to MRI-based hepatic fat. We confirm single variants previously associated with hepatic fat and identify several additional variants, including two (FGD5 H600Y and CITED2 S198_G199del) that replicated in UKB. A burden of rare pLOF variants in LMF2 is associated with increased hepatic fat and replicates in UKB. Quantitative phenotypes generated from clinical imaging studies and intersected with genomic data in medical biobanks have the potential to identify molecular pathways associated with human traits and disease.
Assuntos
Exoma , Hepatopatia Gordurosa não Alcoólica , Humanos , Exoma/genética , Bancos de Espécimes Biológicos , Fenótipo , Tomografia Computadorizada por Raios X , Hepatopatia Gordurosa não Alcoólica/diagnóstico por imagem , Hepatopatia Gordurosa não Alcoólica/genética , Proteínas Repressoras/genética , Transativadores/genéticaRESUMO
BACKGROUND: Identification of germline mutations in DNA repair genes has significant implications for the personalized treatment of individuals with prostate cancer (PrCa). OBJECTIVE: To determine DNA repair genes associated with localized PrCa in a diverse academic biobank and to determine genetic testing burden. DESIGN, SETTING, AND PARTICIPANTS: A cross-sectional study of 2391 localized PrCa patients was carried out. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: Genetic ancestry and mutation rates (excluding somatic interference) in 17 DNA repair genes were determined in 1588 localized PrCa patients and 3273 cancer-free males. Burden testing within individuals of genetically determined European (EUR) and African (AFR) ancestry was performed between biobank PrCa cases and cancer-free biobank and gnomAD males. RESULTS AND LIMITATIONS: AFR individuals with localized PrCa had lower DNA repair gene mutation rates than EUR individuals (1.4% vs 4.0%, p = 0.02). Mutation rates in localized PrCa patients were similar to those in biobank and gnomAD controls (EUR: 4.0% vs 2.8%, p = 0.15, vs 3.1%, p = 0.04; AFR: 1.4% vs 1.8%, p = 0.8, vs 2.1%, p = 0.5). Gene-based rare variant association testing revealed that only BRCA2 mutations were significantly enriched compared with gnomAD controls of EUR ancestry (1.0% vs 0.28%, p = 0.03). Of the participants, 21% and 11% met high-risk and very-high-risk criteria; of them, 3.7% and 6.2% had any germline genetic mutation and 1.0% and 2.5% had a BRCA2 mutation, respectively. Limitations of this study include an analysis of a relatively small, single-institution cohort. CONCLUSIONS: DNA repair gene germline mutation rates are low in an academic biobank cohort of localized PrCa patients, particularly among individuals of AFR genetic ancestry. Mutation rates in genes with published evidence of association with PrCa exceed 2.5% only in high-risk, very-high-risk localized, and node-positive PrCa patients. These findings highlight the importance of risk stratification in localized PrCa patients to identify appropriate patients for germline genetic testing. PATIENT SUMMARY: In the majority of patients who develop localized prostate cancer, germline genetic testing is unlikely to reveal an inherited DNA repair mutation, regardless of race. High-risk features increase the possibility of a germline DNA repair mutation.
Assuntos
Mutação em Linhagem Germinativa , Neoplasias da Próstata , Estudos Transversais , Reparo do DNA/genética , Genes BRCA2 , Predisposição Genética para Doença , Humanos , Masculino , Neoplasias da Próstata/genéticaRESUMO
Heart failure is a leading cause of cardiovascular morbidity and mortality. However, the contribution of common genetic variation to heart failure risk has not been fully elucidated, particularly in comparison to other common cardiometabolic traits. We report a multi-ancestry genome-wide association study meta-analysis of all-cause heart failure including up to 115,150 cases and 1,550,331 controls of diverse genetic ancestry, identifying 47 risk loci. We also perform multivariate genome-wide association studies that integrate heart failure with related cardiac magnetic resonance imaging endophenotypes, identifying 61 risk loci. Gene-prioritization analyses including colocalization and transcriptome-wide association studies identify known and previously unreported candidate cardiomyopathy genes and cellular processes, which we validate in gene-expression profiling of failing and healthy human hearts. Colocalization, gene expression profiling, and Mendelian randomization provide convergent evidence for the roles of BCKDHA and circulating branch-chain amino acids in heart failure and cardiac structure. Finally, proteome-wide Mendelian randomization identifies 9 circulating proteins associated with heart failure or quantitative imaging traits. These analyses highlight similarities and differences among heart failure and associated cardiovascular imaging endophenotypes, implicate common genetic variation in the pathogenesis of heart failure, and identify circulating proteins that may represent cardiomyopathy treatment targets.
Assuntos
Estudo de Associação Genômica Ampla , Insuficiência Cardíaca , Humanos , Estudo de Associação Genômica Ampla/métodos , Fenótipo , Insuficiência Cardíaca/genética , Coração , Perfilação da Expressão Gênica , Polimorfismo de Nucleotídeo Único , Predisposição Genética para DoençaRESUMO
More than 800 million people in the world suffer from chronic kidney disease (CKD). Genome-wide association studies (GWAS) have identified hundreds of loci where genetic variants are associated with kidney function; however, causal genes and pathways for CKD remain unknown. Here, we performed integration of kidney function GWAS and human kidney-specific expression quantitative trait analysis and identified that the expression of beta-mannosidase (MANBA) was lower in kidneys of subjects with CKD risk genotype. We also show an increased incidence of renal failure in subjects with rare heterozygous loss-of-function coding variants in MANBA using phenome-wide association analysis of 40,963 subjects with exome sequencing data. MANBA is a lysosomal gene highly expressed in kidney tubule cells. Deep phenotyping revealed structural and functional lysosomal alterations in human kidneys from subjects with CKD risk alleles and mice with genetic deletion of Manba Manba heterozygous and knockout mice developed more severe kidney fibrosis when subjected to toxic injury induced by cisplatin or folic acid. Manba loss altered multiple pathways, including endocytosis and autophagy. In the absence of Manba, toxic acute tubule injury induced inflammasome activation and fibrosis. Together, these results illustrate the convergence of common noncoding and rare coding variants in MANBA in kidney disease development and demonstrate the role of the endolysosomal system in kidney disease development.
Assuntos
Nefropatias , beta-Manosidase , Animais , Estudo de Associação Genômica Ampla , Humanos , Nefropatias/genética , Lisossomos , Manosidases , Camundongos , Fatores de Risco , Índice de Gravidade de Doença , beta-Manosidase/genéticaRESUMO
The clinical impact of rare loss-of-function variants has yet to be determined for most genes. Integration of DNA sequencing data with electronic health records (EHRs) could enhance our understanding of the contribution of rare genetic variation to human disease1. By leveraging 10,900 whole-exome sequences linked to EHR data in the Penn Medicine Biobank, we addressed the association of the cumulative effects of rare predicted loss-of-function variants for each individual gene on human disease on an exome-wide scale, as assessed using a set of diverse EHR phenotypes. After discovering 97 genes with exome-by-phenome-wide significant phenotype associations (P < 10-6), we replicated 26 of these in the Penn Medicine Biobank, as well as in three other medical biobanks and the population-based UK Biobank. Of these 26 genes, five had associations that have been previously reported and represented positive controls, whereas 21 had phenotype associations not previously reported, among which were genes implicated in glaucoma, aortic ectasia, diabetes mellitus, muscular dystrophy and hearing loss. These findings show the value of aggregating rare predicted loss-of-function variants into 'gene burdens' for identifying new gene-disease associations using EHR phenotypes in a medical biobank. We suggest that application of this approach to even larger numbers of individuals will provide the statistical power required to uncover unexplored relationships between rare genetic variation and disease phenotypes.
Assuntos
Registros Eletrônicos de Saúde , Exoma , Genótipo , Fenótipo , Idoso , Biologia Computacional , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Sequenciamento do ExomaRESUMO
We investigated type 2 diabetes (T2D) genetic susceptibility via multi-ancestry meta-analysis of 228,499 cases and 1,178,783 controls in the Million Veteran Program (MVP), DIAMANTE, Biobank Japan and other studies. We report 568 associations, including 286 autosomal, 7 X-chromosomal and 25 identified in ancestry-specific analyses that were previously unreported. Transcriptome-wide association analysis detected 3,568 T2D associations with genetically predicted gene expression in 687 novel genes; of these, 54 are known to interact with FDA-approved drugs. A polygenic risk score (PRS) was strongly associated with increased risk of T2D-related retinopathy and modestly associated with chronic kidney disease (CKD), peripheral artery disease (PAD) and neuropathy. We investigated the genetic etiology of T2D-related vascular outcomes in the MVP and observed statistical SNP-T2D interactions at 13 variants, including coronary heart disease (CHD), CKD, PAD and neuropathy. These findings may help to identify potential therapeutic targets for T2D and genomic pathways that link T2D to vascular outcomes.