RESUMO
ABSTRACT: Coagulation factor VIII (FVIII) and its carrier protein von Willebrand factor (VWF) are critical to coagulation and platelet aggregation. We leveraged whole-genome sequence data from the Trans-Omics for Precision Medicine (TOPMed) program along with TOPMed-based imputation of genotypes in additional samples to identify genetic associations with circulating FVIII and VWF levels in a single-variant meta-analysis, including up to 45 289 participants. Gene-based aggregate tests were implemented in TOPMed. We identified 3 candidate causal genes and tested their functional effect on FVIII release from human liver endothelial cells (HLECs) and VWF release from human umbilical vein endothelial cells. Mendelian randomization was also performed to provide evidence for causal associations of FVIII and VWF with thrombotic outcomes. We identified associations (P < 5 × 10-9) at 7 new loci for FVIII (ST3GAL4, CLEC4M, B3GNT2, ASGR1, F12, KNG1, and TREM1/NCR2) and 1 for VWF (B3GNT2). VWF, ABO, and STAB2 were associated with FVIII and VWF in gene-based analyses. Multiphenotype analysis of FVIII and VWF identified another 3 new loci, including PDIA3. Silencing of B3GNT2 and the previously reported CD36 gene decreased release of FVIII by HLECs, whereas silencing of B3GNT2, CD36, and PDIA3 decreased release of VWF by HVECs. Mendelian randomization supports causal association of higher FVIII and VWF with increased risk of thrombotic outcomes. Seven new loci were identified for FVIII and 1 for VWF, with evidence supporting causal associations of FVIII and VWF with thrombotic outcomes. B3GNT2, CD36, and PDIA3 modulate the release of FVIII and/or VWF in vitro.
Assuntos
Moléculas de Adesão Celular , Fator VIII , Cininogênios , Lectinas Tipo C , Receptores de Superfície Celular , Fator de von Willebrand , Humanos , Fator de von Willebrand/genética , Fator de von Willebrand/metabolismo , Fator VIII/genética , Fator VIII/metabolismo , Polimorfismo de Nucleotídeo Único , Células Endoteliais da Veia Umbilical Humana/metabolismo , Análise da Randomização Mendeliana , Estudo de Associação Genômica Ampla , Trombose/genética , Trombose/sangue , Estudos de Associação Genética , Masculino , Células Endoteliais/metabolismo , FemininoRESUMO
Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features-referred to as canonical variables (CVs)-within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.
Assuntos
Análise de Correlação Canônica , Proteômica , Humanos , Proteômica/métodos , Multiômica , Estudos de CoortesRESUMO
Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (â¼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.
Assuntos
Estudo de Associação Genômica Ampla , Medicina de Precisão , Povo Asiático , Humanos , Desequilíbrio de Ligação/genética , Polimorfismo de Nucleotídeo Único/genética , Sequenciamento Completo do GenomaRESUMO
Plasma levels of fibrinogen, coagulation factors VII and VIII and von Willebrand factor (vWF) are four intermediate phenotypes that are heritable and have been associated with the risk of clinical thrombotic events. To identify rare and low-frequency variants associated with these hemostatic factors, we conducted whole-exome sequencing in 10 860 individuals of European ancestry (EA) and 3529 African Americans (AAs) from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and the National Heart, Lung and Blood Institute's Exome Sequencing Project. Gene-based tests demonstrated significant associations with rare variation (minor allele frequency < 5%) in fibrinogen gamma chain (FGG) (with fibrinogen, P = 9.1 × 10-13), coagulation factor VII (F7) (with factor VII, P = 1.3 × 10-72; seven novel variants) and VWF (with factor VIII and vWF; P = 3.2 × 10-14; one novel variant). These eight novel rare variant associations were independent of the known common variants at these loci and tended to have much larger effect sizes. In addition, one of the rare novel variants in F7 was significantly associated with an increased risk of venous thromboembolism in AAs (Ile200Ser; rs141219108; P = 4.2 × 10-5). After restricting gene-based analyses to only loss-of-function variants, a novel significant association was detected and replicated between factor VIII levels and a stop-gain mutation exclusive to AAs (rs3211938) in CD36 molecule (CD36). This variant has previously been linked to dyslipidemia but not with the levels of a hemostatic factor. These efforts represent the largest integration of whole-exome sequence data from two national projects to identify genetic variation associated with plasma hemostatic factors.
Assuntos
Fator VIII , Hemostáticos , Fator VII/genética , Fator VIII/genética , Fibrinogênio/genética , Humanos , Polimorfismo de Nucleotídeo Único/genética , Sequenciamento do Exoma , Fator de von Willebrand/análise , Fator de von Willebrand/genéticaRESUMO
BACKGROUND: Intercellular adhesion molecule-1 (ICAM-1) is a cell surface protein that participates in endothelial activation and is hypothesized to play a central role in heart failure (HF). We evaluated associations of ICAM1 missense genetic variants with circulating ICAM-1 levels and with incident HF. METHODS AND RESULTS: We identified 3 missense variants within ICAM1 (rs5491, rs5498 and rs1799969) and evaluated their associations with ICAM-1 levels in the Coronary Artery Risk Development in Young Adults Study and the Multi-Ethnic Study of Atherosclerosis (MESA). We determined the association among these 3 variants and incident HF in MESA. We separately evaluated significant associations in the Atherosclerosis Risk in Communities (ARIC) study. Of the 3 missense variants, rs5491 was common in Black participants (minor allele frequency [MAF] > 20%) and rare in other race/ethnic groups (MAF < 5%). In Black participants, the presence of rs5491 was associated with higher levels of circulating ICAM-1 at 2 timepoints separated by 8 years. Among Black participants in MESA (nâ¯=â¯1600), the presence of rs5491 was associated with an increased risk of incident HF with preserved ejection fraction (HFpEF; HRâ¯=â¯2.30; [95% CI 1.25-4.21; Pâ¯=â¯0.007]). The other ICAM1 missense variants (rs5498 and rs1799969) were associated with ICAM-1 levels, but there were no associations with HF. In ARIC, rs5491 was significantly associated with incident HF (HRâ¯=â¯1.24 [95% CI 1.02 - 1.51]; Pâ¯=â¯0.03), with a similar direction of effect for HFpEF that was not statistically significant. CONCLUSIONS: A common ICAM1 missense variant among Black individuals may be associated with increased risk of HF, which may be HFpEF-specific.
Assuntos
Aterosclerose , Insuficiência Cardíaca , Adulto Jovem , Humanos , Insuficiência Cardíaca/diagnóstico , Insuficiência Cardíaca/epidemiologia , Insuficiência Cardíaca/genética , Molécula 1 de Adesão Intercelular/genética , Volume Sistólico , Variação Genética/genéticaRESUMO
Mendelian randomization (MR) is an established approach for assessing the causal effects of heritable exposures on outcomes. Outcomes of interest often include binary clinical endpoints, but may also include censored survival times. We explore the implications of both the Cox proportional hazard model and the additive hazard model in the context of MR, with a specific emphasis on two-stage methods. We show that naive application of standard MR approaches to censored survival times may induce significant bias. Through simulations and analysis of data from the Women's Health Initiative, we provide practical advice on modeling survival outcomes in MRs.
Assuntos
Análise da Randomização Mendeliana , Modelos Genéticos , Viés , Causalidade , Feminino , Humanos , Modelos de Riscos ProporcionaisRESUMO
BACKGROUND AND PURPOSE: Stroke is the leading cause of death and long-term disability worldwide. Previous genome-wide association studies identified 51 loci associated with stroke (mostly ischemic) and its subtypes among predominantly European populations. Using whole-genome sequencing in ancestrally diverse populations from the Trans-Omics for Precision Medicine (TOPMed) Program, we aimed to identify novel variants, especially low-frequency or ancestry-specific variants, associated with all stroke, ischemic stroke and its subtypes (large artery, cardioembolic, and small vessel), and hemorrhagic stroke and its subtypes (intracerebral and subarachnoid). METHODS: Whole-genome sequencing data were available for 6833 stroke cases and 27 116 controls, including 22 315 European, 7877 Black, 2616 Hispanic/Latino, 850 Asian, 54 Native American, and 237 other ancestry participants. In TOPMed, we performed single variant association analysis examining 40 million common variants and aggregated association analysis focusing on rare variants. We also combined TOPMed European populations with over 28 000 additional European participants from the UK BioBank genome-wide array data through meta-analysis. RESULTS: In the single variant association analysis in TOPMed, we identified one novel locus 13q33 for large artery at whole-genome-wide significance (P<5.00×10-9) and 4 novel loci at genome-wide significance (P<5.00×10-8), all of which need confirmation in independent studies. Lead variants in all 5 loci are low-frequency but are more common in non-European populations. An aggregation of synonymous rare variants within the gene C6orf26 demonstrated suggestive evidence of association for hemorrhagic stroke (P<3.11×10-6). By meta-analyzing European ancestry samples in TOPMed and UK BioBank, we replicated several previously reported stroke loci including PITX2, HDAC9, ZFHX3, and LRCH1. CONCLUSIONS: We represent the first association analysis for stroke and its subtypes using whole-genome sequencing data from ancestrally diverse populations. While our findings suggest the potential benefits of combining whole-genome sequencing data with populations of diverse genetic backgrounds to identify possible low-frequency or ancestry-specific variants, they also highlight the need to increase genome coverage and sample sizes.
Assuntos
Loci Gênicos , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Medicina de Precisão , Grupos Raciais/genética , Acidente Vascular Cerebral/genética , Idoso , Idoso de 80 Anos ou mais , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Sequenciamento Completo do GenomaRESUMO
With advances in whole-genome sequencing (WGS) technology, more advanced statistical methods for testing genetic association with rare variants are being developed. Methods in which variants are grouped for analysis are also known as variant-set, gene-based, and aggregate unit tests. The burden test and sequence kernel association test (SKAT) are two widely used variant-set tests, which were originally developed for samples of unrelated individuals and later have been extended to family data with known pedigree structures. However, computationally efficient and powerful variant-set tests are needed to make analyses tractable in large-scale WGS studies with complex study samples. In this paper, we propose the variant-set mixed model association tests (SMMAT) for continuous and binary traits using the generalized linear mixed model framework. These tests can be applied to large-scale WGS studies involving samples with population structure and relatedness, such as in the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program. SMMATs share the same null model for different variant sets, and a virtue of this null model, which includes covariates only, is that it needs to be fit only once for all tests in each genome-wide analysis. Simulation studies show that all the proposed SMMATs correctly control type I error rates for both continuous and binary traits in the presence of population structure and relatedness. We also illustrate our tests in a real data example of analysis of plasma fibrinogen levels in the TOPMed program (n = 23,763), using the Analysis Commons, a cloud-based computing platform.
Assuntos
Estudos de Associação Genética , Modelos Genéticos , Sequenciamento Completo do Genoma , Cromossomos Humanos Par 4/genética , Computação em Nuvem , Feminino , Fibrinogênio/análise , Fibrinogênio/genética , Genética Populacional , Humanos , Masculino , National Heart, Lung, and Blood Institute (U.S.) , Medicina de Precisão , Projetos de Pesquisa , Fatores de Tempo , Estados UnidosRESUMO
[Figure: see text].
Assuntos
Doenças Cardiovasculares/sangue , Doenças Cardiovasculares/genética , Receptores de Lipopolissacarídeos/sangue , Receptores de Lipopolissacarídeos/genética , Polimorfismo de Nucleotídeo Único , Adulto , Negro ou Afro-Americano/genética , Fatores Etários , Idoso , Biomarcadores/sangue , Doenças Cardiovasculares/etnologia , Doenças Cardiovasculares/mortalidade , Estudos Transversais , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Fatores de Risco de Doenças Cardíacas , Humanos , Incidência , Masculino , Pessoa de Meia-Idade , Mississippi/epidemiologia , Fenótipo , Prognóstico , Fatores Raciais , Medição de Risco , Fatores de TempoRESUMO
Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.
Assuntos
Estudos de Associação Genética/métodos , Fenômica/métodos , Medicina de Precisão/métodos , Agregação de Dados , Humanos , Disseminação de Informação , National Heart, Lung, and Blood Institute (U.S.) , Fenótipo , Avaliação de Programas e Projetos de Saúde , Estados UnidosRESUMO
E-selectin mediates the rolling of circulating leukocytes during inflammatory processes. Previous genome-wide association studies in European and Asian individuals have identified the ABO locus associated with E-selectin levels. Using Trans-Omics for Precision Medicine whole genome sequencing data in 2249 African Americans (AAs) from the Jackson Heart Study, we examined genome-wide associations with soluble E-selectin levels. In addition to replicating known signals at ABO, we identified a novel association of a common loss-of-function, missense variant in Fucosyltransferase 6 (FUT6; rs17855739,p.Glu274Lys, P = 9.02 × 10-24) with higher soluble E-selectin levels. This variant is considerably more common in populations of African ancestry compared to non-African ancestry populations. We replicated the association of FUT6 p.Glu274Lys with higher soluble E-selectin in an independent population of 748 AAs from the Women's Health Initiative and identified an additional pleiotropic association with vitamin B12 levels. Despite the broad role of both selectins and fucosyltransferases in various inflammatory, immune and cancer-related processes, we were unable to identify any additional disease associations of the FUT6 p.Glu274Lys variant in an electronic medical record-based phenome-wide association scan of over 9000 AAs.
Assuntos
Negro ou Afro-Americano/genética , Selectina E/genética , Fucosiltransferases/genética , Adulto , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma/métodosRESUMO
Factor VII (FVII) is an important component of the coagulation cascade. Few genetic loci regulating FVII activity and/or levels have been discovered to date. We conducted a meta-analysis of 9 genome-wide association studies of plasma FVII levels (7 FVII activity and 2 FVII antigen) among 27 495 participants of European and African ancestry. Each study performed ancestry-specific association analyses. Inverse variance weighted meta-analysis was performed within each ancestry group and then combined for a trans-ancestry meta-analysis. Our primary analysis included the 7 studies that measured FVII activity, and a secondary analysis included all 9 studies. We provided functional genomic validation for newly identified significant loci by silencing candidate genes in a human liver cell line (HuH7) using small-interfering RNA and then measuring F7 messenger RNA and FVII protein expression. Lastly, we used meta-analysis results to perform Mendelian randomization analysis to estimate the causal effect of FVII activity on coronary artery disease, ischemic stroke (IS), and venous thromboembolism. We identified 2 novel (REEP3 and JAZF1-AS1) and 6 known loci associated with FVII activity, explaining 19.0% of the phenotypic variance. Adding FVII antigen data to the meta-analysis did not result in the discovery of further loci. Silencing REEP3 in HuH7 cells upregulated FVII, whereas silencing JAZF1 downregulated FVII. Mendelian randomization analyses suggest that FVII activity has a positive causal effect on the risk of IS. Variants at REEP3 and JAZF1 contribute to FVII activity by regulating F7 expression levels. FVII activity appears to contribute to the etiology of IS in the general population.
Assuntos
Isquemia Encefálica/etiologia , Fator VII/genética , Estudo de Associação Genômica Ampla , Proteínas de Membrana Transportadoras/genética , Proteínas de Neoplasias/genética , Polimorfismo de Nucleotídeo Único , Acidente Vascular Cerebral/etiologia , Isquemia Encefálica/metabolismo , Isquemia Encefálica/patologia , Proteínas Correpressoras , Estudos de Coortes , Doença da Artéria Coronariana/etiologia , Doença da Artéria Coronariana/metabolismo , Doença da Artéria Coronariana/patologia , Proteínas de Ligação a DNA , Fator VII/metabolismo , Feminino , Seguimentos , Loci Gênicos , Predisposição Genética para Doença , Humanos , Masculino , Proteínas de Membrana Transportadoras/metabolismo , Análise da Randomização Mendeliana , Pessoa de Meia-Idade , Proteínas de Neoplasias/metabolismo , Fenótipo , Prognóstico , Acidente Vascular Cerebral/metabolismo , Acidente Vascular Cerebral/patologia , Tromboembolia Venosa/etiologia , Tromboembolia Venosa/metabolismo , Tromboembolia Venosa/patologiaRESUMO
Co-inheritance of α-thalassemia has a significant protective effect on the severity of complications of sickle cell disease (SCD), including stroke. However, little information exists on the association and interactions for the common African ancestral α-thalassemia mutation (-α3.7 deletion) and ß-globin traits (HbS trait [SCT] and HbC trait) on important clinical phenotypes such as red blood cell parameters, anemia, and chronic kidney disease (CKD). In a community-based cohort of 2,916 African Americans from the Jackson Heart Study, we confirmed the expected associations between SCT, HbC trait, and the -α3.7 deletion with lower mean corpuscular volume/mean corpuscular hemoglobin and higher red blood cell count and red cell distribution width. In addition to the recently recognized association of SCT with lower estimated glomerular filtration rate and glycated hemoglobin (HbA1c), we observed a novel association of the -α3.7 deletion with higher HbA1c levels. Co-inheritance of each additional copy of the -α3.7 deletion significantly lowered the risk of anemia and chronic kidney disease among individuals with SCT (P-interaction = 0.031 and 0.019, respectively). Furthermore, co-inheritance of a novel α-globin regulatory variant was associated with normalization of red cell parameters in individuals with the -α3.7 deletion and significantly negated the protective effect of α-thalassemia on stroke in 1,139 patients with sickle cell anemia from the Cooperative Study of Sickle Cell Disease (CSSCD) (P-interaction = 0.0049). Functional assays determined that rs11865131, located in the major alpha-globin enhancer MCS-R2, was the most likely causal variant. These findings suggest that common α- and ß-globin variants interact to influence hematologic and clinical phenotypes in African Americans, with potential implications for risk-stratification and counseling of individuals with SCD and SCT.
Assuntos
Anemia Falciforme/genética , Hemoglobina Falciforme/genética , Traço Falciforme , alfa-Globinas/genética , Adulto , Negro ou Afro-Americano , Anemia Falciforme/sangue , Anemia Falciforme/fisiopatologia , Estudos de Coortes , Variações do Número de Cópias de DNA , Eritrócitos Anormais , Taxa de Filtração Glomerular , Hemoglobinas Glicadas/metabolismo , Humanos , Fenótipo , Adulto Jovem , Talassemia alfa/genéticaRESUMO
[This corrects the article DOI: 10.1371/journal.pgen.1006728.].
RESUMO
Novel proteomics platforms, such as the aptamer-based SOMAscan platform, can quantify large numbers of proteins efficiently and cost-effectively and are rapidly growing in popularity. However, comparisons to conventional immunoassays remain underexplored, leaving investigators unsure when cross-assay comparisons are appropriate. The correlation of results from immunoassays with relative protein quantification is explored by SOMAscan. For 63 proteins assessed in two chronic obstructive pulmonary disease (COPD) cohorts, subpopulations and intermediate outcome measures in COPD Study (SPIROMICS), and COPDGene, using myriad rules based medicine multiplex immunoassays and SOMAscan, Spearman correlation coefficients range from -0.13 to 0.97, with a median correlation coefficient of ≈0.5 and consistent results across cohorts. A similar range is observed for immunoassays in the population-based Multi-Ethnic Study of Atherosclerosis and for other assays in COPDGene and SPIROMICS. Comparisons of relative quantification from the antibody-based Olink platform and SOMAscan in a small cohort of myocardial infarction patients also show a wide correlation range. Finally, cis pQTL data, mass spectrometry aptamer confirmation, and other publicly available data are integrated to assess relationships with observed correlations. Correlation between proteomics assays shows a wide range and should be carefully considered when comparing and meta-analyzing proteomics data across assays and studies.
Assuntos
Infarto do Miocárdio/metabolismo , Proteoma/metabolismo , Proteômica/métodos , Doença Pulmonar Obstrutiva Crônica/metabolismo , Fumantes/estatística & dados numéricos , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Coortes , Feminino , Humanos , Imunoensaio/métodos , Masculino , Pessoa de Meia-Idade , Infarto do Miocárdio/sangue , Doença Pulmonar Obstrutiva Crônica/sangueRESUMO
BACKGROUND: Factor VIII (FVIII) and its carrier protein von Willebrand factor (VWF) are associated with risk of arterial and venous thrombosis and with hemorrhagic disorders. We aimed to identify and functionally test novel genetic associations regulating plasma FVIII and VWF. METHODS: We meta-analyzed genome-wide association results from 46 354 individuals of European, African, East Asian, and Hispanic ancestry. All studies performed linear regression analysis using an additive genetic model and associated ≈35 million imputed variants with natural log-transformed phenotype levels. In vitro gene silencing in cultured endothelial cells was performed for candidate genes to provide additional evidence on association and function. Two-sample Mendelian randomization analyses were applied to test the causal role of FVIII and VWF plasma levels on the risk of arterial and venous thrombotic events. RESULTS: We identified 13 novel genome-wide significant ( P≤2.5×10-8) associations, 7 with FVIII levels ( FCHO2/TMEM171/TNPO1, HLA, SOX17/RP1, LINC00583/NFIB, RAB5C-KAT2A, RPL3/TAB1/SYNGR1, and ARSA) and 11 with VWF levels ( PDHB/PXK/KCTD6, SLC39A8, FCHO2/TMEM171/TNPO1, HLA, GIMAP7/GIMAP4, OR13C5/NIPSNAP, DAB2IP, C2CD4B, RAB5C-KAT2A, TAB1/SYNGR1, and ARSA), beyond 10 previously reported associations with these phenotypes. Functional validation provided further evidence of association for all loci on VWF except ARSA and DAB2IP. Mendelian randomization suggested causal effects of plasma FVIII activity levels on venous thrombosis and coronary artery disease risk and plasma VWF levels on ischemic stroke risk. CONCLUSIONS: The meta-analysis identified 13 novel genetic loci regulating FVIII and VWF plasma levels, 10 of which we validated functionally. We provide some evidence for a causal role of these proteins in thrombotic events.
Assuntos
Arteriopatias Oclusivas/genética , Transtornos Herdados da Coagulação Sanguínea/genética , Coagulação Sanguínea/genética , Fator VIII/análise , Loci Gênicos , Trombose Venosa/genética , Fator de von Willebrand/análise , Arteriopatias Oclusivas/sangue , Arteriopatias Oclusivas/etnologia , Biomarcadores/sangue , Transtornos Herdados da Coagulação Sanguínea/sangue , Transtornos Herdados da Coagulação Sanguínea/etnologia , Marcadores Genéticos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Análise da Randomização Mendeliana , Fenótipo , Proteína Ribossômica L3 , Fatores de Risco , Trombose Venosa/sangue , Trombose Venosa/etnologiaRESUMO
BACKGROUND: Quantitative red blood cell (RBC) traits are highly polygenic clinically relevant traits, with approximately 500 reported GWAS loci. The majority of RBC trait GWAS have been performed in European- or East Asian-ancestry populations, despite evidence that rare or ancestry-specific variation contributes substantially to RBC trait heritability. Recently developed combined-phenotype methods which leverage genetic trait correlation to improve statistical power have not yet been applied to these traits. Here we leveraged correlation of seven quantitative RBC traits in performing a combined-phenotype analysis in a multi-ethnic study population. RESULTS: We used the adaptive sum of powered scores (aSPU) test to assess combined-phenotype associations between ~ 21 million SNPs and seven RBC traits in a multi-ethnic population (maximum n = 67,885 participants; 24% African American, 30% Hispanic/Latino, and 43% European American; 76% female). Thirty-nine loci in our multi-ethnic population contained at least one significant association signal (p < 5E-9), with lead SNPs at nine loci significantly associated with three or more RBC traits. A majority of the lead SNPs were common (MAF > 5%) across all ancestral populations. Nineteen additional independent association signals were identified at seven known loci (HFE, KIT, HBS1L/MYB, CITED2/FILNC1, ABO, HBA1/2, and PLIN4/5). For example, the HBA1/2 locus contained 14 conditionally independent association signals, 11 of which were previously unreported and are specific to African and Amerindian ancestries. One variant in this region was common in all ancestries, but exhibited a narrower LD block in African Americans than European Americans or Hispanics/Latinos. GTEx eQTL analysis of all independent lead SNPs yielded 31 significant associations in relevant tissues, over half of which were not at the gene immediately proximal to the lead SNP. CONCLUSION: This work identified seven loci containing multiple independent association signals for RBC traits using a combined-phenotype approach, which may improve discovery in genetically correlated traits. Highly complex genetic architecture at the HBA1/2 locus was only revealed by the inclusion of African Americans and Hispanics/Latinos, underscoring the continued importance of expanding large GWAS to include ancestrally diverse populations.
Assuntos
Negro ou Afro-Americano/genética , Eritrócitos/metabolismo , Estudo de Associação Genômica Ampla/métodos , Hispânico ou Latino/genética , Característica Quantitativa Herdável , População Branca/genética , Feminino , Genética Populacional , Humanos , Masculino , Herança Multifatorial , Fenótipo , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Estados Unidos/etnologiaRESUMO
BACKGROUND: CD14 is a membrane glycoprotein primarily expressed by myeloid cells that plays a key role in inflammation. Soluble CD14 (sCD14) levels carry a poor prognosis in chronic heart failure (HF), but whether elevations in sCD14 precede HF is unknown. We tested the hypothesis that sCD14 is associated with HF incidence and its subtypes independent of major inflammatory biomarkers among older adults. METHODS AND RESULTS: We included participants in the Cardiovascular Health Study without preexisting HF and available baseline sCD14. We evaluated the associations of sCD14, high-sensitivity C-reactive protein (hsCRP), interleukin (IL)-6, and white blood cell count (WBC) with incident HF and subtypes using Cox regression. Among 5217 participants, 1878 had incident HF over 13.6 years (609 classifiable as HF with preserved ejection fraction [HFpEF] and 419 as HF with reduced ejection fraction [HFrEF]). After adjusting for clinical and laboratory covariates, sCD14 was significantly associated with incident HF (hazard ratio [HR]: 1.56 per doubling, 95% confidence interval [CI]: 1.29-1.89), an association that was numerically stronger than for hsCRP (HR per doubling: 1.10, 95% CI: 1.06-1.15), IL-6 (HR: 1.18, 95% CI: 1.10-1.25), and WBC (HR: 1.24, 95% CI: 1.09-1.42), and that remained significant after adjustment for the other markers of inflammation. This association for sCD14 was observed with HFpEF (HR: 1.50, 95% CI: 1.07-2.10) but not HFrEF (HR: 0.99, 95% CI: 0.67-1.49). CONCLUSIONS: Plasma sCD14 was associated with incident HF independently and numerically more strongly than other major inflammatory markers. This association was only observed with HFpEF in the subset with classifiable HF subtypes. Pending replication, these findings have potentially important therapeutic implications.
Assuntos
Insuficiência Cardíaca , Receptores de Lipopolissacarídeos , Idoso , Biomarcadores , Insuficiência Cardíaca/diagnóstico , Insuficiência Cardíaca/epidemiologia , Humanos , Incidência , Prognóstico , Fatores de Risco , Volume SistólicoRESUMO
Prior GWAS have identified loci associated with red blood cell (RBC) traits in populations of European, African, and Asian ancestry. These studies have not included individuals with an Amerindian ancestral background, such as Hispanics/Latinos, nor evaluated the full spectrum of genomic variation beyond single nucleotide variants. Using a custom genotyping array enriched for Amerindian ancestral content and 1000 Genomes imputation, we performed GWAS in 12,502 participants of Hispanic Community Health Study and Study of Latinos (HCHS/SOL) for hematocrit, hemoglobin, RBC count, RBC distribution width (RDW), and RBC indices. Approximately 60% of previously reported RBC trait loci generalized to HCHS/SOL Hispanics/Latinos, including African ancestral alpha- and beta-globin gene variants. In addition to the known 3.8kb alpha-globin copy number variant, we identified an Amerindian ancestral association in an alpha-globin regulatory region on chromosome 16p13.3 for mean corpuscular volume and mean corpuscular hemoglobin. We also discovered and replicated three genome-wide significant variants in previously unreported loci for RDW (SLC12A2 rs17764730, PSMB5 rs941718), and hematocrit (PROX1 rs3754140). Among the proxy variants at the SLC12A2 locus we identified rs3812049, located in a bi-directional promoter between SLC12A2 (which encodes a red cell membrane ion-transport protein) and an upstream anti-sense long-noncoding RNA, LINC01184, as the likely causal variant. We further demonstrate that disruption of the regulatory element harboring rs3812049 affects transcription of SLC12A2 and LINC01184 in human erythroid progenitor cells. Together, these results reinforce the importance of genetic study of diverse ancestral populations, in particular Hispanics/Latinos.
Assuntos
Proteínas de Homeodomínio/genética , Complexo de Endopeptidases do Proteassoma/genética , RNA Longo não Codificante/genética , Membro 2 da Família 12 de Carreador de Soluto/genética , Proteínas Supressoras de Tumor/genética , alfa-Globinas/genética , Contagem de Eritrócitos , Eritrócitos , Feminino , Estudo de Associação Genômica Ampla , Hemoglobinas/genética , Hispânico ou Latino/genética , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Globinas beta/genéticaRESUMO
Hypertension is a leading cause of global disease, mortality, and disability. While individuals of African descent suffer a disproportionate burden of hypertension and its complications, they have been underrepresented in genetic studies. To identify novel susceptibility loci for blood pressure and hypertension in people of African ancestry, we performed both single and multiple-trait genome-wide association analyses. We analyzed 21 genome-wide association studies comprised of 31,968 individuals of African ancestry, and validated our results with additional 54,395 individuals from multi-ethnic studies. These analyses identified nine loci with eleven independent variants which reached genome-wide significance (P < 1.25×10-8) for either systolic and diastolic blood pressure, hypertension, or for combined traits. Single-trait analyses identified two loci (TARID/TCF21 and LLPH/TMBIM4) and multiple-trait analyses identified one novel locus (FRMD3) for blood pressure. At these three loci, as well as at GRP20/CDH17, associated variants had alleles common only in African-ancestry populations. Functional annotation showed enrichment for genes expressed in immune and kidney cells, as well as in heart and vascular cells/tissues. Experiments driven by these findings and using angiotensin-II induced hypertension in mice showed altered kidney mRNA expression of six genes, suggesting their potential role in hypertension. Our study provides new evidence for genes related to hypertension susceptibility, and the need to study African-ancestry populations in order to identify biologic factors contributing to hypertension.