ABSTRACT
Clonal hematopoiesis of indeterminate potential (CHIP) arises from aging-associated acquired mutations in hematopoietic progenitors, which display clonal expansion and produce phenotypically altered leukocytes. We associated CHIP-DNMT3A mutations with a higher prevalence of periodontitis and gingival inflammation among 4,946 community-dwelling adults. To model DNMT3A-driven CHIP, we used mice with the heterozygous loss-of-function mutation R878H, equivalent to the human hotspot mutation R882H. Partial transplantation with Dnmt3aR878H/+ bone marrow (BM) cells resulted in clonal expansion of mutant cells into both myeloid and lymphoid lineages and an elevated abundance of osteoclast precursors in the BM and osteoclastogenic macrophages in the periphery. DNMT3A-driven clonal hematopoiesis in recipient mice promoted naturally occurring periodontitis and aggravated experimentally induced periodontitis and arthritis, associated with enhanced osteoclastogenesis, IL-17-dependent inflammation and neutrophil responses, and impaired regulatory T cell immunosuppressive activity. DNMT3A-driven clonal hematopoiesis and, subsequently, periodontitis were suppressed by rapamycin treatment. DNMT3A-driven CHIP represents a treatable state of maladaptive hematopoiesis promoting inflammatory bone loss.
Subject(s)
Clonal Hematopoiesis , DNA (Cytosine-5-)-Methyltransferases , DNA Methyltransferase 3A , Periodontitis , Animals , DNA (Cytosine-5-)-Methyltransferases/metabolism , DNA (Cytosine-5-)-Methyltransferases/genetics , Mice , Clonal Hematopoiesis/genetics , Humans , Periodontitis/genetics , Periodontitis/pathology , Mutation , Male , Female , Inflammation/genetics , Inflammation/pathology , Osteoclasts/metabolism , Mice, Inbred C57BL , Adult , Interleukin-17/metabolism , Interleukin-17/genetics , T-Lymphocytes, Regulatory/immunology , T-Lymphocytes, Regulatory/metabolism , Hematopoiesis/genetics , Osteogenesis/genetics , Hematopoietic Stem Cells/metabolism , Bone Resorption/genetics , Bone Resorption/pathology , Middle AgedABSTRACT
Obesity and poverty disproportionally affect African American persons. Epigenetic mechanisms could partially explain the association between socioeconomic disadvantage and body mass index (BMI). We examined the extent to which epigenetic mechanisms mediate the effect of socioeconomic status (SES) on BMI. Using data from African American adults from the Atherosclerosis Risk in Communities (ARIC) Study (n = 2664, mean age = 57 years), education, income, and occupation were used to create a composite SES score at visit 1 (1987-1989). We conducted two methylation-wide association analyses to identify associations between SES (visit 1), BMI and cytosine-phosphate-guanine (CpG) sites measured at a subsequent visit (1990-1995). We then utilized structural equation modeling (SEM) to test whether identified sites mediated the association between earlier SES and BMI in sex-stratified models adjusted for demographic and risk factor covariates. Independent replication and meta-analyses were conducted using the Jackson Heart Study (JHS, n = 874, mean age 51 years, 2000-2004). Three CpG sites near MAD1L1, KDM2B, and SOCS3 (cg05095590, cg1370865, and cg18181703) were suggestively associated (P-value < 1.3×10-5) in ARIC and at array-wide significance (P-value < 1.3×10-7) in a combined meta-analysis of ARIC with JHS. SEM of these three sites revealed significant indirect effects in females (P-value < 5.8×10-3), each mediating 7%-20% of the total effect of SES on BMI. Nominally significant indirect effects were observed for two sites near MAD1L1 and KDM2B in males (P-value < 3.4×10-2), mediating -17 and -22% of the SES-BMI effect. These results provide further evidence that epigenetic modifications may be a potential pathway through which SES may "get under the skin" and contribute to downstream health disparities.
Subject(s)
Black or African American , Body Mass Index , CpG Islands , DNA Methylation , Jumonji Domain-Containing Histone Demethylases , Nuclear Proteins , Social Class , Suppressor of Cytokine Signaling 3 Protein , Humans , Female , Male , Black or African American/genetics , Jumonji Domain-Containing Histone Demethylases/genetics , Jumonji Domain-Containing Histone Demethylases/metabolism , Middle Aged , Suppressor of Cytokine Signaling 3 Protein/genetics , Suppressor of Cytokine Signaling 3 Protein/metabolism , Nuclear Proteins/genetics , Nuclear Proteins/metabolism , CpG Islands/genetics , F-Box Proteins/genetics , F-Box Proteins/metabolism , Epigenesis, Genetic , Obesity/genetics , Adult , Aged , Risk Factors , Genome-Wide Association Study , Cell Cycle ProteinsABSTRACT
Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight, which can be provided by a multi-ancestry, whole-genome based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits-E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin-that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.
Subject(s)
Biomarkers , Genome-Wide Association Study , Inflammation , Precision Medicine , Whole Genome Sequencing , Humans , Precision Medicine/methods , Inflammation/genetics , Genome-Wide Association Study/methods , Whole Genome Sequencing/methods , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Genetic Predisposition to Disease , Female , Interleukin-6/geneticsABSTRACT
The heritability explained by local ancestry markers in an admixed population (hγ2) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of hγ2 can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA hγ2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of â¼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe hËγ2 in the 20 phenotypes range from 0.0025 to 0.033 (mean hËγ2 = 0.012 ± 9.2 × 10-4), which translates to hË2 ranging from 0.062 to 0.85 (mean hË2 = 0.30 ± 0.023). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.
Subject(s)
Black or African American , Genetics, Population , Humans , Chromosome Mapping , Phenotype , Polymorphism, Single Nucleotide/geneticsABSTRACT
A growing body of work has addressed human adaptations to diverse environments using genomic data, but few studies have connected putatively selected alleles to phenotypes, much less among underrepresented populations such as Amerindians. Studies of natural selection and genotype-phenotype relationships in underrepresented populations hold potential to uncover previously undescribed loci underlying evolutionarily and biomedically relevant traits. Here, we worked with the Tsimane and the Moseten, two Amerindian populations inhabiting the Bolivian lowlands. We focused most intensively on the Tsimane, because long-term anthropological work with this group has shown that they have a high burden of both macro and microparasites, as well as minimal cardiometabolic disease or dementia. We therefore generated genome-wide genotype data for Tsimane individuals to study natural selection, and paired this with blood mRNA-seq as well as cardiometabolic and immune biomarker data generated from a larger sample that included both populations. In the Tsimane, we identified 21 regions that are candidates for selective sweeps, as well as 5 immune traits that show evidence for polygenic selection (e.g., C-reactive protein levels and the response to coronaviruses). Genes overlapping candidate regions were strongly enriched for known involvement in immune-related traits, such as abundance of lymphocytes and eosinophils. Importantly, we were also able to draw on extensive phenotype information for the Tsimane and Moseten and link five regions (containing PSD4, MUC21 and MUC22, TOX2, ANXA6, and ABCA1) with biomarkers of immune and metabolic function. Together, our work highlights the utility of pairing evolutionary analyses with anthropological and biomedical data to gain insight into the genetic basis of health-related traits.
Subject(s)
Genetics, Population , Health Status , Humans , Biomarkers , Bolivia , Genomics , Genotype , Phenotype , Polymorphism, Single Nucleotide , Selection, Genetic , Genome, HumanABSTRACT
Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11 487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10-9). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10-9). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10-8) and NPEPPS (P = 1.51 × 10-7), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD.
Subject(s)
Diabetes Mellitus , Diabetic Nephropathies , Renal Insufficiency, Chronic , Humans , Aminopeptidases , Diabetic Nephropathies/genetics , Exome Sequencing , Kidney , Renal Insufficiency, Chronic/geneticsABSTRACT
For the genomics community, allele frequencies within defined groups (or "strata") are useful across multiple research and clinical contexts. Benefits include allowing researchers to identify populations for replication or "look up" studies, enabling researchers to compare population-specific frequencies to validate findings, and facilitating assessment of variant pathogenicity in clinical contexts. However, there are potential concerns with stratified allele frequencies. These include potential re-identification (determining whether or not an individual participated in a given research study based on allele frequencies and individual-level genetic data), harm from associating stigmatizing variants with specific groups, potential reification of race as a biological rather than a socio-political category, and whether presenting stratified frequencies-and the downstream applications that this presentation enables-is consistent with participants' informed consents. The NHLBI Trans-Omics for Precision Medicine (TOPMed) program considered the scientific and social implications of different approaches for adding stratified frequencies to the TOPMed BRAVO (Browse All Variants Online) variant server. We recommend a novel approach of presenting ancestry-specific allele frequencies using a statistical method based upon local genetic ancestry inference. Notably, this approach does not require grouping individuals by either predominant global ancestry or race/ethnicity and, therefore, mitigates re-identification and other concerns as the mixture distribution of ancestral allele frequencies varies across the genome. Here we describe our considerations and approach, which can assist other genomics research programs facing similar issues of how to define and present stratified frequencies in publicly available variant databases.
Subject(s)
Motivation , Precision Medicine , Ethnicity/genetics , Gene Frequency/genetics , Genomics/methods , HumansABSTRACT
Since 2005, genome-wide association (GWA) datasets have been largely biased toward sampling European ancestry individuals, and recent studies have shown that GWA results estimated from self-identified European individuals are not transferable to non-European individuals because of various confounding challenges. Here, we demonstrate that enrichment analyses that aggregate SNP-level association statistics at multiple genomic scales-from genes to genomic regions and pathways-have been underutilized in the GWA era and can generate biologically interpretable hypotheses regarding the genetic basis of complex trait architecture. We illustrate examples of the robust associations generated by enrichment analyses while studying 25 continuous traits assayed in 566,786 individuals from seven diverse self-identified human ancestries in the UK Biobank and the Biobank Japan as well as 44,348 admixed individuals from the PAGE consortium including cohorts of African American, Hispanic and Latin American, Native Hawaiian, and American Indian/Alaska Native individuals. We identify 1,000 gene-level associations that are genome-wide significant in at least two ancestry cohorts across these 25 traits as well as highly conserved pathway associations with triglyceride levels in European, East Asian, and Native Hawaiian cohorts.
Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Genome-Wide Association Study/methods , Humans , Multifactorial Inheritance , Phenotype , Polymorphism, Single Nucleotide/genetics , Racial GroupsABSTRACT
One mechanism by which genetic factors influence complex traits and diseases is altering gene expression. Direct measurement of gene expression in relevant tissues is rarely tenable; however, genetically regulated gene expression (GReX) can be estimated using prediction models derived from large multi-omic datasets. These approaches have led to the discovery of many gene-trait associations, but whether models derived from predominantly European ancestry (EA) reference panels can map novel associations in ancestrally diverse populations remains unclear. We applied PrediXcan to impute GReX in 51,520 ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) participants (35% African American, 45% Hispanic/Latino, 10% Asian, and 7% Hawaiian) across 25 key cardiometabolic traits and relevant tissues to identify 102 novel associations. We then compared associations in PAGE to those in a random subset of 50,000 White British participants from UK Biobank (UKBB50k) for height and body mass index (BMI). We identified 517 associations across 47 tissues in PAGE but not UKBB50k, demonstrating the importance of diverse samples in identifying trait-associated GReX. We observed that variants used in PrediXcan models were either more or less differentiated across continental-level populations than matched-control variants depending on the specific population reflecting sampling bias. Additionally, variants from identified genes specific to either PAGE or UKBB50k analyses were more ancestrally differentiated than those in genes detected in both analyses, underlining the value of population-specific discoveries. This suggests that while EA-derived transcriptome imputation models can identify new associations in non-EA populations, models derived from closely matched reference panels may yield further insights. Our findings call for more diversity in reference datasets of tissue-specific gene expression.
Subject(s)
Cardiovascular Diseases , Genome-Wide Association Study , Genetic Predisposition to Disease , Humans , Life Style , Polymorphism, Single Nucleotide , TranscriptomeABSTRACT
BACKGROUND: Height has been associated with many clinical traits but whether such associations are causal versus secondary to confounding remains unclear in many cases. To systematically examine this question, we performed a Mendelian Randomization-Phenome-wide association study (MR-PheWAS) using clinical and genetic data from a national healthcare system biobank. METHODS AND FINDINGS: Analyses were performed using data from the US Veterans Affairs (VA) Million Veteran Program in non-Hispanic White (EA, n = 222,300) and non-Hispanic Black (AA, n = 58,151) adults in the US. We estimated height genetic risk based on 3290 height-associated variants from a recent European-ancestry genome-wide meta-analysis. We compared associations of measured and genetically-predicted height with phenome-wide traits derived from the VA electronic health record, adjusting for age, sex, and genetic principal components. We found 345 clinical traits associated with measured height in EA and an additional 17 in AA. Of these, 127 were associated with genetically-predicted height at phenome-wide significance in EA and 2 in AA. These associations were largely independent from body mass index. We confirmed several previously described MR associations between height and cardiovascular disease traits such as hypertension, hyperlipidemia, coronary heart disease (CHD), and atrial fibrillation, and further uncovered MR associations with venous circulatory disorders and peripheral neuropathy in the presence and absence of diabetes. As a number of traits associated with genetically-predicted height frequently co-occur with CHD, we evaluated effect modification by CHD status of genetically-predicted height associations with risk factors for and complications of CHD. We found modification of effects of MR associations by CHD status for atrial fibrillation/flutter but not for hypertension, hyperlipidemia, or venous circulatory disorders. CONCLUSIONS: We conclude that height may be an unrecognized but biologically plausible risk factor for several common conditions in adults. However, more studies are needed to reliably exclude horizontal pleiotropy as a driving force behind at least some of the MR associations observed in this study.
Subject(s)
Atrial Fibrillation , Hypertension , Veterans , Adult , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Hypertension/epidemiology , Hypertension/genetics , Polymorphism, Single Nucleotide/geneticsABSTRACT
OBJECTIVE: Colonic diverticulosis is a prevalent condition among older adults, marked by the presence of thin-walled pockets in the colon wall that can become inflamed, infected, haemorrhage or rupture. We present a case-control genetic and transcriptomic study aimed at identifying the genetic and cellular determinants underlying this condition and the relationship with other gastrointestinal disorders. DESIGN: We conducted DNA and RNA sequencing on colonic tissue from 404 patients with (N=172) and without (N=232) diverticulosis. We investigated variation in the transcriptome associated with diverticulosis and further integrated this variation with single-cell RNA-seq data from the human intestine. We also integrated our expression quantitative trait loci with genome-wide association study using Mendelian randomisation (MR). Furthermore, a Polygenic Risk Score analysis gauged associations between diverticulosis severity and other gastrointestinal disorders. RESULTS: We discerned 38 genes with differential expression and 17 with varied transcript usage linked to diverticulosis, indicating tissue remodelling as a primary diverticula formation mechanism. Diverticula formation was primarily linked to stromal and epithelial cells in the colon including endothelial cells, myofibroblasts, fibroblasts, goblet, tuft, enterocytes, neurons and glia. MR highlighted five genes including CCN3, CRISPLD2, ENTPD7, PHGR1 and TNFSF13, with potential causal effects on diverticulosis. Notably, ENTPD7 upregulation was confirmed in diverticulosis cases. Additionally, diverticulosis severity was positively correlated with genetic predisposition to diverticulitis. CONCLUSION: Our results suggest that tissue remodelling is a primary mechanism for diverticula formation. Individuals with an increased genetic proclivity to diverticulitis exhibit a larger numbers of diverticula on colonoscopy.
Subject(s)
Diverticulosis, Colonic , Genome-Wide Association Study , Transcriptome , Humans , Diverticulosis, Colonic/genetics , Male , Female , Aged , Case-Control Studies , Middle Aged , Quantitative Trait Loci , Mendelian Randomization Analysis , Genetic Predisposition to DiseaseABSTRACT
AIMS/HYPOTHESIS: Several studies have reported associations between specific proteins and type 2 diabetes risk in European populations. To better understand the role played by proteins in type 2 diabetes aetiology across diverse populations, we conducted a large proteome-wide association study using genetic instruments across four racial and ethnic groups: African; Asian; Hispanic/Latino; and European. METHODS: Genome and plasma proteome data from the Multi-Ethnic Study of Atherosclerosis (MESA) study involving 182 African, 69 Asian, 284 Hispanic/Latino and 409 European individuals residing in the USA were used to establish protein prediction models by using potentially associated cis- and trans-SNPs. The models were applied to genome-wide association study summary statistics of 250,127 type 2 diabetes cases and 1,222,941 controls from different racial and ethnic populations. RESULTS: We identified three, 44 and one protein associated with type 2 diabetes risk in Asian, European and Hispanic/Latino populations, respectively. Meta-analysis identified 40 proteins associated with type 2 diabetes risk across the populations, including well-established as well as novel proteins not yet implicated in type 2 diabetes development. CONCLUSIONS/INTERPRETATION: Our study improves our understanding of the aetiology of type 2 diabetes in diverse populations. DATA AVAILABILITY: The summary statistics of multi-ethnic type 2 diabetes GWAS of MVP, DIAMANTE, Biobank Japan and other studies are available from The database of Genotypes and Phenotypes (dbGaP) under accession number phs001672.v3.p1. MESA genetic, proteome and covariate data can be accessed through dbGaP under phs000209.v13.p3. All code is available on GitHub ( https://github.com/Arthur1021/MESA-1K-PWAS ).
ABSTRACT
CpG site methylation patterns have potential to improve differentiation of high-grade screening-detected cervical abnormalities. We assessed CpG differential methylation (DM) and differential variability (DV) in high-grade (CIN2+) vs. low-grade (≤CIN1) lesions. In ≤CIN1 (n=117) and CIN2+ (n=31) samples, cervical sample DNA underwent testing with Illumina HumanMethylation arrays. We assessed DM and DV of CpG methylation M values among nine cervical cancer-associated genes. We fit CpG-specific linear models and estimated empirical Bayes standard errors and false discovery rates (FDR). An exploratory epigenome-wide association study (EWAS) aimed to detect novel DM and DV CpGs (FDR<0.05) and Gene Ontology (GO) term enrichment. Compared to ≤CIN1, CIN2+ exhibited greater methylation at CCNA1 Cluster 1 (M value difference 0.24; 95% CI 0.04, 0.43) and RARB Cluster 2 (0.16; 95% CI 0.05, 0.28), and lower methylation at CDH1 Cluster 1 (-0.15; 95% CI -0.26, -0.04). CIN2+ exhibited lower variability at CDH1 Cluster 2 (variation difference -0.24; 95% CI -0.41, -0.05) and FHIT Cluster 1 (-0.30; 95% CI -0.50, -0.09). EWAS detected 3,534 DM and 270 DV CpGs. Forty-four GO terms were enriched with DM CpGs related to transcriptional, structural, developmental, and neuronal processes. Methylation patterns may help triage screening-detected cervical abnormalities and inform US screening algorithms.
ABSTRACT
Type 2 diabetes is a complex, systemic disease affected by both genetic and environmental factors. Previous research has identified genetic variants associated with type 2 diabetes risk; however, gene regulatory changes underlying progression to metabolic dysfunction are still largely unknown. We investigated RNA expression changes that occur during diabetes progression using a two-stage approach. In our discovery stage, we compared changes in gene expression using two longitudinally collected blood samples from subjects whose fasting blood glucose transitioned to a level consistent with type 2 diabetes diagnosis between the time points against those who did not with a novel analytical network approach. Our network methodology identified 17 networks, one of which was significantly associated with transition status. This 822-gene network harbors many genes novel to the type 2 diabetes literature but is also significantly enriched for genes previously associated with type 2 diabetes. In the validation stage, we queried associations of genetically determined expression with diabetes-related traits in a large biobank with linked electronic health records. We observed a significant enrichment of genes in our identified network whose genetically determined expression is associated with type 2 diabetes and other metabolic traits and validated 31 genes that are not near previously reported type 2 diabetes loci. Finally, we provide additional functional support, which suggests that the genes in this network are regulated by enhancers that operate in human pancreatic islet cells. We present an innovative and systematic approach that identified and validated key gene expression changes associated with type 2 diabetes transition status and demonstrated their translational relevance in a large clinical resource.
Subject(s)
Diabetes Mellitus, Type 2 , Blood Glucose/genetics , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/metabolism , Gene Expression , Gene Expression Profiling , Gene Regulatory Networks/genetics , Genetic Association Studies , Humans , RNAABSTRACT
Platelets play a key role in thrombosis and hemostasis. Platelet count (PLT) and mean platelet volume (MPV) are highly heritable quantitative traits, with hundreds of genetic signals previously identified, mostly in European ancestry populations. We here utilize whole genome sequencing (WGS) from NHLBI's Trans-Omics for Precision Medicine initiative (TOPMed) in a large multi-ethnic sample to further explore common and rare variation contributing to PLT (n = 61 200) and MPV (n = 23 485). We identified and replicated secondary signals at MPL (rs532784633) and PECAM1 (rs73345162), both more common in African ancestry populations. We also observed rare variation in Mendelian platelet-related disorder genes influencing variation in platelet traits in TOPMed cohorts (not enriched for blood disorders). For example, association of GP9 with lower PLT and higher MPV was partly driven by a pathogenic Bernard-Soulier syndrome variant (rs5030764, p.Asn61Ser), and the signals at TUBB1 and CD36 were partly driven by loss of function variants not annotated as pathogenic in ClinVar (rs199948010 and rs571975065). However, residual signal remained for these gene-based signals after adjusting for lead variants, suggesting that additional variants in Mendelian genes with impacts in general population cohorts remain to be identified. Gene-based signals were also identified at several genome-wide association study identified loci for genes not annotated for Mendelian platelet disorders (PTPRH, TET2, CHEK2), with somatic variation driving the result at TET2. These results highlight the value of WGS in populations of diverse genetic ancestry to identify novel regulatory and coding signals, even for well-studied traits like platelet traits.
Subject(s)
Genome-Wide Association Study , Precision Medicine , Blood Platelets , Humans , National Heart, Lung, and Blood Institute (U.S.) , Phenotype , Polymorphism, Single Nucleotide , Precision Medicine/methods , United StatesABSTRACT
Given the coronavirus disease 2019 (COVID-19) pandemic, investigations into host susceptibility to infectious diseases and downstream sequelae have never been more relevant. Pneumonia is a lung disease that can cause respiratory failure and hypoxia and is a common complication of infectious diseases, including COVID-19. Few genome-wide association studies (GWASs) of host susceptibility and severity of pneumonia have been conducted. We performed GWASs of pneumonia susceptibility and severity in the Vanderbilt University biobank (BioVU) with linked electronic health records (EHRs), including Illumina Expanded Multi-Ethnic Global Array (MEGAEX)-genotyped European ancestry (EA, n= 69,819) and African ancestry (AA, n = 15,603) individuals. Two regions of large effect were identified: the CFTR locus in EA (rs113827944; OR = 1.84, p value = 1.2 × 10-36) and HBB in AA (rs334 [p.Glu7Val]; OR = 1.63, p value = 3.5 × 10-13). Mutations in these genes cause cystic fibrosis (CF) and sickle cell disease (SCD), respectively. After removing individuals diagnosed with CF and SCD, we assessed heterozygosity effects at our lead variants. Further GWASs after removing individuals with CF uncovered an additional association in R3HCC1L (rs10786398; OR = 1.22, p value = 3.5 × 10-8), which was replicated in two independent datasets: UK Biobank (n = 459,741) and 7,985 non-overlapping BioVU subjects, who are genotyped on arrays other than MEGAEX. This variant was also validated in GWASs of COVID-19 hospitalization and lung function. Our results highlight the importance of the host genome in infectious disease susceptibility and severity and offer crucial insight into genetic effects that could potentially influence severity of COVID-19 sequelae.
Subject(s)
COVID-19/complications , COVID-19/genetics , Host-Pathogen Interactions/genetics , Pneumonia, Viral/complications , Pneumonia, Viral/genetics , Bronchitis/genetics , COVID-19/pathology , COVID-19/physiopathology , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Databases, Genetic , Electronic Health Records , Female , Genome-Wide Association Study , Genotype , Hemoglobins/genetics , Humans , Inpatients , Linkage Disequilibrium , Male , Outpatients , Pneumonia, Viral/pathology , Pneumonia, Viral/physiopathology , Polymorphism, Single Nucleotide/genetics , Principal Component Analysis , Pulmonary Disease, Chronic Obstructive/genetics , Reproducibility of Results , United KingdomABSTRACT
Whole-genome sequencing (WGS), a powerful tool for detecting novel coding and non-coding disease-causing variants, has largely been applied to clinical diagnosis of inherited disorders. Here we leveraged WGS data in up to 62,653 ethnically diverse participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and assessed statistical association of variants with seven red blood cell (RBC) quantitative traits. We discovered 14 single variant-RBC trait associations at 12 genomic loci, which have not been reported previously. Several of the RBC trait-variant associations (RPN1, ELL2, MIDN, HBB, HBA1, PIEZO1, and G6PD) were replicated in independent GWAS datasets imputed to the TOPMed reference panel. Most of these discovered variants are rare/low frequency, and several are observed disproportionately among non-European Ancestry (African, Hispanic/Latino, or East Asian) populations. We identified a 3 bp indel p.Lys2169del (g.88717175_88717177TCT[4]) (common only in the Ashkenazi Jewish population) of PIEZO1, a gene responsible for the Mendelian red cell disorder hereditary xerocytosis (MIM: 194380), associated with higher mean corpuscular hemoglobin concentration (MCHC). In stepwise conditional analysis and in gene-based rare variant aggregated association analysis, we identified several of the variants in HBB, HBA1, TMPRSS6, and G6PD that represent the carrier state for known coding, promoter, or splice site loss-of-function variants that cause inherited RBC disorders. Finally, we applied base and nuclease editing to demonstrate that the sentinel variant rs112097551 (nearest gene RPN1) acts through a cis-regulatory element that exerts long-range control of the gene RUVBL1 which is essential for hematopoiesis. Together, these results demonstrate the utility of WGS in ethnically diverse population-based samples and gene editing for expanding knowledge of the genetic architecture of quantitative hematologic traits and suggest a continuum between complex trait and Mendelian red cell disorders.
Subject(s)
Erythrocytes/metabolism , Erythrocytes/pathology , Genome-Wide Association Study , National Heart, Lung, and Blood Institute (U.S.)/organization & administration , Phenotype , Adult , Aged , Chromosomes, Human, Pair 16/genetics , Datasets as Topic , Female , Gene Editing , Genetic Variation/genetics , HEK293 Cells , Humans , Male , Middle Aged , Quality Control , Reproducibility of Results , United StatesABSTRACT
Disordered eating (DE) is associated with elevated cardiometabolic risk (CMR) factors, yet little is known about this association in non-Western countries. We examined the association between DE characteristics and CMR and tested the potential mediating role of BMI. This cross-sectional study included 2005 Chinese women (aged 18-50 years) from the 2015 China Health and Nutrition Survey. Loss of control, restraint, shape concern and weight concern were assessed using selected questions from the SCOFF questionnaire and the Eating Disorder Examination-Questionnaire. Eight CMR were measured by trained staff. Generalised linear models examined associations between DE characteristics with CMR accounting for dependencies between individuals in the same household. We tested whether BMI potentially mediated significant associations using structural equation modelling. Shape concern was associated with systolic blood pressure (ß (95 % CI) 0·06 (0·01, 0·10)), diastolic blood pressure (DBP) (0·07 (95 % CI 0·03, 0·11)) and high-density lipoprotein (HDL)-cholesterol (-0·08 (95 % CI -0·12, -0·04)). Weight concern was associated with DBP (0·06 (95 % CI 0·02, 0·10)), triglyceride (0·06 (95 % CI 0·02, 0·10)) and HDL-cholesterol (-0·10 (95 % CI -0·14, -0·07)). Higher scores on DE characteristics were associated with higher BMI, and higher BMI was further associated with lower HDL-cholesterol and higher other CMR. In summary, we observed significant associations between shape and weight concerns with some CMR in Chinese women, and these associations were potentially partially mediated by BMI. Our findings suggest that prevention and intervention strategies focusing on addressing DE could potentially help reduce the burden of CMR in China, possibly through controlling BMI.
ABSTRACT
Rationale: Inflammation contributes to lung function decline and the development of chronic obstructive pulmonary disease. Omega-3 fatty acids have antiinflammatory properties and may benefit lung health. Objectives: To investigate associations of omega-3 fatty acids with lung function decline and incident airway obstruction in a diverse sample of adults from general-population cohorts. Methods: Complementary study designs: 1) longitudinal study of plasma phospholipid omega-3 fatty acids and repeated FEV1 and FVC measures in the NHLBI Pooled Cohorts Study and 2) two-sample Mendelian randomization (MR) study of genetically predicted omega-3 fatty acids and lung function parameters. Measurements and Main Results: The longitudinal study found that higher omega-3 fatty acid levels were associated with attenuated lung function decline in 15,063 participants, with the largest effect sizes for the most metabolically downstream omega-3 fatty acid, docosahexaenoic acid (DHA). An increase in DHA of 1% of total fatty acids was associated with attenuations of 1.4 ml/yr for FEV1 (95% confidence interval [CI], 1.1-1.8) and 2.0 ml/yr for FVC (95% CI, 1.6-2.4) and a 7% lower incidence of spirometry-defined airway obstruction (95% CI, 0.89-0.97). DHA associations persisted across sexes and smoking histories and in Black, White, and Hispanic participants, with associations of the largest magnitude in former smokers and Hispanic participants. The MR study showed similar trends toward positive associations of genetically predicted downstream omega-3 fatty acids with FEV1 and FVC. Conclusions: The longitudinal and MR studies provide evidence supporting beneficial effects of higher levels of downstream omega-3 fatty acids, especially DHA, on lung health.
Subject(s)
Airway Obstruction , Fatty Acids, Omega-3 , Pulmonary Disease, Chronic Obstructive , Adult , Humans , Longitudinal Studies , Lung , Pulmonary Disease, Chronic Obstructive/genetics , Docosahexaenoic AcidsABSTRACT
Hematological measures are important intermediate clinical phenotypes for many acute and chronic diseases and are highly heritable. Although genome-wide association studies (GWAS) have identified thousands of loci containing trait-associated variants, the causal genes underlying these associations are often uncertain. To better understand the underlying genetic regulatory mechanisms, we performed a transcriptome-wide association study (TWAS) to systematically investigate the association between genetically predicted gene expression and hematological measures in 54,542 Europeans from the Genetic Epidemiology Research on Aging cohort. We found 239 significant gene-trait associations with hematological measures; we replicated 71 associations at p < 0.05 in a TWAS meta-analysis consisting of up to 35,900 Europeans from the Women's Health Initiative, Atherosclerosis Risk in Communities Study, and BioMe Biobank. Additionally, we attempted to refine this list of candidate genes by performing conditional analyses, adjusting for individual variants previously associated with hematological measures, and performed further fine-mapping of TWAS loci. To facilitate interpretation of our findings, we designed an R Shiny application to interactively visualize our TWAS results by integrating them with additional genetic data sources (GWAS, TWAS from multiple reference panels, conditional analyses, known GWAS variants, etc.). Our results and application highlight frequently overlooked TWAS challenges and illustrate the complexity of TWAS fine-mapping.