RESUMO
Regular cigarette smoking and cannabis consumption are strongly positively related to each other, yet few studies explore their underlying variation and covariation. We evaluated the genetic and environmental decomposition of variance and covariance of these two traits in twin data from three countries with different social norms and legislation. Data from the Netherlands Twin Register, FinnTwin12/16, and the Minnesota Center for Twin Family Research (total N = 21,617) were analyzed in bivariate threshold models of lifetime regular smoking initiation (RSI) and lifetime cannabis initiation (CI). We ran unstratified models and models stratified by sex and country. Prevalence of RSI was lowest in the Netherlands and prevalence of CI was highest in Minnesota. In the unstratified model, genetic (A) and common environmental factors (C) contributed substantially to the liabilities of RSI (A = 0.47, C = 0.34) and CI (A = 0.28, C = 0.51). The two liabilities were significantly phenotypically (rP = 0.56), genetically (rA = 0.74), and environmentally correlated in the unstratified model (rC = 0.47and rE = 0.48, representing correlations between common and unique environmental factors). The magnitude of phenotypic correlation between liabilities varied by country but not sex (Minnesota rP ~ 0.70, Netherlands rP ~ 0.59, Finland rP ~ 0.45). Comparisons of decomposed correlations could not be reliably tested in the stratified models. The prevalence and association of RSI and CI vary by sex and country. These two behaviors are correlated because there is genetic and environmental overlap between their underlying latent liabilities. There is heterogeneity in the genetic architecture of these traits across country.
Assuntos
Fumar Tabaco , Humanos , Masculino , Feminino , Países Baixos/epidemiologia , Adulto , Finlândia/epidemiologia , Minnesota/epidemiologia , Adolescente , Prevalência , Adulto Jovem , Pessoa de Meia-Idade , Fenótipo , Gêmeos Dizigóticos/genética , Fumar Maconha/genética , Fumar Maconha/epidemiologia , Gêmeos Monozigóticos/genética , Sistema de Registros , Fumar/genética , Fumar/epidemiologiaRESUMO
Epigenome-wide association studies (EWAS) aim to identify differentially methylated loci associated with complex traits and disorders. EWAS of cigarette smoking shows some of the most widespread DNA methylation (DNAm) associations in blood. However, traditional EWAS cannot differentiate between causation and confounding, leading to ambiguity in etiological interpretations. Here, we apply an integrated approach combining Mendelian Randomization and twin-based Direction-of-Causation analyses (MR-DoC) to examine causality underlying smoking-associated blood DNAm changes in the Netherlands Twin Register (N=2577). Evidence across models suggests that current smoking's causal effects on DNAm likely drive many of the previous EWAS findings, implicating functional pathways relevant to several adverse health outcomes of smoking, including hemopoiesis, cell- and neuro-development, and immune regulation. Additionally, we find evidence of potential reverse causal influences at some DNAm sites, with 17 of these sites enriched for gene regulatory functional elements in the brain. The top three sites with evidence of DNAm's effects on smoking annotate to genes involved in G protein-coupled receptor signaling (GNG7, RGS3) and innate immune response (SLC15A4), elucidating potential biological risk factors for smoking. This study highlights the utility of integrating genotypic and DNAm measures in twin cohorts to clarify the causal relationships between health behaviors and blood DNAm.
RESUMO
The evolving field of multi-omics combines data and provides methods for simultaneous analysis across several omics levels. Here, we integrated genomics (transmitted and non-transmitted polygenic scores [PGSs]), epigenomics, and metabolomics data in a multi-omics framework to identify biomarkers for Attention-Deficit/Hyperactivity Disorder (ADHD) and investigated the connections among the three omics levels. We first trained single- and next multi-omics models to differentiate between cases and controls in 596 twins (cases = 14.8%) from the Netherlands Twin Register (NTR) demonstrating reasonable in-sample prediction through cross-validation. The multi-omics model selected 30 PGSs, 143 CpGs, and 90 metabolites. We confirmed previous associations of ADHD with glucocorticoid exposure and the transmembrane protein family TMEM, show that the DNA methylation of the MAD1L1 gene associated with ADHD has a relation with parental smoking behavior, and present novel findings including associations between indirect genetic effects and CpGs of the STAP2 gene. However, out-of-sample prediction in NTR participants (N = 258, cases = 14.3%) and in a clinical sample (N = 145, cases = 51%) did not perform well (range misclassification was [0.40, 0.57]). The results highlighted connections between omics levels, with the strongest connections between non-transmitted PGSs, CpGs, and amino acid levels and show that multi-omics designs considering interrelated omics levels can help unravel the complex biology underlying ADHD.
Assuntos
Transtorno do Deficit de Atenção com Hiperatividade , Humanos , Transtorno do Deficit de Atenção com Hiperatividade/genética , Transtorno do Deficit de Atenção com Hiperatividade/psicologia , Epigenômica , Multiômica , Genômica , MetabolômicaRESUMO
STUDY QUESTION: Which genetic factors regulate female propensity for giving birth to spontaneous dizygotic (DZ) twins? SUMMARY ANSWER: We identified four new loci, GNRH1, FSHR, ZFPM1, and IPO8, in addition to previously identified loci, FSHB and SMAD3. WHAT IS KNOWN ALREADY: The propensity to give birth to DZ twins runs in families. Earlier, we reported that FSHB and SMAD3 as associated with DZ twinning and female fertility measures. STUDY DESIGN, SIZE, DURATION: We conducted a genome-wide association meta-analysis (GWAMA) of mothers of spontaneous dizygotic (DZ) twins (8265 cases, 264â567 controls) and of independent DZ twin offspring (26â252 cases, 417â433 controls). PARTICIPANTS/MATERIALS, SETTING, METHODS: Over 700â000 mothers of DZ twins, twin individuals and singletons from large cohorts in Australia/New Zealand, Europe, and the USA were carefully screened to exclude twins born after use of ARTs. Genetic association analyses by cohort were followed by meta-analysis, phenome wide association studies (PheWAS), in silico and in vivo annotations, and Zebrafish functional validation. MAIN RESULTS AND THE ROLE OF CHANCE: This study enlarges the sample size considerably from previous efforts, finding four genome-wide significant loci, including two novel signals and a further two novel genes that are implicated by gene level enrichment analyses. The novel loci, GNRH1 and FSHR, have well-established roles in female reproduction whereas ZFPM1 and IPO8 have not previously been implicated in female fertility. We found significant genetic correlations with multiple aspects of female reproduction and body size as well as evidence for significant selection against DZ twinning during human evolution. The 26 top single nucleotide polymorphisms (SNPs) from our GWAMA in European-origin participants weakly predicted the crude twinning rates in 47 non-European populations (r = 0.23 between risk score and population prevalence, s.e. 0.11, 1-tail P = 0.058) indicating that genome-wide association studies (GWAS) are needed in African and Asian populations to explore the causes of their respectively high and low DZ twinning rates. In vivo functional tests in zebrafish for IPO8 validated its essential role in female, but not male, fertility. In most regions, risk SNPs linked to known expression quantitative trait loci (eQTLs). Top SNPs were associated with in vivo reproductive hormone levels with the top pathways including hormone ligand binding receptors and the ovulation cycle. LARGE SCALE DATA: The full DZT GWAS summary statistics will made available after publication through the GWAS catalog (https://www.ebi.ac.uk/gwas/). LIMITATIONS, REASONS FOR CAUTION: Our study only included European ancestry cohorts. Inclusion of data from Africa (with the highest twining rate) and Asia (with the lowest rate) would illuminate further the biology of twinning and female fertility. WIDER IMPLICATIONS OF THE FINDINGS: About one in 40 babies born in the world is a twin and there is much speculation on why twinning runs in families. We hope our results will inform investigations of ovarian response in new and existing ARTs and the causes of female infertility. STUDY FUNDING/COMPETING INTEREST(S): Support for the Netherlands Twin Register came from the Netherlands Organization for Scientific Research (NWO) and The Netherlands Organization for Health Research and Development (ZonMW) grants, 904-61-193, 480-04-004, 400-05-717, Addiction-31160008, 911-09-032, Biobanking and Biomolecular Resources Research Infrastructure (BBMRI.NL, 184.021.007), Royal Netherlands Academy of Science Professor Award (PAH/6635) to DIB, European Research Council (ERC-230374), Rutgers University Cell and DNA Repository (NIMH U24 MH068457-06), the Avera Institute, Sioux Falls, South Dakota (USA) and the National Institutes of Health (NIH R01 HD042157-01A1) and the Genetic Association Information Network (GAIN) of the Foundation for the National Institutes of Health and Grand Opportunity grants 1RC2 MH089951. The QIMR Berghofer Medical Research Institute (QIMR) study was supported by grants from the National Health and Medical Research Council (NHMRC) of Australia (241944, 339462, 389927, 389875, 389891, 389892, 389938, 443036, 442915, 442981, 496610, 496739, 552485, 552498, 1050208, 1075175). L.Y. is funded by Australian Research Council (Grant number DE200100425). The Minnesota Center for Twin and Family Research (MCTFR) was supported in part by USPHS Grants from the National Institute on Alcohol Abuse and Alcoholism (AA09367 and AA11886) and the National Institute on Drug Abuse (DA05147, DA13240, and DA024417). The Women's Genome Health Study (WGHS) was funded by the National Heart, Lung, and Blood Institute (HL043851 and HL080467) and the National Cancer Institute (CA047988 and UM1CA182913), with support for genotyping provided by Amgen. Data collection in the Finnish Twin Registry has been supported by the Wellcome Trust Sanger Institute, the Broad Institute, ENGAGE-European Network for Genetic and Genomic Epidemiology, FP7-HEALTH-F4-2007, grant agreement number 201413, National Institute of Alcohol Abuse and Alcoholism (grants AA-12502, AA-00145, AA-09203, AA15416, and K02AA018755) and the Academy of Finland (grants 100499, 205585, 118555, 141054, 264146, 308248, 312073 and 336823 to J. Kaprio). TwinsUK is funded by the Wellcome Trust, Medical Research Council, Versus Arthritis, European Union Horizon 2020, Chronic Disease Research Foundation (CDRF), Zoe Ltd and the National Institute for Health Research (NIHR) Clinical Research Network (CRN) and Biomedical Research Centre based at Guy's and St Thomas' NHS Foundation Trust in partnership with King's College London. For NESDA, funding was obtained from the Netherlands Organization for Scientific Research (Geestkracht program grant 10000-1002), the Center for Medical Systems Biology (CSMB, NVVO Genomics), Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL), VU University's Institutes for Health and Care Research (EMGO+) and Neuroscience Campus Amsterdam, University Medical Center Groningen, Leiden University Medical Center, National Institutes of Health (NIH, ROI D0042157-01A, MH081802, Grand Opportunity grants 1 RC2 Ml-1089951 and IRC2 MH089995). Part of the genotyping and analyses were funded by the Genetic Association Information Network (GAIN) of the Foundation for the National Institutes of Health. Computing was supported by BiG Grid, the Dutch e-Science Grid, which is financially supported by NWO. Work in the Del Bene lab was supported by the Programme Investissements d'Avenir IHU FOReSIGHT (ANR-18-IAHU-01). C.R. was supported by an EU Horizon 2020 Marie Sklodowska-Curie Action fellowship (H2020-MSCA-IF-2014 #661527). H.S. and K.S. are employees of deCODE Genetics/Amgen. The other authors declare no competing financial interests. TRIAL REGISTRATION NUMBER: N/A.
Assuntos
Fertilidade , Estudo de Associação Genômica Ampla , Gemelação Dizigótica , Animais , Feminino , Humanos , Gravidez , Proteínas de Transporte/genética , Fertilidade/genética , Hormônios , Proteínas/genética , Estados Unidos , Peixe-Zebra/genéticaRESUMO
Cannabis is widely used worldwide, yet its links to health outcomes are not fully understood. DNA methylation can serve as a mediator to link environmental exposures to health outcomes. We conducted an epigenome-wide association study (EWAS) of peripheral blood-based DNA methylation and lifetime cannabis use (ever vs. never) in a meta-analysis including 9436 participants (7795 European and 1641 African ancestry) from seven cohorts. Accounting for effects of cigarette smoking, our trans-ancestry EWAS meta-analysis revealed four CpG sites significantly associated with lifetime cannabis use at a false discovery rate of 0.05 [Formula: see text]: cg22572071 near gene ADGRF1, cg15280358 in ADAM12, cg00813162 in ACTN1, and cg01101459 near LINC01132. Additionally, our EWAS analysis in participants who never smoked cigarettes identified another epigenome-wide significant CpG site, cg14237301 annotated to APOBR. We used a leave-one-out approach to evaluate methylation scores constructed as a weighted sum of the significant CpGs. The best model can explain 3.79% of the variance in lifetime cannabis use. These findings unravel the DNA methylation changes associated with lifetime cannabis use that are independent of cigarette smoking and may serve as a starting point for further research on the mechanisms through which cannabis exposure impacts health outcomes.
RESUMO
Background: Smoking-associated DNA methylation levels identified through epigenome-wide association studies (EWASs) are generally ascribed to smoking-reactive mechanisms, but the contribution of a shared genetic predisposition to smoking and DNA methylation levels is typically not accounted for. Methods: We exploited a strong within-family design, that is, the discordant monozygotic twin design, to study reactiveness of DNA methylation in blood cells to smoking and reversibility of methylation patterns upon quitting smoking. Illumina HumanMethylation450 BeadChip data were available for 769 monozygotic twin pairs (mean age = 36 years, range = 18-78, 70% female), including pairs discordant or concordant for current or former smoking. Results: In pairs discordant for current smoking, 13 differentially methylated CpGs were found between current smoking twins and their genetically identical co-twin who never smoked. Top sites include multiple CpGs in CACNA1D and GNG12, which encode subunits of a calcium voltage-gated channel and G protein, respectively. These proteins interact with the nicotinic acetylcholine receptor, suggesting that methylation levels at these CpGs might be reactive to nicotine exposure. All 13 CpGs have been previously associated with smoking in unrelated individuals and data from monozygotic pairs discordant for former smoking indicated that methylation patterns are to a large extent reversible upon smoking cessation. We further showed that differences in smoking level exposure for monozygotic twins who are both current smokers but differ in the number of cigarettes they smoke are reflected in their DNA methylation profiles. Conclusions: In conclusion, by analysing data from monozygotic twins, we robustly demonstrate that DNA methylation level in human blood cells is reactive to cigarette smoking. Funding: We acknowledge funding from the National Institute on Drug Abuse grant DA049867, the Netherlands Organization for Scientific Research (NWO): Biobanking and Biomolecular Research Infrastructure (BBMRI-NL, NWO 184.033.111) and the BBRMI-NL-financed BIOS Consortium (NWO 184.021.007), NWO Large Scale infrastructures X-Omics (184.034.019), Genotype/phenotype database for behaviour genetic and genetic epidemiological studies (ZonMw Middelgroot 911-09-032); Netherlands Twin Registry Repository: researching the interplay between genome and environment (NWO-Groot 480-15-001/674); the Avera Institute, Sioux Falls (USA), and the National Institutes of Health (NIH R01 HD042157-01A1, MH081802, Grand Opportunity grants 1RC2 MH089951 and 1RC2 MH089995); epigenetic data were generated at the Human Genomics Facility (HuGe-F) at ErasmusMC Rotterdam. Cotinine assaying was sponsored by the Neuroscience Campus Amsterdam. DIB acknowledges the Royal Netherlands Academy of Science Professor Award (PAH/6635).
The genetic information of people who smoke present distinctive characteristics. In particular, previous research has revealed differences in patterns of DNA methylation, a type of chemical modification that helps cells switch certain genes on or off. However, most of these studies could not establish for sure whether these changes were caused by smoking, predisposed individuals to smoke, or were driven by underlying genetic variation in the DNA sequence itself. To investigate this question, van Dongen et al. examined DNA methylation data from the blood cells of over 700 pairs of identical twins. These individuals share the exact same genetic information, making it possible to better evaluate the impact of lifestyle on DNA modifications. The analyses identified differences in methylation at 13 DNA locations in pairs of twins where one was a current smoker and their sibling had never smoked. Two of the genes code for proteins involved in the response to nicotine, the primary addictive chemical in cigarette smoke. The differences were smaller if one of the twins had stopped smoking, suggesting that quitting can help to reverse some of these changes. These findings confirm that DNA methylation in blood cells is influenced by cigarette smoke, which could help to better understand smoking-associated diseases. They also demonstrate how useful identical twins studies can be to identify methylation changes that are markers of lifestyle.
Assuntos
Metilação de DNA , Gêmeos Monozigóticos , Estados Unidos , Feminino , Humanos , Adolescente , Adulto Jovem , Adulto , Pessoa de Meia-Idade , Idoso , Masculino , Gêmeos Monozigóticos/genética , Bancos de Espécimes Biológicos , Fumar/genética , EpigenomaRESUMO
This study introduces and illustrates the potential of an integrated multi-omics approach in investigating the underlying biology of complex traits such as childhood aggressive behavior. In 645 twins (cases = 42%), we trained single- and integrative multi-omics models to identify biomarkers for subclinical aggression and investigated the connections among these biomarkers. Our data comprised transmitted and two non-transmitted polygenic scores (PGSs) for 15 traits, 78,772 CpGs, and 90 metabolites. The single-omics models selected 31 PGSs, 1614 CpGs, and 90 metabolites, and the multi-omics model comprised 44 PGSs, 746 CpGs, and 90 metabolites. The predictive accuracy for these models in the test (N = 277, cases = 42%) and independent clinical data (N = 142, cases = 45%) ranged from 43 to 57%. We observed strong connections between DNA methylation, amino acids, and parental non-transmitted PGSs for ADHD, Autism Spectrum Disorder, intelligence, smoking initiation, and self-reported health. Aggression-related omics traits link to known and novel risk factors, including inflammation, carcinogens, and smoking.
Assuntos
Transtorno do Espectro Autista , Multiômica , Humanos , Cognição , Biomarcadores , AgressãoRESUMO
BACKGROUND: DNA methylation (DNAm) is associated with time-varying environmental factors that contribute to major depressive disorder (MDD) risk. We sought to test whether DNAm signatures of lifestyle and biochemical factors were associated with MDD to reveal dynamic biomarkers of MDD risk that may be amenable to lifestyle interventions. METHODS: Here, we calculated methylation scores (MS) at multiple p-value thresholds for lifestyle (BMI, smoking, alcohol consumption, and educational attainment) and biochemical (high-density lipoprotein (HDL) and total cholesterol) factors in Generation Scotland (GS) (N=9,502) and in a replication cohort (ALSPACadults, N=565), using CpG sites reported in previous well-powered methylome-wide association studies. We also compared their predictive accuracy for MDD to a MDD MS in an independent GS sub-sample (N=4,432). FINDINGS: Each trait MS was significantly associated with its corresponding phenotype in GS (ßrange=0.089-1.457) and in ALSPAC (ßrange=0.078-2.533). Each MS was also significantly associated with MDD before and after adjustment for its corresponding phenotype in GS (ßrange=0.053-0.145). After accounting for relevant lifestyle factors, MS for educational attainment (ß=0.094) and alcohol consumption (MSp-value<0.01-0.5; ßrange=-0.069-0.083) remained significantly associated with MDD in GS. Smoking (AUC=0.569) and educational attainment (AUC=0.585) MSs could discriminate MDD from controls better than the MDD MS (AUC=0.553) in the independent GS sub-sample. Analyses implicating MDD did not replicate across ALSPAC, although the direction of effect was consistent for all traits when adjusting for the MS corresponding phenotypes. INTERPRETATION: We showed that lifestyle and biochemical MS were associated with MDD before and after adjustment for their corresponding phenotypes (pnominal<0.05), but not when smoking, alcohol consumption, and BMI were also included as covariates. MDD results did not replicate in the smaller, female-only independent ALSPAC cohort (NALSPAC=565; NGS=9,502), potentially due to demographic differences or low statistical power, but effect sizes were consistent with the direction reported in GS. DNAm scores for modifiable MDD risk factors may contribute to disease vulnerability and, in some cases, explain additional variance to their observed phenotypes. FUNDING: Wellcome Trust.
Assuntos
Transtorno Depressivo Maior , Herança Multifatorial , Estudos de Coortes , Metilação de DNA , Transtorno Depressivo Maior/etiologia , Transtorno Depressivo Maior/genética , Epigenoma , Feminino , Estudo de Associação Genômica Ampla , HumanosRESUMO
We examined the performance of methylation scores (MS) and polygenic scores (PGS) for birth weight, BMI, prenatal maternal smoking exposure, and smoking status to assess the extent to which MS could predict these traits and exposures over and above the PGS in a multi-omics prediction model. MS may be seen as the epigenetic equivalent of PGS, but because of their dynamic nature and sensitivity of non-genetic exposures may add to complex trait prediction independently of PGS. MS and PGS were calculated based on genotype data and DNA-methylation data in blood samples from adults (Illumina 450 K; N = 2,431; mean age 35.6) and in buccal samples from children (Illumina EPIC; N = 1,128; mean age 9.6) from the Netherlands Twin Register. Weights to construct the scores were obtained from results of large epigenome-wide association studies (EWASs) based on whole blood or cord blood methylation data and genome-wide association studies (GWASs). In adults, MSs in blood predicted independently from PGSs, and outperformed PGSs for BMI, prenatal maternal smoking, and smoking status, but not for birth weight. The largest amount of variance explained by the multi-omics prediction model was for current vs. never smoking (54.6%) of which 54.4% was captured by the MS. The two predictors captured 16% of former vs. never smoking initiation variance (MS:15.5%, PGS: 0.5%), 17.7% of prenatal maternal smoking variance (MS:16.9%, PGS: 0.8%), 11.9% of BMI variance (MS: 6.4%, PGS 5.5%), and 1.9% of birth weight variance (MS: 0.4%, PGS: 1.5%). In children, MSs in buccal samples did not show independent predictive value. The largest amount of variance explained by the two predictors was for prenatal maternal smoking (2.6%), where the MSs contributed 1.5%. These results demonstrate that blood DNA MS in adults explain substantial variance in current smoking, large variance in former smoking, prenatal smoking, and BMI, but not in birth weight. Buccal cell DNA methylation scores have lower predictive value, which could be due to different tissues in the EWAS discovery studies and target sample, as well as to different ages. This study illustrates the value of combining polygenic scores with information from methylation data for complex traits and exposure prediction.
RESUMO
Strong associations between neural tube defects (NTDs) and monozygotic (MZ) twinning have long been noted, and it has been suggested that NTD cases who do not present as MZ twins may be the survivors of MZ twinning events. We have recently shown that MZ twins carry a strong, distinctive DNA methylation signature and have developed an algorithm based on genomewide DNA methylation array data that distinguishes MZ twins from dizygotic twins and other relatives at well above chance level. We have applied this algorithm to published methylation data from five fetal tissues (placental chorionic villi, kidney, spinal cord, brain and muscle) collected from spina bifida cases (n = 22), anencephalic cases (n = 15) and controls (n = 19). We see no difference in signature between cases and controls, providing no support for a common etiological role of MZ twinning in NTDs. The strong associations therefore continue to await elucidation.
Assuntos
Defeitos do Tubo Neural , Gemelaridade Monozigótica , Doenças em Gêmeos/genética , Epigênese Genética , Feminino , Humanos , Defeitos do Tubo Neural/genética , Placenta , Gravidez , Gemelaridade Monozigótica/genética , Gêmeos Dizigóticos/genética , Gêmeos Monozigóticos/genéticaRESUMO
DNA methylation profiles of aggressive behavior may capture lifetime cumulative effects of genetic, stochastic, and environmental influences associated with aggression. Here, we report the first large meta-analysis of epigenome-wide association studies (EWAS) of aggressive behavior (N = 15,324 participants). In peripheral blood samples of 14,434 participants from 18 cohorts with mean ages ranging from 7 to 68 years, 13 methylation sites were significantly associated with aggression (alpha = 1.2 × 10-7; Bonferroni correction). In cord blood samples of 2425 children from five cohorts with aggression assessed at mean ages ranging from 4 to 7 years, 83% of these sites showed the same direction of association with childhood aggression (r = 0.74, p = 0.006) but no epigenome-wide significant sites were found. Top-sites (48 at a false discovery rate of 5% in the peripheral blood meta-analysis or in a combined meta-analysis of peripheral blood and cord blood) have been associated with chemical exposures, smoking, cognition, metabolic traits, and genetic variation (mQTLs). Three genes whose expression levels were associated with top-sites were previously linked to schizophrenia and general risk tolerance. At six CpGs, DNA methylation variation in blood mirrors variation in the brain. On average 44% (range = 3-82%) of the aggression-methylation association was explained by current and former smoking and BMI. These findings point at loci that are sensitive to chemical exposures with potential implications for neuronal functions. We hope these results to be a starting point for studies leading to applications as peripheral biomarkers and to reveal causal relationships with aggression and related traits.
Assuntos
Metilação de DNA , Epigenoma , Adolescente , Adulto , Idoso , Agressão , Criança , Pré-Escolar , Ilhas de CpG/genética , Metilação de DNA/genética , Epigênese Genética/genética , Estudo de Associação Genômica Ampla , Humanos , Longevidade , Pessoa de Meia-Idade , Adulto JovemRESUMO
Insights into individual differences in gene expression and its heritability (h2) can help in understanding pathways from DNA to phenotype. We estimated the heritability of gene expression of 52,844 genes measured in whole blood in the largest twin RNA-Seq sample to date (1497 individuals including 459 monozygotic twin pairs and 150 dizygotic twin pairs) from classical twin modeling and identity-by-state-based approaches. We estimated for each gene h2total, composed of cis-heritability (h2cis, the variance explained by single nucleotide polymorphisms in the cis-window of the gene), and trans-heritability (h2res, the residual variance explained by all other genome-wide variants). Mean h2total was 0.26, which was significantly higher than heritability estimates earlier found in a microarray-based study using largely overlapping (>60%) RNA samples (mean h2 = 0.14, p = 6.15 × 10-258). Mean h2cis was 0.06 and strongly correlated with beta of the top cis expression quantitative loci (eQTL, ρ = 0.76, p < 10-308) and with estimates from earlier RNA-Seq-based studies. Mean h2res was 0.20 and correlated with the beta of the corresponding trans-eQTL (ρ = 0.04, p < 1.89 × 10-3) and was significantly higher for genes involved in cytokine-cytokine interactions (p = 4.22 × 10-15), many other immune system pathways, and genes identified in genome-wide association studies for various traits including behavioral disorders and cancer. This study provides a thorough characterization of cis- and trans-h2 estimates of gene expression, which is of value for interpretation of GWAS and gene expression studies.
Assuntos
Interação Gene-Ambiente , Polimorfismo de Nucleotídeo Único , Característica Quantitativa Herdável , Adolescente , Adulto , Idoso , Feminino , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Locos de Características Quantitativas , RNA-Seq/métodos , Gêmeos Dizigóticos/genética , Gêmeos Monozigóticos/genéticaRESUMO
BACKGROUND: A large number of analysis strategies are available for DNA methylation (DNAm) array and RNA-seq datasets, but it is unclear which strategies are best to use. We compare commonly used strategies and report how they influence results in large cohort studies. RESULTS: We tested the associations of DNAm and RNA expression with age, BMI, and smoking in four different cohorts (n = ~ 2900). By comparing strategies against the base model on the number and percentage of replicated CpGs for DNAm analyses or genes for RNA-seq analyses in a leave-one-out cohort replication approach, we find the choice of the normalization method and statistical test does not strongly influence the results for DNAm array data. However, adjusting for cell counts or hidden confounders substantially decreases the number of replicated CpGs for age and increases the number of replicated CpGs for BMI and smoking. For RNA-seq data, the choice of the normalization method, gene expression inclusion threshold, and statistical test does not strongly influence the results. Including five principal components or excluding correction of technical covariates or cell counts decreases the number of replicated genes. CONCLUSIONS: Results were not influenced by the normalization method or statistical test. However, the correction method for cell counts, technical covariates, principal components, and/or hidden confounders does influence the results.
Assuntos
Metilação de DNA , Epigenômica/métodos , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA , Adulto , Idoso , Estudos de Coortes , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Adulto JovemRESUMO
Inferring a person's smoking habit and history from blood is relevant for complementing or replacing self-reports in epidemiological and public health research, and for forensic applications. However, a finite DNA methylation marker set and a validated statistical model based on a large dataset are not yet available. Employing 14 epigenome-wide association studies for marker discovery, and using data from six population-based cohorts (N = 3764) for model building, we identified 13 CpGs most suitable for inferring smoking versus non-smoking status from blood with a cumulative Area Under the Curve (AUC) of 0.901. Internal fivefold cross-validation yielded an average AUC of 0.897 ± 0.137, while external model validation in an independent population-based cohort (N = 1608) achieved an AUC of 0.911. These 13 CpGs also provided accurate inference of current (average AUCcrossvalidation 0.925 ± 0.021, AUCexternalvalidation0.914), former (0.766 ± 0.023, 0.699) and never smoking (0.830 ± 0.019, 0.781) status, allowed inferring pack-years in current smokers (10 pack-years 0.800 ± 0.068, 0.796; 15 pack-years 0.767 ± 0.102, 0.752) and inferring smoking cessation time in former smokers (5 years 0.774 ± 0.024, 0.760; 10 years 0.766 ± 0.033, 0.764; 15 years 0.767 ± 0.020, 0.754). Model application to children revealed highly accurate inference of the true non-smoking status (6 years of age: accuracy 0.994, N = 355; 10 years: 0.994, N = 309), suggesting prenatal and passive smoking exposure having no impact on model applications in adults. The finite set of DNA methylation markers allow accurate inference of smoking habit, with comparable accuracy as plasma cotinine use, and smoking history from blood, which we envision becoming useful in epidemiology and public health research, and in medical and forensic applications.
Assuntos
Cotinina/sangue , Metilação de DNA , DNA/sangue , Epigenômica/métodos , Fumar/efeitos adversos , Adulto , Área Sob a Curva , Biomarcadores/sangue , Feminino , Humanos , Masculino , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Fumar/genética , Abandono do Hábito de FumarRESUMO
Birthweight is associated with health outcomes across the life course, DNA methylation may be an underlying mechanism. In this meta-analysis of epigenome-wide association studies of 8,825 neonates from 24 birth cohorts in the Pregnancy And Childhood Epigenetics Consortium, we find that DNA methylation in neonatal blood is associated with birthweight at 914 sites, with a difference in birthweight ranging from -183 to 178 grams per 10% increase in methylation (PBonferroni < 1.06 x 10-7). In additional analyses in 7,278 participants, <1.3% of birthweight-associated differential methylation is also observed in childhood and adolescence, but not adulthood. Birthweight-related CpGs overlap with some Bonferroni-significant CpGs that were previously reported to be related to maternal smoking (55/914, p = 6.12 x 10-74) and BMI in pregnancy (3/914, p = 1.13x10-3), but not with those related to folate levels in pregnancy. Whether the associations that we observe are causal or explained by confounding or fetal growth influencing DNA methylation (i.e. reverse causality) requires further research.
Assuntos
Peso ao Nascer/genética , DNA/metabolismo , Epigênese Genética , Genoma Humano , Adolescente , Adulto , Índice de Massa Corporal , Criança , Ilhas de CpG , DNA/genética , Metilação de DNA , Feminino , Desenvolvimento Fetal/genética , Feto , Ácido Fólico/sangue , Estudo de Associação Genômica Ampla , Humanos , Recém-Nascido , Masculino , Gravidez , Efeitos Tardios da Exposição Pré-Natal/sangue , Efeitos Tardios da Exposição Pré-Natal/etiologia , Efeitos Tardios da Exposição Pré-Natal/genética , Efeitos Tardios da Exposição Pré-Natal/fisiopatologia , Fumar/efeitos adversos , Fumar/sangue , Fumar/genéticaRESUMO
BACKGROUND: DNA methylation alteration extensively associates with smoking and is a plausible link between smoking and adverse health. We examined the association between epigenome-wide DNA methylation and serum cotinine levels as a proxy of nicotine exposure and smoking quantity, assessed the role of SNPs in these associations, and evaluated molecular mediation by methylation in a sample of biochemically verified current smokers (N = 310). RESULTS: DNA methylation at 50 CpG sites was associated (FDR < 0.05) with cotinine levels, 17 of which are novel associations. As cotinine levels are influenced not only by nicotine intake but also by CYP2A6-mediated nicotine metabolism rate, we performed secondary analyses adjusting for genetic risk score of nicotine metabolism rate and identified five additional novel associations. We further assessed the potential role of genetic variants in the detected association between methylation and cotinine levels observing 124 cis and 3898 trans methylation quantitative trait loci (meQTLs). Nineteen of these SNPs were also associated with cotinine levels (FDR < 0.05). Further, at seven CpG sites, we observed a trend (P < 0.05) that altered DNA methylation mediates the effect of SNPs on nicotine exposure rather than a direct consequence of smoking. Finally, we performed replication of our findings in two independent cohorts of biochemically verified smokers (N = 450 and N = 79). CONCLUSIONS: Using cotinine, a biomarker of nicotine exposure, we replicated and extended identification of novel epigenetic associations in smoking-related genes. We also demonstrated that DNA methylation in some of the identified loci is driven by the underlying genotype and may mediate the causal effect of genotype on cotinine levels.
Assuntos
Cotinina/sangue , Metilação de DNA , Epigenômica/métodos , Polimorfismo de Nucleotídeo Único , Fumar/genética , Adulto , Idoso , Ilhas de CpG , Citocromo P-450 CYP2A6/genética , Feminino , Estudos de Associação Genética , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Fumar/sangue , Adulto JovemRESUMO
The majority of lung cancer is caused by tobacco smoking, and lung cancer-relevant epigenetic markers have been identified in relation to smoking exposure. Still, smoking-related markers appear to mediate little of the effect of smoking on lung cancer. Thus in order to identify disease-relevant markers and enhance our understanding of pathways, a wide search is warranted. Through an epigenome-wide search within a case-control study (131 cases, 129 controls) nested in a Norwegian prospective cohort of women, we found 25 CpG sites associated with lung cancer. Twenty-three were classified as associated with smoking (LC-AwS), and two were classified as unassociated with smoking (LC-non-AwS), as they remained associated with lung cancer after stringent adjustment for smoking exposure using the comprehensive smoking index (CSI): cg10151248 (PC, CSI-adjusted odds ratio (OR) = 0.34 [0.23-0.52] per standard deviation change in methylation) and cg13482620 (B3GNTL1, CSI-adjusted OR = 0.33 [0.22-0.50]). Analysis among never smokers and a cohort of smoking-discordant twins confirmed the classification of the two LC-non-AwS CpG sites. Gene expression profiles demonstrated that the LC-AwS CpG sites had different enriched pathways than LC-non-AwS sites. In conclusion, using blood-derived DNA methylation and gene expression profiles from a prospective lung cancer case-control study in women, we identified 25 CpG lung cancer markers prior to diagnosis, two of which were LC-non-AwS markers and related to distinct pathways.
Assuntos
Metilação de DNA , Neoplasias Pulmonares/sangue , Neoplasias Pulmonares/genética , Transcriptoma , Estudos de Coortes , Ilhas de CpG/genética , Feminino , Predisposição Genética para Doença/genética , Humanos , Neoplasias Pulmonares/diagnóstico , Pessoa de Meia-Idade , Noruega , Análise de Sequência com Séries de OligonucleotídeosRESUMO
Educational attainment is a key behavioural measure in studies of cognitive and physical health, and socioeconomic status. We measured DNA methylation at 410,746 CpGs (N = 4152) and identified 58 CpGs associated with educational attainment at loci characterized by pleiotropic functions shared with neuronal, immune and developmental processes. Associations overlapped with those for smoking behaviour, but remained after accounting for smoking at many CpGs: Effect sizes were on average 28% smaller and genome-wide significant at 11 CpGs after adjusting for smoking and were 62% smaller in never smokers. We examined sources and biological implications of education-related methylation differences, demonstrating correlations with maternal prenatal folate, smoking and air pollution signatures, and associations with gene expression in cis, dynamic methylation in foetal brain, and correlations between blood and brain. Our findings show that the methylome of lower-educated people resembles that of smokers beyond effects of their own smoking behaviour and shows traces of various other exposures.
RESUMO
Epigenome-wide association studies (EWAS) provide an alternative approach for studying human disease through consideration of non-genetic variants such as altered DNA methylation. To advance the complex interpretation of EWAS, we developed eFORGE (http://eforge.cs.ucl.ac.uk/), a new standalone and web-based tool for the analysis and interpretation of EWAS data. eFORGE determines the cell type-specific regulatory component of a set of EWAS-identified differentially methylated positions. This is achieved by detecting enrichment of overlap with DNase I hypersensitive sites across 454 samples (tissues, primary cell types, and cell lines) from the ENCODE, Roadmap Epigenomics, and BLUEPRINT projects. Application of eFORGE to 20 publicly available EWAS datasets identified disease-relevant cell types for several common diseases, a stem cell-like signature in cancer, and demonstrated the ability to detect cell-composition effects for EWAS performed on heterogeneous tissues. Our approach bridges the gap between large-scale epigenomics data and EWAS-derived target selection to yield insight into disease etiology.
Assuntos
Epigenômica , Transdução de Sinais , Software , Estatística como Assunto , Metilação de DNA/genética , Estudo de Associação Genômica Ampla , Humanos , Cariotipagem , Esclerose Múltipla/genética , Especificidade de Órgãos/genética , Transdução de Sinais/genética , Células-Tronco/metabolismoRESUMO
BACKGROUND: Epigenetic change is a hallmark of ageing but its link to ageing mechanisms in humans remains poorly understood. While DNA methylation at many CpG sites closely tracks chronological age, DNA methylation changes relevant to biological age are expected to gradually dissociate from chronological age, mirroring the increased heterogeneity in health status at older ages. RESULTS: Here, we report on the large-scale identification of 6366 age-related variably methylated positions (aVMPs) identified in 3295 whole blood DNA methylation profiles, 2044 of which have a matching RNA-seq gene expression profile. aVMPs are enriched at polycomb repressed regions and, accordingly, methylation at those positions is associated with the expression of genes encoding components of polycomb repressive complex 2 (PRC2) in trans. Further analysis revealed trans-associations for 1816 aVMPs with an additional 854 genes. These trans-associated aVMPs are characterized by either an age-related gain of methylation at CpG islands marked by PRC2 or a loss of methylation at enhancers. This distinct pattern extends to other tissues and multiple cancer types. Finally, genes associated with aVMPs in trans whose expression is variably upregulated with age (733 genes) play a key role in DNA repair and apoptosis, whereas downregulated aVMP-associated genes (121 genes) are mapped to defined pathways in cellular metabolism. CONCLUSIONS: Our results link age-related changes in DNA methylation to fundamental mechanisms that are thought to drive human ageing.