RESUMEN
The rising prevalence and legalisation of cannabis worldwide have underscored the need for a comprehensive understanding of its biological impact, particularly on mental health. Epigenetic mechanisms, specifically DNA methylation, have gained increasing recognition as vital factors in the interplay between risk factors and mental health. This study aimed to explore the effects of current cannabis use and high-potency cannabis on DNA methylation in two independent cohorts of individuals experiencing first-episode psychosis (FEP) compared to control subjects. The combined sample consisted of 682 participants (188 current cannabis users and 494 never users). DNA methylation profiles were generated on blood-derived DNA samples using the Illumina DNA methylation array platform. A meta-analysis across cohorts identified one CpG site (cg11669285) in the CAVIN1 gene that showed differential methylation with current cannabis use, surpassing the array-wide significance threshold, and independent of the tobacco-related epigenetic signature. Furthermore, a CpG site localised in the MCU gene (cg11669285) achieved array-wide significance in an analysis of the effect of high-potency (THC = > 10%) current cannabis use. Pathway and regional analyses identified cannabis-related epigenetic variation proximal to genes linked to immune and mitochondrial function, both of which are known to be influenced by cannabinoids. Interestingly, a model including an interaction term between cannabis use and FEP status identified two sites that were significantly associated with current cannabis use with a nominally significant interaction suggesting that FEP status might moderate how cannabis use affects DNA methylation. Overall, these findings contribute to our understanding of the epigenetic impact of current cannabis use and highlight potential molecular pathways affected by cannabis exposure.
RESUMEN
BACKGROUND: Due to interindividual variation in the cellular composition of the human cortex, it is essential that covariates that capture these differences are included in epigenome-wide association studies using bulk tissue. As experimentally derived cell counts are often unavailable, computational solutions have been adopted to estimate the proportion of different cell types using DNA methylation data. Here, we validate and profile the use of an expanded reference DNA methylation dataset incorporating two neuronal and three glial cell subtypes for quantifying the cellular composition of the human cortex. RESULTS: We tested eight reference panels containing different combinations of neuronal- and glial cell types and characterised their performance in deconvoluting cell proportions from computationally reconstructed or empirically derived human cortex DNA methylation data. Our analyses demonstrate that while these novel brain deconvolution models produce accurate estimates of cellular proportions from profiles generated on postnatal human cortex samples, they are not appropriate for the use in prenatal cortex or cerebellum tissue samples. Applying our models to an extensive collection of empirical datasets, we show that glial cells are twice as abundant as neuronal cells in the human cortex and identify significant associations between increased Alzheimer's disease neuropathology and the proportion of specific cell types including a decrease in NeuNNeg/SOX10Neg nuclei and an increase of NeuNNeg/SOX10Pos nuclei. CONCLUSIONS: Our novel deconvolution models produce accurate estimates for cell proportions in the human cortex. These models are available as a resource to the community enabling the control of cellular heterogeneity in epigenetic studies of brain disorders performed on bulk cortex tissue.
Asunto(s)
Metilación de ADN , Epigénesis Genética , Femenino , Embarazo , Recién Nacido , Humanos , Neuroglía , Corteza Cerebral , Neuronas/metabolismoRESUMEN
Development of the human pancreas requires the precise temporal control of gene expression via epigenetic mechanisms and the binding of key transcription factors. We quantified genome-wide patterns of DNA methylation in human fetal pancreatic samples from donors aged 6 to 21 post-conception weeks. We found dramatic changes in DNA methylation across pancreas development, with > 21% of sites characterized as developmental differentially methylated positions (dDMPs) including many annotated to genes associated with monogenic diabetes. An analysis of DNA methylation in postnatal pancreas tissue showed that the dramatic temporal changes in DNA methylation occurring in the developing pancreas are largely limited to the prenatal period. Significant differences in DNA methylation were observed between males and females at a number of autosomal sites, with a small proportion of sites showing sex-specific DNA methylation trajectories across pancreas development. Pancreas dDMPs were not distributed equally across the genome and were depleted in regulatory domains characterized by open chromatin and the binding of known pancreatic development transcription factors. Finally, we compared our pancreas dDMPs to previous findings from the human brain, identifying evidence for tissue-specific developmental changes in DNA methylation. This study represents the first systematic exploration of DNA methylation patterns during human fetal pancreas development and confirms the prenatal period as a time of major epigenomic plasticity.
Asunto(s)
Metilación de ADN , Páncreas , Humanos , Páncreas/metabolismo , Páncreas/embriología , Femenino , Masculino , Regulación del Desarrollo de la Expresión Génica , Islas de CpG , Epigénesis Genética , Genoma Humano , Feto/metabolismoRESUMEN
Most epigenetic epidemiology to date has utilized microarrays to identify positions in the genome where variation in DNA methylation is associated with environmental exposures or disease. However, these profile less than 3% of DNA methylation sites in the human genome, potentially missing affected loci and preventing the discovery of disrupted biological pathways. Third generation sequencing technologies, including Nanopore sequencing, have the potential to revolutionize the generation of epigenetic data, not only by providing genuine genome-wide coverage but profiling epigenetic modifications direct from native DNA. Here we assess the viability of using Nanopore sequencing for epidemiology by performing a comparison with DNA methylation quantified using the most comprehensive microarray available, the Illumina EPIC array. We implemented a CRISPR-Cas9 targeted sequencing approach in concert with Nanopore sequencing to profile DNA methylation in three genomic regions to attempt to rediscover genomic positions that existing technologies have shown are differentially methylated in tobacco smokers. Using Nanopore sequencing reads, DNA methylation was quantified at 1779 CpGs across three regions, providing a finer resolution of DNA methylation patterns compared to the EPIC array. The correlation of estimated levels of DNA methylation between platforms was high. Furthermore, we identified 12 CpGs where hypomethylation was significantly associated with smoking status, including 10 within the AHRR gene. In summary, Nanopore sequencing is a valid option for identifying genomic loci where large differences in DNAm are associated with a phenotype and has the potential to advance our understanding of the role differential methylation plays in the etiology of complex disease.
Asunto(s)
Metilación de ADN , Secuenciación de Nanoporos , Islas de CpG/genética , Metilación de ADN/genética , Epigénesis Genética/genética , Epigenómica , HumanosRESUMEN
Despite exercise intolerance being predictive of outcomes in pulmonary arterial hypertension (PAH), its underlying cardiac mechanisms are not well described. The aim of the study was to explore the biventricular response to exercise and its associations with cardiorespiratory fitness in children with PAH. Participants underwent incremental cardiopulmonary exercise testing and simultaneous exercise echocardiography on a recumbent cycle ergometer. Linear mixed models were used to assess cardiac function variance and associations between cardiac and metabolic parameters during exercise. Eleven participants were included with a mean age of 13.4 ± 2.9 yr old. Right ventricle (RV) systolic pressure (RVsp) increased from a mean of 59 ± 25 mmHg at rest to 130 ± 40 mmHg at peak exercise (P < 0.001), whereas RV fractional area change (RV-FAC) and RV-free wall longitudinal strain (RVFW-Sl) worsened (35.2 vs. 27%, P = 0.09 and -16.6 vs. -14.6%, P = 0.1, respectively). At low- and moderate-intensity exercise, RVsp was positively associated with stroke volume and O2 pulse (P < 0.1). At high-intensity exercise, RV-FAC, RVFW-Sl, and left ventricular longitudinal strain were positively associated with oxygen uptake and O2 pulse (P < 0.1), whereas stroke volume decreased toward peak (P = 0.04). In children with PAH, the increase of pulmonary pressure alone does not limit peak exercise, but rather the concomitant reduced RV functional reserve, resulting in RV to pulmonary artery (RV-PA) uncoupling, worsening of interventricular interaction and LV dysfunction. A better mechanistic understanding of PAH exercise physiopathology can inform stress testing and cardiac rehabilitation in this population.NEW & NOTEWORTHY In children with pulmonary arterial hypertension, there is a marked increase in pulmonary artery pressure during physical activity, but this is not the underlying mechanism that limits exercise. Instead, right ventricle-to-pulmonary artery uncoupling occurs at the transition from moderate to high-intensity exercise and correlates with lower peak oxygen uptake. This highlights the more complex underlying pathological responses and the need for multiparametric assessment of cardiac function reserve in these patients when feasible.
Asunto(s)
Capacidad Cardiovascular , Prueba de Esfuerzo , Función Ventricular Derecha , Humanos , Masculino , Femenino , Niño , Adolescente , Ejercicio Físico , Tolerancia al Ejercicio , Función Ventricular Izquierda , Hipertensión Arterial Pulmonar/fisiopatología , Hipertensión Pulmonar/fisiopatología , Consumo de Oxígeno , Presión Ventricular , Arteria Pulmonar/fisiopatología , Ecocardiografía , Volumen SistólicoRESUMEN
ABTRACT: Studies conducted in psychotic disorders have shown that DNA-methylation (DNAm) is sensitive to the impact of Childhood Adversity (CA). However, whether it mediates the association between CA and psychosis is yet to be explored. Epigenome wide association studies (EWAS) using the Illumina Infinium-Methylation EPIC array in peripheral blood tissue from 366 First-episode of psychosis and 517 healthy controls was performed. Adversity scores were created for abuse, neglect and composite adversity with the Childhood Trauma Questionnaire (CTQ). Regressions examining (I) CTQ scores with psychosis; (II) with DNAm EWAS level and (III) between DNAm and caseness, adjusted for a variety of confounders were conducted. Divide-Aggregate Composite-null Test for the composite null-hypothesis of no mediation effect was conducted. Enrichment analyses were conducted with missMethyl package and the KEGG database. Our results show that CA was associated with psychosis (Composite: OR = 1.68; p = <0.001; abuse: OR = 2.16; p < 0.001; neglect: OR = 2.27; p = <0.001). None of the CpG sites significantly mediated the adversity-psychosis association after Bonferroni correction (p < 8.1 × 10-8). However, 28, 34 and 29 differentially methylated probes associated with 21, 27, 20 genes passed a less stringent discovery threshold (p < 5 × 10-5) for composite, abuse and neglect respectively, with a lack of overlap between abuse and neglect. These included genes previously associated to psychosis in EWAS studies, such as PANK1, SPEG TBKBP1, TSNARE1 or H2R. Downstream gene ontology analyses did not reveal any biological pathways that survived false discovery rate correction. Although at a non-significant level, DNAm changes in genes previously associated with schizophrenia in EWAS studies may mediate the CA-psychosis association. These results and associated involved processes such as mitochondrial or histaminergic disfunction, immunity or neural signalling requires replication in well powered samples. The lack of overlap between mediating genes associated with abuse and neglect suggests differential biological trajectories linking CA subtypes and psychosis.
Asunto(s)
Experiencias Adversas de la Infancia , Pruebas Psicológicas , Trastornos Psicóticos , Autoinforme , Humanos , Niño , Metilación de ADN/genética , Epigenoma , Trastornos Psicóticos/genéticaRESUMEN
Most epigenome-wide association studies (EWAS) quantify DNA methylation (DNAm) in peripheral tissues such as whole blood to identify positions in the genome where variation is statistically associated with a trait or exposure. As whole blood comprises a mix of cell types, it is unclear whether trait-associated DNAm variation is specific to an individual cellular population. We collected three peripheral tissues (whole blood, buccal epithelial and nasal epithelial cells) from thirty individuals. Whole blood samples were subsequently processed using fluorescence-activated cell sorting (FACS) to purify five constituent cell-types (monocytes, granulocytes, CD4+ T cells, CD8+ T cells, and B cells). DNAm was profiled in all eight sample-types from each individual using the Illumina EPIC array. We identified significant differences in both the level and variability of DNAm between different sample types, and DNAm data-derived estimates of age and smoking were found to differ dramatically across sample types from the same individual. We found that for the majority of loci variation in DNAm in individual blood cell types was only weakly predictive of variance in DNAm measured in whole blood, although the proportion of variance explained was greater than that explained by either buccal or nasal epithelial samples. Covariation across sample types was much higher for DNAm sites influenced by genetic factors. Overall, we observe that DNAm variation in whole blood is additively influenced by a combination of the major blood cell types. For a subset of sites, however, variable DNAm detected in whole blood can be attributed to variation in a single blood cell type providing potential mechanistic insight about EWAS findings. Our results suggest that associations between whole blood DNAm and traits or exposures reflect differences in multiple cell types and our data will facilitate the interpretation of findings in epigenetic epidemiology.
Asunto(s)
Metilación de ADN , Epigénesis Genética , Epigenómica , Epidemiología Molecular , Células Sanguíneas , Epigenómica/métodos , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Humanos , Especificidad de Órganos/genética , TranscriptomaRESUMEN
INTRODUCTION: The established link between DNA methylation and pathophysiology of dementia, along with its potential role as a molecular mediator of lifestyle and environmental influences, positions blood-derived DNA methylation as a promising tool for early dementia risk detection. METHODS: In conjunction with an extensive array of machine learning techniques, we employed whole blood genome-wide DNA methylation data as a surrogate for 14 modifiable and non-modifiable factors in the assessment of dementia risk in independent dementia cohorts. RESULTS: We established a multivariate methylation risk score (MMRS) for identifying mild cognitive impairment cross-sectionally, independent of age and sex (P = 2.0 × 10-3). This score significantly predicted the prospective development of cognitive impairments in independent studies of Alzheimer's disease (hazard ratio for Rey's Auditory Verbal Learning Test (RAVLT)-Learning = 2.47) and Parkinson's disease (hazard ratio for MCI/dementia = 2.59). DISCUSSION: Our work shows the potential of employing blood-derived DNA methylation data in the assessment of dementia risk. HIGHLIGHTS: We used whole blood DNA methylation as a surrogate for 14 dementia risk factors. Created a multivariate methylation risk score for predicting cognitive impairment. Emphasized the role of machine learning and omics data in predicting dementia. The score predicts cognitive impairment development at the population level.
Asunto(s)
Disfunción Cognitiva , Metilación de ADN , Demencia , Humanos , Metilación de ADN/genética , Disfunción Cognitiva/genética , Disfunción Cognitiva/sangre , Disfunción Cognitiva/diagnóstico , Masculino , Femenino , Anciano , Demencia/genética , Demencia/sangre , Demencia/diagnóstico , Factores de Riesgo , Aprendizaje Automático , Estudios Transversales , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/sangre , Enfermedad de Alzheimer/diagnóstico , Estudios Prospectivos , Medición de Riesgo , Anciano de 80 o más AñosRESUMEN
BACKGROUND: The field of epigenomics holds great promise in understanding and treating disease with advances in machine learning (ML) and artificial intelligence being vitally important in this pursuit. Increasingly, research now utilises DNA methylation measures at cytosine-guanine dinucleotides (CpG) to detect disease and estimate biological traits such as aging. Given the challenge of high dimensionality of DNA methylation data, feature-selection techniques are commonly employed to reduce dimensionality and identify the most important subset of features. In this study, our aim was to test and compare a range of feature-selection methods and ML algorithms in the development of a novel DNA methylation-based telomere length (TL) estimator. We utilised both nested cross-validation and two independent test sets for the comparisons. RESULTS: We found that principal component analysis in advance of elastic net regression led to the overall best performing estimator when evaluated using a nested cross-validation analysis and two independent test cohorts. This approach achieved a correlation between estimated and actual TL of 0.295 (83.4% CI [0.201, 0.384]) on the EXTEND test data set. Contrastingly, the baseline model of elastic net regression with no prior feature reduction stage performed less well in general-suggesting a prior feature-selection stage may have important utility. A previously developed TL estimator, DNAmTL, achieved a correlation of 0.216 (83.4% CI [0.118, 0.310]) on the EXTEND data. Additionally, we observed that different DNA methylation-based TL estimators, which have few common CpGs, are associated with many of the same biological entities. CONCLUSIONS: The variance in performance across tested approaches shows that estimators are sensitive to data set heterogeneity and the development of an optimal DNA methylation-based estimator should benefit from the robust methodological approach used in this study. Moreover, our methodology which utilises a range of feature-selection approaches and ML algorithms could be applied to other biological markers and disease phenotypes, to examine their relationship with DNA methylation and predictive value.
Asunto(s)
Metilación de ADN , Epigenómica , Homeostasis del Telómero , Algoritmos , Epigenómica/métodos , Análisis de Regresión , Aprendizaje Automático , HumanosRESUMEN
DNA methylation profiles of aggressive behavior may capture lifetime cumulative effects of genetic, stochastic, and environmental influences associated with aggression. Here, we report the first large meta-analysis of epigenome-wide association studies (EWAS) of aggressive behavior (N = 15,324 participants). In peripheral blood samples of 14,434 participants from 18 cohorts with mean ages ranging from 7 to 68 years, 13 methylation sites were significantly associated with aggression (alpha = 1.2 × 10-7; Bonferroni correction). In cord blood samples of 2425 children from five cohorts with aggression assessed at mean ages ranging from 4 to 7 years, 83% of these sites showed the same direction of association with childhood aggression (r = 0.74, p = 0.006) but no epigenome-wide significant sites were found. Top-sites (48 at a false discovery rate of 5% in the peripheral blood meta-analysis or in a combined meta-analysis of peripheral blood and cord blood) have been associated with chemical exposures, smoking, cognition, metabolic traits, and genetic variation (mQTLs). Three genes whose expression levels were associated with top-sites were previously linked to schizophrenia and general risk tolerance. At six CpGs, DNA methylation variation in blood mirrors variation in the brain. On average 44% (range = 3-82%) of the aggression-methylation association was explained by current and former smoking and BMI. These findings point at loci that are sensitive to chemical exposures with potential implications for neuronal functions. We hope these results to be a starting point for studies leading to applications as peripheral biomarkers and to reveal causal relationships with aggression and related traits.
Asunto(s)
Metilación de ADN , Epigenoma , Adolescente , Adulto , Anciano , Agresión , Niño , Preescolar , Islas de CpG/genética , Metilación de ADN/genética , Epigénesis Genética/genética , Estudio de Asociación del Genoma Completo , Humanos , Longevidad , Persona de Mediana Edad , Adulto JovenRESUMEN
Genome-wide association studies (GWAS) have identified multiple genomic regions associated with schizophrenia, although many variants reside in noncoding regions characterized by high linkage disequilibrium (LD) making the elucidation of molecular mechanisms challenging. A genomic region on chromosome 10q24 has been consistently associated with schizophrenia with risk attributed to the AS3MT gene. Although AS3MT is hypothesized to play a role in neuronal development and differentiation, work to fully understand the function of this gene has been limited. In this study we explored the function of AS3MT using a neuronal cell line (SH-SY5Y). We confirm previous findings of isoform specific expression of AS3MT during SH-SY5Y differentiation toward neuronal fates. Using CRISPR-Cas9 gene editing we generated AS3MT knockout SH-SY5Y cell lines and used RNA-seq to identify significant changes in gene expression in pathways associated with neuronal development, inflammation, extracellular matrix formation, and RNA processing, including dysregulation of other genes strongly implicated in schizophrenia. We did not observe any morphological changes in cell size and neurite length following neuronal differentiation and MAP2 immunocytochemistry. These results provide novel insights into the potential role of AS3MT in brain development and identify pathways through which genetic variation in this region may confer risk for schizophrenia.
Asunto(s)
Neuroblastoma , Esquizofrenia , Estudio de Asociación del Genoma Completo , Humanos , Desequilibrio de Ligamiento/genética , Metiltransferasas/genética , Neurogénesis/genética , Esquizofrenia/genéticaRESUMEN
BACKGROUND: The combination of sodium bisulfite treatment with highly-parallel sequencing is a common method for quantifying DNA methylation across the genome. The power to detect between-group differences in DNA methylation using bisulfite-sequencing approaches is influenced by both experimental (e.g. read depth, missing data and sample size) and biological (e.g. mean level of DNA methylation and difference between groups) parameters. There is, however, no consensus about the optimal thresholds for filtering bisulfite sequencing data with implications for the reproducibility of findings in epigenetic epidemiology. RESULTS: We used a large reduced representation bisulfite sequencing (RRBS) dataset to assess the distribution of read depth across DNA methylation sites and the extent of missing data. To investigate how various study variables influence power to identify DNA methylation differences between groups, we developed a framework for simulating bisulfite sequencing data. As expected, sequencing read depth, group size, and the magnitude of DNA methylation difference between groups all impacted upon statistical power. The influence on power was not dependent on one specific parameter, but reflected the combination of study-specific variables. As a resource to the community, we have developed a tool, POWEREDBiSeq, which utilizes our simulation framework to predict study-specific power for the identification of DNAm differences between groups, taking into account user-defined read depth filtering parameters and the minimum sample size per group. CONCLUSIONS: Our data-driven approach highlights the importance of filtering bisulfite-sequencing data by minimum read depth and illustrates how the choice of threshold is influenced by the specific study design and the expected differences between groups being compared. The POWEREDBiSeq tool, which can be applied to different types of bisulfite sequencing data (e.g. RRBS, whole genome bisulfite sequencing (WGBS), targeted bisulfite sequencing and amplicon-based bisulfite sequencing), can help users identify the level of data filtering needed to optimize power and aims to improve the reproducibility of bisulfite sequencing studies.
Asunto(s)
Metilación de ADN , Sulfitos , Epigenómica , Secuenciación de Nucleótidos de Alto Rendimiento , Reproducibilidad de los Resultados , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: Sex is an important covariate of epigenome-wide association studies due to its strong influence on DNA methylation patterns across numerous genomic positions. Nevertheless, many samples on the Gene Expression Omnibus (GEO) frequently lack a sex annotation or are incorrectly labelled. Considering the influence that sex imposes on DNA methylation patterns, it is necessary to ensure that methods for filtering poor samples and checking of sex assignment are accurate and widely applicable. RESULTS: Here we presented a novel method to predict sex using only DNA methylation beta values, which can be readily applied to almost all DNA methylation datasets of different formats (raw IDATs or text files with only signal intensities) uploaded to GEO. We identified 4345 significantly (p<0.01) sex-associated CpG sites present on both 450K and EPIC arrays, and constructed a sex classifier based on the two first principal components of the DNA methylation data of sex-associated probes mapped on sex chromosomes. The proposed method is constructed using whole blood samples and exhibits good performance across a wide range of tissues. We further demonstrated that our method can be used to identify samples with sex chromosome aneuploidy, this function is validated by five Turner syndrome cases and one Klinefelter syndrome case. CONCLUSIONS: This proposed sex classifier not only can be used for sex predictions but also applied to identify samples with sex chromosome aneuploidy, and it is freely and easily accessible by calling the 'estimateSex' function from the newest wateRmelon Bioconductor package ( https://github.com/schalkwyk/wateRmelon ).
Asunto(s)
Metilación de ADN , Genómica , Aneuploidia , Islas de CpG , Humanos , Cromosomas Sexuales/genéticaRESUMEN
Autism spectrum disorder (ASD) encompasses a collection of complex neuropsychiatric disorders characterized by deficits in social functioning, communication and repetitive behaviour. Building on recent studies supporting a role for developmentally moderated regulatory genomic variation in the molecular aetiology of ASD, we quantified genome-wide patterns of DNA methylation in 223 post-mortem tissues samples isolated from three brain regions [prefrontal cortex, temporal cortex and cerebellum (CB)] dissected from 43 ASD patients and 38 non-psychiatric control donors. We identified widespread differences in DNA methylation associated with idiopathic ASD (iASD), with consistent signals in both cortical regions that were distinct to those observed in the CB. Individuals carrying a duplication on chromosome 15q (dup15q), representing a genetically defined subtype of ASD, were characterized by striking differences in DNA methylationacross a discrete domain spanning an imprinted gene cluster within the duplicated region. In addition to the dramatic cis-effects on DNA methylation observed in dup15q carriers, we identified convergent methylomic signatures associated with both iASD and dup15q, reflecting the findings from previous studies of gene expression and H3K27ac. Cortical co-methylation network analysis identified a number of co-methylated modules significantly associated with ASD that are enriched for genomic regions annotated to genes involved in the immune system, synaptic signalling and neuronal regulation. Our study represents the first systematic analysis of DNA methylation associated with ASD across multiple brain regions, providing novel evidence for convergent molecular signatures associated with both idiopathic and syndromic autism.
Asunto(s)
Trastorno Autístico/genética , Cerebelo/metabolismo , Metilación de ADN , Corteza Prefrontal/metabolismo , Lóbulo Temporal/metabolismo , Trastorno Autístico/metabolismo , Estudios de Casos y Controles , Cerebelo/química , Epigenoma , Femenino , Ontología de Genes , Redes Reguladoras de Genes , Genoma Humano , Humanos , Sistema Inmunológico/metabolismo , Masculino , Vías Nerviosas/fisiología , Corteza Prefrontal/química , Transmisión Sináptica/genética , Transmisión Sináptica/fisiología , Lóbulo Temporal/químicaRESUMEN
Characterizing the complex relationship between genetic, epigenetic, and transcriptomic variation has the potential to increase understanding about the mechanisms underpinning health and disease phenotypes. We undertook a comprehensive analysis of common genetic variation on DNA methylation (DNAm) by using the Illumina EPIC array to profile samples from the UK Household Longitudinal study. We identified 12,689,548 significant DNA methylation quantitative trait loci (mQTL) associations (p < 6.52 × 10-14) occurring between 2,907,234 genetic variants and 93,268 DNAm sites, including a large number not identified by previous DNAm-profiling methods. We demonstrate the utility of these data for interpreting the functional consequences of common genetic variation associated with > 60 human traits by using summary-data-based Mendelian randomization (SMR) to identify 1,662 pleiotropic associations between 36 complex traits and 1,246 DNAm sites. We also use SMR to characterize the relationship between DNAm and gene expression and thereby identify 6,798 pleiotropic associations between 5,420 DNAm sites and the transcription of 1,702 genes. Our mQTL database and SMR results are available via a searchable online database as a resource to the research community.
Asunto(s)
Metilación de ADN/genética , ADN/genética , Epigénesis Genética/genética , Expresión Génica/genética , Variación Genética/genética , Sitios de Carácter Cuantitativo/genética , Transcriptoma/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Estudios Longitudinales , Fenotipo , Carácter Cuantitativo Heredable , Transcripción Genética/genéticaRESUMEN
Human DNA methylation data have been used to develop biomarkers of ageing, referred to as 'epigenetic clocks', which have been widely used to identify differences between chronological age and biological age in health and disease including neurodegeneration, dementia and other brain phenotypes. Existing DNA methylation clocks have been shown to be highly accurate in blood but are less precise when used in older samples or in tissue types not included in training the model, including brain. We aimed to develop a novel epigenetic clock that performs optimally in human cortex tissue and has the potential to identify phenotypes associated with biological ageing in the brain. We generated an extensive dataset of human cortex DNA methylation data spanning the life course (n = 1397, ages = 1 to 108 years). This dataset was split into 'training' and 'testing' samples (training: n = 1047; testing: n = 350). DNA methylation age estimators were derived using a transformed version of chronological age on DNA methylation at specific sites using elastic net regression, a supervised machine learning method. The cortical clock was subsequently validated in a novel independent human cortex dataset (n = 1221, ages = 41 to 104 years) and tested for specificity in a large whole blood dataset (n = 1175, ages = 28 to 98 years). We identified a set of 347 DNA methylation sites that, in combination, optimally predict age in the human cortex. The sum of DNA methylation levels at these sites weighted by their regression coefficients provide the cortical DNA methylation clock age estimate. The novel clock dramatically outperformed previously reported clocks in additional cortical datasets. Our findings suggest that previous associations between predicted DNA methylation age and neurodegenerative phenotypes might represent false positives resulting from clocks not robustly calibrated to the tissue being tested and for phenotypes that become manifest in older ages. The age distribution and tissue type of samples included in training datasets need to be considered when building and applying epigenetic clock algorithms to human epidemiological or disease cohorts.
Asunto(s)
Envejecimiento/genética , Relojes Biológicos/fisiología , Corteza Cerebral/crecimiento & desarrollo , Epigénesis Genética/fisiología , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Algoritmos , Recuento de Células , Corteza Cerebral/citología , Niño , Preescolar , ADN/genética , Metilación de ADN , Bases de Datos Factuales , Femenino , Humanos , Lactante , Aprendizaje Automático , Masculino , Persona de Mediana Edad , Neuronas/fisiología , Fenotipo , Reproducibilidad de los Resultados , Caracteres Sexuales , Adulto JovenRESUMEN
Variation in DNA methylation is being increasingly associated with health and disease outcomes. Although DNA methylation is hypothesized to be a mechanism by which both genetic and non-genetic factors can influence the regulation of gene expression, little is known about the extent to which DNA methylation at specific sites is influenced by heritable as well as environmental factors. We quantified DNA methylation in whole blood at age 18 in a birth cohort of 1,464 individuals comprising 426 monozygotic (MZ) and 306 same-sex dizygotic (DZ) twin pairs. Site-specific levels of DNA methylation were more strongly correlated across the genome between MZ than DZ twins. Structural equation models revealed that although the average contribution of additive genetic influences on DNA methylation across the genome was relatively low, it was notably elevated at the highly variable sites characterized by intermediate levels of DNAm that are most relevant for epigenetic epidemiology. Sites at which variable DNA methylation was most influenced by genetic factors were significantly enriched for DNA methylation quantitative trait loci (mQTL) effects, and overlapped with sites where inter-individual variation correlates across tissues. Finally, we show that DNA methylation at sites robustly associated with environmental exposures such as tobacco smoking and obesity is also influenced by additive genetic effects, highlighting the need to control for genetic background in analyses of exposure-associated DNA methylation differences. Estimates of the contribution of genetic and environmental influences to DNA methylation at all sites profiled in this study are available as a resource for the research community (http://www.epigenomicslab.com/online-data-resources).
Asunto(s)
Metilación de ADN , Interacción Gen-Ambiente , Gemelos Dicigóticos/genética , Gemelos Monocigóticos/genética , Adolescente , Índice de Masa Corporal , Niño , Preescolar , Estudios de Cohortes , Epigenómica , Femenino , Estudios de Asociación Genética , Genoma Humano , Humanos , Modelos Lineales , Estudios Longitudinales , Masculino , Sitios de Carácter Cuantitativo , Fumar Tabaco/efectos adversosRESUMEN
Common genetic variation appears to largely influence risk for neuropsychiatric disorders through effects on gene regulation. It is therefore possible to shed light on the biology of these conditions by testing for enrichment of associated genetic variation within regulatory genomic regions operating in specific tissues or cell types. Here, we have used the assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-Seq) to map open chromatin (an index of active regulatory genomic regions) in bulk tissue, NeuN+ and NeuN- nuclei from the prenatal human frontal cortex, and tested enrichment of single-nucleotide polymorphism (SNP) heritability for five neuropsychiatric disorders (autism spectrum disorder, attention deficit hyperactivity disorder [ADHD], bipolar disorder, major depressive disorder, and schizophrenia) within these regions. We observed significant enrichment of SNP heritability for ADHD, major depressive disorder, and schizophrenia within open chromatin regions (OCRs) mapped in bulk fetal frontal cortex, and for all five tested neuropsychiatric conditions when we restricted these sites to those overlapping histone modifications indicative of enhancers (H3K4me1) or promoters (H3K4me3) in fetal brain. SNP heritability for neuropsychiatric disorders was significantly enriched in OCRs identified in fetal frontal cortex NeuN- as well as NeuN+ nuclei overlapping fetal brain H3K4me1 or H3K4me3 sites. We additionally demonstrate the utility of our mapped OCRs for prioritizing potentially functional SNPs at genome-wide significant risk loci for neuropsychiatric disorders. Our data provide evidence for an early neurodevelopmental component to a range of neuropsychiatric conditions and highlight an important role for regulatory genomic regions active within both NeuN+ and NeuN- cells of the prenatal brain.
Asunto(s)
Trastorno del Espectro Autista , Trastorno Bipolar , Trastorno Depresivo Mayor , Trastorno Bipolar/genética , Femenino , Lóbulo Frontal , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Polimorfismo de Nucleótido Simple , EmbarazoRESUMEN
Depression is a common and disabling disorder, representing a major social and economic health issue. Moreover, depression is associated with the progression of diseases with an inflammatory etiology including many inflammatory-related disorders. At the molecular level, the mechanisms by which depression might promote the onset of these diseases and associated immune-dysfunction are not well understood. In this study we assessed genome-wide patterns of DNA methylation in whole blood-derived DNA obtained from individuals with a self-reported history of depression (n = 100) and individuals without a history of depression (n = 100) using the Illumina 450K microarray. Our analysis identified six significant (Sidák corrected P < 0.05) depression-associated differentially methylated regions (DMRs); the top-ranked DMR was located in exon 1 of the LTB4R2 gene (Sidák corrected P = 1.27 × 10-14). Polygenic risk scores (PRS) for depression were generated and known biological markers of inflammation, telomere length (TL) and IL-6, were measured in DNA and serum samples, respectively. Next, we employed a systems-level approach to identify networks of co-methylated loci associated with a history of depression, in addition to depression PRS, TL and IL-6 levels. Our analysis identified one depression-associated co-methylation module (P = 0.04). Interestingly, the depression-associated module was highly enriched for pathways related to immune function and was also associated with TL and IL-6 cytokine levels. In summary, our genome-wide DNA methylation analysis of individuals with and without a self-reported history of depression identified several candidate DMRs of potential relevance to the pathogenesis of depression and its associated immune-dysfunction phenotype.
Asunto(s)
Metilación de ADN/genética , Depresión/genética , Estudio de Asociación del Genoma Completo , Receptores de Leucotrieno B4/genética , Adulto , Anciano , Biomarcadores/sangre , Índice de Masa Corporal , Islas de CpG/genética , Depresión/sangre , Depresión/patología , Epigénesis Genética , Femenino , Predisposición Genética a la Enfermedad , Genoma Humano/genética , Humanos , Inflamación/sangre , Inflamación/genética , Inflamación/patología , Interleucina-6/sangre , Interleucina-6/genética , Masculino , Persona de Mediana Edad , Receptores de Leucotrieno B4/sangre , Homeostasis del Telómero/genéticaRESUMEN
Most genetic variants identified in genome-wide association studies (GWASs) of complex traits are thought to act by affecting gene regulation rather than directly altering the protein product. As a consequence, the actual genes involved in disease are not necessarily the most proximal to the associated variants. By integrating data from GWAS analyses with those from genetic studies of regulatory variation, it is possible to identify variants pleiotropically associated with both a complex trait and measures of gene regulation. In this study, we used summary-data-based Mendelian randomization (SMR), a method developed to identify variants pleiotropically associated with both complex traits and gene expression, to identify variants associated with complex traits and DNA methylation. We used large DNA methylation quantitative trait locus (mQTL) datasets generated from two different tissues (blood and fetal brain) to prioritize genes for >40 complex traits with robust GWAS data and found considerable overlap with the results of SMR analyses performed with expression QTL (eQTL) data. We identified multiple examples of variable DNA methylation associated with GWAS variants for a range of complex traits, demonstrating the utility of this approach for refining genetic association signals.