RESUMO
BACKGROUND: Assisted reproductive technologies (ART) may perturb DNA methylation (DNAm) in early embryonic development. Although a handful of epigenome-wide association studies of ART have been published, none have investigated CpGs on the X chromosome. To bridge this knowledge gap, we leveraged one of the largest collections of mother-father-newborn trios of ART and non-ART (natural) conceptions to date to investigate sex-specific DNAm differences on the X chromosome. The discovery cohort consisted of 982 ART and 963 non-ART trios from the Norwegian Mother, Father, and Child Cohort Study (MoBa). To verify our results from the MoBa cohort, we used an external cohort of 149 ART and 58 non-ART neonates from the Australian 'Clinical review of the Health of adults conceived following Assisted Reproductive Technologies' (CHART) study. The Illumina EPIC array was used to measure DNAm in both datasets. In the MoBa cohort, we performed a set of X-chromosome-wide association studies ('XWASs' hereafter) to search for sex-specific DNAm differences between ART and non-ART newborns. We tested several models to investigate the influence of various confounders, including parental DNAm. We also searched for differentially methylated regions (DMRs) and regions of co-methylation flanking the most significant CpGs. Additionally, we ran an analogous model to our main model on the external CHART dataset. RESULTS: In the MoBa cohort, we found more differentially methylated CpGs and DMRs in girls than boys. Most of the associations persisted after controlling for parental DNAm and other confounders. Many of the significant CpGs and DMRs were in gene-promoter regions, and several of the genes linked to these CpGs are expressed in tissues relevant for both ART and sex (testis, placenta, and fallopian tube). We found no support for parental DNAm-dependent features as an explanation for the observed associations in the newborns. The most significant CpG in the boys-only analysis was in UBE2DNL, which is expressed in testes but with unknown function. The most significant CpGs in the girls-only analysis were in EIF2S3 and AMOT. These three loci also displayed differential DNAm in the CHART cohort. CONCLUSIONS: Genes that co-localized with the significant CpGs and DMRs associated with ART are implicated in several key biological processes (e.g., neurodevelopment) and disorders (e.g., intellectual disability and autism). These connections are particularly compelling in light of previous findings indicating that neurodevelopmental outcomes differ in ART-conceived children compared to those naturally conceived.
Assuntos
Metilação de DNA , Epigênese Genética , Masculino , Gravidez , Adulto , Criança , Feminino , Humanos , Recém-Nascido , Metilação de DNA/genética , Estudos de Coortes , Estudo de Associação Genômica Ampla , AustráliaRESUMO
BACKGROUND: DNA methylation (DNAm) profiling has emerged as a powerful tool for characterizing the placental methylome. However, previous studies have focused primarily on whole placental tissue, which is a mixture of epigenetically distinct cell populations. Here, we present the first methylome-wide analysis of first trimester (n = 9) and term (n = 19) human placental samples of four cell populations: trophoblasts, Hofbauer cells, endothelial cells, and stromal cells, using the Illumina EPIC methylation array, which quantifies DNAm at > 850,000 CpGs. RESULTS: The most distinct DNAm profiles were those of placental trophoblasts, which are central to many pregnancy-essential functions, and Hofbauer cells, which are a rare fetal-derived macrophage population. Cell-specific DNAm occurs at functionally-relevant genes, including genes associated with placental development and preeclampsia. Known placental-specific methylation marks, such as those associated with genomic imprinting, repetitive element hypomethylation, and placental partially methylated domains, were found to be more pronounced in trophoblasts and often absent in Hofbauer cells. Lastly, we characterize the cell composition and cell-specific DNAm dynamics across gestation. CONCLUSIONS: Our results provide a comprehensive analysis of DNAm in human placental cell types from first trimester and term pregnancies. This data will serve as a useful DNAm reference for future placental studies, and we provide access to this data via download from GEO (GSE159526), through interactive exploration from the web browser ( https://robinsonlab.shinyapps.io/Placental_Methylome_Browser/ ), and through the R package planet, which allows estimation of cell composition directly from placental DNAm data.
Assuntos
Epigênese Genética , Epigenoma , Metilação de DNA , Células Endoteliais , Feminino , Humanos , Placenta/metabolismo , GravidezRESUMO
DnA methylation has recently been accepted as the most reliable and effective method of diagnosing central nervous system (CNS) tumors. Healthy organs and tumors of different localizations have their own unique methylation structure. Determination of total tumor DNA methylome is the detection of all methylated nucleotides in a tumor. The "gold standard" for analyzing the methylation state of individual cytosines is bisulfite conversion, in which unmethylated cytosines are converted to uracils and read as thymines, while methylated cytosines are protected from conversion.
Assuntos
Neoplasias Encefálicas , Metilação de DNA , Neoplasias Encefálicas/genética , DNA/metabolismo , HumanosRESUMO
BACKGROUND: Systematic technical effects-also called batch effects-are a considerable challenge when analyzing DNA methylation (DNAm) microarray data, because they can lead to false results when confounded with the variable of interest. Methods to correct these batch effects are error-prone, as previous findings have shown. RESULTS: Here, we demonstrate how using the R function ComBat to correct simulated Infinium HumanMethylation450 BeadChip (450 K) and Infinium MethylationEPIC BeadChip Kit (EPIC) DNAm data can lead to a large number of false positive results under certain conditions. We further provide a detailed assessment of the consequences for the highly relevant problem of p-value inflation with subsequent false positive findings after application of the frequently used ComBat method. Using ComBat to correct for batch effects in randomly generated samples produced alarming numbers of false discovery rate (FDR) and Bonferroni-corrected (BF) false positive results in unbalanced as well as in balanced sample distributions in terms of the relation between the outcome of interest variable and the technical position of the sample during the probe measurement. Both sample size and number of batch factors (e.g. number of chips) were systematically simulated to assess the probability of false positive findings. The effect of sample size was simulated using n = 48 up to n = 768 randomly generated samples. Increasing the number of corrected factors led to an exponential increase in the number of false positive signals. Increasing the number of samples reduced, but did not completely prevent, this effect. CONCLUSIONS: Using the approach described, we demonstrate, that using ComBat for batch correction in DNAm data can lead to false positive results under certain conditions and sample distributions. Our results are thus contrary to previous publications, considering a balanced sample distribution as unproblematic when using ComBat. We do not claim completeness in terms of reporting all technical conditions and possible solutions of the occurring problems as we approach the problem from a clinician's perspective and not from that of a computer scientist. With our approach of simulating data, we provide readers with a simple method to assess the probability of false positive findings in DNAm microarray data analysis pipelines.
Assuntos
Metilação de DNA , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Ilhas de CpG , Reações Falso-Positivas , Humanos , Dispositivos Lab-On-A-Chip , Probabilidade , Tamanho da AmostraRESUMO
The aim of this study was to identify placental DNA methylation (DNAm) variations associated with adiposity at 3 years of age. We quantified placental DNAm using the Infinium MethylationEPIC BeadChips. We assessed associations between DNAm at single-CpGs and skinfold thickness using robust linear regression models adjusted for gestational age, child's sex, age at follow-up and cellular heterogeneity. We sought replication of DNAm association with child adiposity in an independent cohort. We quantified placental mRNA levels for annotated gene using qRT-PCR and tested for correlation with DNAm. Lower DNAm at cg22593959 and cg22436429 was associated with higher adiposity (ß = -1.18, q = 0.002 and ß = -0.82, q = 0.04). The cg22593959 is located in an intergenic region (chr7q31.3), whereas cg22436429 is within the TFAP2E gene (1p34.3). DNAm at cg22593959 and cg22436429 was correlated with mRNA levels at FAM3C (rs = -0.279, p = 0.005) and TFAP2E (rs = 0.216, p = 0.03). In an independent cohort, the association between placental DNAm at cg22593959 and childhood adiposity was of similar strength and direction (ß = -3.8 ± 4.1, p = 0.36), yet non-significant. Four genomic regions were also associated with skinfold thickness within FMN1, MAGI2, SKAP2 and BMPR1B genes. We identified placental epigenetic variations associated with adiposity at 3 years of age suggesting that childhood fat accretion patterns might be established during fetal life.
Assuntos
Adiposidade/genética , Epigenoma/genética , Predisposição Genética para Doença , Obesidade Infantil/genética , Adulto , Pré-Escolar , Metilação de DNA/genética , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Obesidade Infantil/patologia , Placenta/metabolismo , Placenta/patologia , Gravidez , Dobras CutâneasRESUMO
BACKGROUND: There has been a steady increase in the number of studies aiming to identify DNA methylation differences associated with complex phenotypes. Many of the challenges of epigenetic epidemiology regarding study design and interpretation have been discussed in detail, however there are analytical concerns that are outstanding and require further exploration. In this study we seek to address three analytical issues. First, we quantify the multiple testing burden and propose a standard statistical significance threshold for identifying DNA methylation sites that are associated with an outcome. Second, we establish whether linear regression, the chosen statistical tool for the majority of studies, is appropriate and whether it is biased by the underlying distribution of DNA methylation data. Finally, we assess the sample size required for adequately powered DNA methylation association studies. RESULTS: We quantified DNA methylation in the Understanding Society cohort (n = 1175), a large population based study, using the Illumina EPIC array to assess the statistical properties of DNA methylation association analyses. By simulating null DNA methylation studies, we generated the distribution of p-values expected by chance and calculated the 5% family-wise error for EPIC array studies to be 9 × 10- 8. Next, we tested whether the assumptions of linear regression are violated by DNA methylation data and found that the majority of sites do not satisfy the assumption of normal residuals. Nevertheless, we found no evidence that this bias influences analyses by increasing the likelihood of affected sites to be false positives. Finally, we performed power calculations for EPIC based DNA methylation studies, demonstrating that existing studies with data on ~ 1000 samples are adequately powered to detect small differences at the majority of sites. CONCLUSION: We propose that a significance threshold of P < 9 × 10- 8 adequately controls the false positive rate for EPIC array DNA methylation studies. Moreover, our results indicate that linear regression is a valid statistical methodology for DNA methylation studies, despite the fact that the data do not always satisfy the assumptions of this test. These findings have implications for epidemiological-based studies of DNA methylation and provide a framework for the interpretation of findings from current and future studies.
Assuntos
Metilação de DNA , Epigenômica/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Ilhas de CpG , Epigênese Genética , Estudo de Associação Genômica Ampla , Humanos , Modelos LinearesRESUMO
Recently, we described a machine learning approach for classification of central nervous system tumors based on the analysis of genome-wide DNA methylation patterns [6]. Here, we report on DNA methylation-based central nervous system (CNS) tumor diagnostics conducted in our institution between the years 2015 and 2018. In this period, more than 1000 tumors from the neurosurgical departments in Heidelberg and Mannheim and more than 1000 tumors referred from external institutions were subjected to DNA methylation analysis for diagnostic purposes. We describe our current approach to the integrated diagnosis of CNS tumors with a focus on constellations with conflicts between morphological and molecular genetic findings. We further describe the benefit of integrating DNA copy-number alterations into diagnostic considerations and provide a catalog of copy-number changes for individual DNA methylation classes. We also point to several pitfalls accompanying the diagnostic implementation of DNA methylation profiling and give practical suggestions for recurring diagnostic scenarios.
Assuntos
Neoplasias do Sistema Nervoso Central/classificação , Neoplasias do Sistema Nervoso Central/genética , Variações do Número de Cópias de DNA/genética , Metilação de DNA/genética , Proteínas de Neoplasias/genética , Metilases de Modificação do DNA/genética , Enzimas Reparadoras do DNA/genética , Feminino , Humanos , Isocitrato Desidrogenase/genética , Masculino , Receptores Proteína Tirosina Quinases/genética , Estudos Retrospectivos , Proteínas Supressoras de Tumor/genéticaRESUMO
Patients with DICER1 predisposition syndrome have an increased risk to develop pleuropulmonary blastoma, cystic nephroma, embryonal rhabdomyosarcoma, and several other rare tumor entities. In this study, we identified 22 primary intracranial sarcomas, including 18 in pediatric patients, with a distinct methylation signature detected by array-based DNA-methylation profiling. In addition, two uterine rhabdomyosarcomas sharing identical features were identified. Gene panel sequencing of the 22 intracranial sarcomas revealed the almost unifying feature of DICER1 hotspot mutations (21/22; 95%) and a high frequency of co-occurring TP53 mutations (12/22; 55%). In addition, 17/22 (77%) sarcomas exhibited alterations in the mitogen-activated protein kinase pathway, most frequently affecting the mutational hotspots of KRAS (8/22; 36%) and mutations or deletions of NF1 (7/22; 32%), followed by mutations of FGFR4 (2/22; 9%), NRAS (2/22; 9%), and amplification of EGFR (1/22; 5%). A germline DICER1 mutation was detected in two of five cases with constitutional DNA available. Notably, none of the patients showed evidence of a cancer-related syndrome at the time of diagnosis. In contrast to the genetic findings, the morphological features of these tumors were less distinctive, although rhabdomyoblasts or rhabdomyoblast-like cells could retrospectively be detected in all cases. The identified combination of genetic events indicates a relationship between the intracranial tumors analyzed and DICER1 predisposition syndrome-associated sarcomas such as embryonal rhabdomyosarcoma or the recently described group of anaplastic sarcomas of the kidney. However, the intracranial tumors in our series were initially interpreted to represent various tumor types, but rhabdomyosarcoma was not among the typical differential diagnoses considered. Given the rarity of intracranial sarcomas, this molecularly clearly defined group comprises a considerable fraction thereof. We therefore propose the designation "spindle cell sarcoma with rhabdomyosarcoma-like features, DICER1 mutant" for this intriguing group.
Assuntos
RNA Helicases DEAD-box/genética , Predisposição Genética para Doença/genética , Mutação/genética , Rabdomiossarcoma Embrionário/genética , Ribonuclease III/genética , Sarcoma/genética , Metilação de DNA/genética , Feminino , Mutação em Linhagem Germinativa , Humanos , Masculino , Estudos Retrospectivos , Rabdomiossarcoma Embrionário/patologia , Sarcoma/patologiaRESUMO
Type 2 diabetes mellitus (T2DM) and periodontitis (PD) have intricated connections as chronic inflammatory diseases. While the immune response is a key factor that accounts for their association, the underlying mechanisms remain unclear. To gain a deeper understanding of the connection, we conducted research using a multiomics approach. We generated whole genome and methylation profiling array data from the periodontium of PD patients with DM (PDDM) and without DM to confirm genetic and epigenetic changes. Independent bulk and single-cell RNA sequencing data were employed to verify the expression levels of hypo-methylated genes. We observed a gradual rise in C>T base substitutions and hypomethylation in PD and PDDM patients compared with healthy participants. Furthermore, specific genetic and epigenetic alterations were prominently associated with the Fc-gamma receptor-mediated phagocytosis pathway. The upregulation of these genes was confirmed in both the periodontal tissues of PD patients and the pancreatic tissues of T2DM patients. Through single-cell RNA analysis of peripheral blood mononuclear cells, substantial upregulation of Fc-gamma receptors and related genes was particularly identified in monocytes. Our findings suggest that targeting the Fc-gamma signaling pathway in monocytes holds promise as a potential treatment strategy for managing systemic complications associated with diabetes.
RESUMO
The case of the monozygotic (MZ) twin as a suspect demonstrates a practical problem in forensic casework. As the MZ twins are genetically identical, they share the same short tandem repeat (STR) profile. Many studies showed that older MZ twins have significant differences in overall content and genomic distribution of methylation between them. However, studies addressing the investigation of epigenetic MZ triplet differentiation in various forensic reference materials are lacking. Here, one triplet set of Egyptian MZ twins was used as an analog to a forensic case. The genome-wide methylation analysis was performed via the new Human Methylation EPIC BeadChip array. Following normalization methods, potential differentially methylated positions (DMPs) were discovered. This resulted in the detection of 24 potential DMPs in reference-type blood DNA and 11 potential DMPs in reference-type buccal DNA. Then, the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) analyses were performed to show the associated biological functions. Our findings revealed that the 35 potential DMPs were enriched in 283 significant GO terms. These terms are mainly enriched in the immune system. Overall, this study demonstrates the general feasibility of epigenetic MZ triplet differentiation in the forensic context and highlights that some potential DMPs identified in blood DNA were not informative in buccal DNA. This is due to various reasons, including the tissue specificity of DNA methylation.
RESUMO
BACKGROUND: Smoking impacts DNA methylation, but data are lacking on smoking-related differential methylation by sex or dietary intake, recent smoking cessation (<1 year), persistence of differential methylation from in utero smoking exposure, and effects of environmental tobacco smoke (ETS). METHODS: We meta-analysed data from up to 15,014 adults across 5 cohorts with DNA methylation measured in blood using Illumina's EPIC array for current smoking (2560 exposed), quit < 1 year (500 exposed), in utero (286 exposed), and ETS exposure (676 exposed). We also evaluated the interaction of current smoking with sex or diet (fibre, folate, and vitamin C). FINDINGS: Using false discovery rate (FDR < 0.05), 65,857 CpGs were differentially methylated in relation to current smoking, 4025 with recent quitting, 594 with in utero exposure, and 6 with ETS. Most current smoking CpGs attenuated within a year of quitting. CpGs related to in utero exposure in adults were enriched for those previously observed in newborns. Differential methylation by current smoking at 4-71 CpGs may be modified by sex or dietary intake. Nearly half (35-50%) of differentially methylated CpGs on the 450 K array were associated with blood gene expression. Current smoking and in utero smoking CpGs implicated 3049 and 1067 druggable targets, including chemotherapy drugs. INTERPRETATION: Many smoking-related methylation sites were identified with Illumina's EPIC array. Most signals revert to levels observed in never smokers within a year of cessation. Many in utero smoking CpGs persist into adulthood. Smoking-related druggable targets may provide insights into cancer treatment response and shared mechanisms across smoking-related diseases. FUNDING: Intramural Research Program of the National Institutes of Health, Norwegian Ministry of Health and Care Services and the Ministry of Education and Research, Chief Scientist Office of the Scottish Government Health Directorates and the Scottish Funding Council, Medical Research Council UK and the Wellcome Trust.
Assuntos
Abandono do Hábito de Fumar , Poluição por Fumaça de Tabaco , Adulto , Humanos , Recém-Nascido , Metilação de DNA , Epigênese Genética , Fumar/efeitos adversos , Fumar/genética , Fumar Tabaco , Ilhas de CpGRESUMO
Background: The recently launched DNA methylation profiling platform, Illumina MethylationEPIC BeadChip Infinium microarray v2.0 (EPICv2), is highly correlated with measurements obtained from its predecessor MethylationEPIC BeadChip Infinium microarray v1.0 (EPICv1). However, the concordance between the two versions in the context of DNA methylation-based tools, including cell type deconvolution algorithms, epigenetic clocks, and inflammation and lifestyle biomarkers has not yet been investigated. To address this, we profiled DNA methylation on both EPIC versions using matched venous blood samples from individuals spanning early to late adulthood across four cohorts. Findings: Within each cohort, samples primarily clustered by the EPIC version they were measured on. High concordance between EPIC versions at the array level, but variable concordance at the individual probe level was noted. Significant differences between versions in estimates from DNA methylation-based tools were observed, irrespective of the normalization method, with some nuanced differences across cohorts and tools. Adjusting for EPIC version or calculating estimates separately for each version largely mitigated these version-specific discordances. Conclusions: Our work illustrates the importance of accounting for EPIC version differences in research scenarios, especially in meta-analyses and longitudinal studies, when samples profiled across different versions are harmonized. Alongside DNA methylation-based tools, our observations also have implications in interpretation of epigenome-wide association studies (EWAS) findings, when results obtained from one version are compared to another, particularly for probes that are poorly concordant between versions.
RESUMO
DNA methylation (DNAm) plays a crucial role in a number of complex diseases. However, the reliability of DNAm levels measured using Illumina arrays varies across different probes. Previous research primarily assessed probe reliability by comparing duplicate samples between the 450k-450k or 450k-EPIC platforms, with limited investigations on Illumina EPIC arrays. We conducted a comprehensive assessment of the EPIC array probe reliability using 138 duplicated blood DNAm samples generated by the Alzheimer's Disease Neuroimaging Initiative study. We introduced a novel statistical measure, the modified intraclass correlation, to better account for the disagreement in duplicate measurements. We observed higher reliability in probes with average methylation beta values of 0.2 to 0.8, and lower reliability in type I probes or those within the promoter and CpG island regions. Importantly, we found that probe reliability has significant implications in the analyses of Epigenome-wide Association Studies (EWAS). Higher reliability is associated with more consistent effect sizes in different studies, the identification of differentially methylated regions (DMRs) and methylation quantitative trait locus (mQTLs), and significant correlations with downstream gene expression. Moreover, blood DNAm measurements obtained from probes with higher reliability are more likely to show concordance with brain DNAm measurements. Our findings, which provide crucial reliable information for probes on the EPIC array, will serve as a valuable resource for future DNAm studies.
RESUMO
BACKGROUND: The Infinium EPIC array measures the methylation status of > 850,000 CpG sites. The EPIC BeadChip uses a two-array design: Infinium Type I and Type II probes. These probe types exhibit different technical characteristics which may confound analyses. Numerous normalization and pre-processing methods have been developed to reduce probe type bias as well as other issues such as background and dye bias. METHODS: This study evaluates the performance of various normalization methods using 16 replicated samples and three metrics: absolute beta-value difference, overlap of non-replicated CpGs between replicate pairs, and effect on beta-value distributions. Additionally, we carried out Pearson's correlation and intraclass correlation coefficient (ICC) analyses using both raw and SeSAMe 2 normalized data. RESULTS: The method we define as SeSAMe 2, which consists of the application of the regular SeSAMe pipeline with an additional round of QC, pOOBAH masking, was found to be the best performing normalization method, while quantile-based methods were found to be the worst performing methods. Whole-array Pearson's correlations were found to be high. However, in agreement with previous studies, a substantial proportion of the probes on the EPIC array showed poor reproducibility (ICC < 0.50). The majority of poor performing probes have beta values close to either 0 or 1, and relatively low standard deviations. These results suggest that probe reliability is largely the result of limited biological variation rather than technical measurement variation. Importantly, normalizing the data with SeSAMe 2 dramatically improved ICC estimates, with the proportion of probes with ICC values > 0.50 increasing from 45.18% (raw data) to 61.35% (SeSAMe 2).
Assuntos
Metilação de DNA , Humanos , Reprodutibilidade dos Testes , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Ilhas de CpGRESUMO
BACKGROUND: The pooled sample method is used in epigenomic research and expression analysis and is a cost-effective screening approach for small amounts of DNA. Evaluation of the pooled sample method in epigenomic studies is performed using the Illumina Infinium Methylation 450K BeadChip array; however, subsequent reports on the updated 850K array are lacking. A previous study demonstrated that the methylation levels obtained from individual samples were accurately replicated using pooled samples but did not address epigenome-wide association study (EWAS) statistics. The DNA quantification method, which is important for the homogeneous mixing of DNA in the pooled sample method, has since become fluorescence-based, and additional factors need to be considered including the resolution of batch effects of microarray chips and the heterogeneity of the cellular proportions from which the DNA samples are derived. In this study, four pooled samples were created from 44 individual samples, and EWAS statistics for differentially methylated positions (DMPs) and regions (DMRs) were conducted for individual samples and compared with the statistics obtained from the pooled samples. RESULTS: The methylation levels could be reproduced fairly well in the pooled samples. This was the case for the entire dataset and when limited to the top 100 CpG sites, consistent with a previous study using the 450K BeadChip array. However, the statistical results of the EWAS for the DMP by individual samples were not replicated in pooled samples. Qualitative analyses highlighting methylation within an arbitrary candidate gene were replicable. Focusing on chr 20, the statistical results of EWAS for DMR from individual samples showed replicability in the pooled samples as long as they were limited to regions with a sufficient effect size. CONCLUSIONS: The pooled sample method replicated the methylation values well and can be used for EWAS in DMR. This method is sample amount-effective and cost-effective and can be utilized for screening by carefully understanding the effective features and disadvantages of the pooled sample method and combining it with candidate gene analyses.
Assuntos
Metilação de DNA , Epigenoma , Humanos , Epigenômica , Processamento de Proteína Pós-TraducionalRESUMO
The majority of epigenetic epidemiology studies to date have generated genome-wide profiles from bulk tissues (e.g., whole blood) however these are vulnerable to confounding from variation in cellular composition. Proxies for cellular composition can be mathematically derived from the bulk tissue profiles using a deconvolution algorithm; however, there is no method to assess the validity of these estimates for a dataset where the true cellular proportions are unknown. In this study, we describe, validate and characterize a sample level accuracy metric for derived cellular heterogeneity variables. The CETYGO score captures the deviation between a sample's DNA methylation profile and its expected profile given the estimated cellular proportions and cell type reference profiles. We demonstrate that the CETYGO score consistently distinguishes inaccurate and incomplete deconvolutions when applied to reconstructed whole blood profiles. By applying our novel metric to >6,300 empirical whole blood profiles, we find that estimating accurate cellular composition is influenced by both technical and biological variation. In particular, we show that when using a common reference panel for whole blood, less accurate estimates are generated for females, neonates, older individuals and smokers. Our results highlight the utility of a metric to assess the accuracy of cellular deconvolution, and describe how it can enhance studies of DNA methylation that are reliant on statistical proxies for cellular heterogeneity. To facilitate incorporating our methodology into existing pipelines, we have made it freely available as an R package (https://github.com/ds420/CETYGO).
Assuntos
Algoritmos , Metilação de DNA , Feminino , Recém-Nascido , Humanos , Incerteza , Biologia Computacional/métodos , EpigenômicaRESUMO
While low serotonergic activity is often associated with psychological disorders such as depression, anxiety, mood, and personality disorders, variations in serotonin also contribute to normal personality differences. In this study, we investigated the role of blood DNA methylation levels at individual CpG sites of two key serotonergic genes (serotonin receptor gene 1A, HTR1A; serotonin transporter gene, SLC6A4) in predicting the personalities of captive chimpanzees. We found associations between methylation at 9/48 CpG sites with four personality dimensions: Dominance, Reactivity/Dependability, Agreeableness, and Openness. Directionality of effects were CpG location-dependent and confirmed a role of serotonergic methylation in reducing anxiety (Dominance) and aggression-related personality (Reactivity/Undependability) while simultaneously promoting prosocial (Agreeableness) and exploratory personalities (Openness). Although early-life adversity has been shown to impact serotonergic methylation patterns in other species, here, atypical early social rearing experiences only had a modest impact on CpG methylation levels in this chimpanzee sample. The precise environmental factors impacting serotonergic methylation in chimpanzees remain to be identified. Nevertheless, our study suggests a role in shaping natural variation in animal personalities. The results of this study offer a basis for future hypothesis-driven testing in additional populations and species to better understand the impact of ecology and evolution on complex behavioral traits.
RESUMO
BACKGROUND: DNA methylation is an epigenetic mark that is influenced by underlying genetic profile, environment, and ageing. In addition to X-linked DNA methylation, sex-specific methylation patterns are widespread across autosomal chromosomes and can be present from birth or arise over time. In individuals where gender identity and sex assigned at birth are markedly incongruent, as in the case of transgender people, feminization or masculinization may be sought through gender-affirming hormone therapy (GAHT). GAHT is a cornerstone of transgender care, yet no studies to date have investigated its effect on genome-wide methylation. We profiled genome-wide DNA methylation in blood of transgender women (n = 13) and transgender men (n = 13) before and during GAHT (6 months and 12 months into feminizing or masculinizing hormone therapy). RESULTS: We identified several thousand differentially methylated CpG sites (DMPs) (Δß ≥ 0.02, unadjusted p value < 0.05) and several differentially methylated regions (DMRs) in both people undergoing feminizing and masculinizing GAHT, the vast majority of which were progressive changes over time. X chromosome and sex-specific autosomal DNA methylation patterns established in early development are largely refractory to change in association with GAHT, with only 3% affected (Δß ≥ 0.02, unadjusted p value < 0.05). The small number of sex-specific DMPs that were affected by GAHT were those that become sex-specific during the lifetime, known as sex-and-age DMPs, including DMRs in PRR4 and VMP1 genes. The GAHT-induced changes at these sex-associated probes consistently demonstrated a shift towards the methylation signature of the GAHT-naïve opposite sex, and we observed enrichment of previously reported adolescence-associated methylation changes. CONCLUSION: We provide evidence for GAHT inducing a unique blood methylation signature in transgender people. This study advances our understanding of the complex interplay between sex hormones, sex chromosomes, and DNA methylation in the context of immunity. We highlight the need to broaden the field of 'sex-specific' immunity beyond cisgender males and cisgender females, as transgender people on GAHT exhibit a unique molecular profile.
Assuntos
Identidade de Gênero , Pessoas Transgênero , Metilação de DNA , Feminino , Hormônios , Humanos , Recém-Nascido , MasculinoRESUMO
BACKGROUND: Sex differences are known to play a role in disease aetiology, progression and outcome. Previous studies have revealed autosomal epigenetic differences between males and females in some tissues, including differences in DNA methylation patterns. Here, we report for the first time an analysis of autosomal sex differences in DNAme using the Illumina EPIC array in human whole blood by performing a discovery (n = 1171) and validation (n = 2471) analysis. RESULTS: We identified and validated 396 sex-associated differentially methylated CpG sites (saDMPs) with the majority found to be female-biased CpGs (74%). These saDMP's are enriched in CpG islands and CpG shores and located preferentially at 5'UTRs, 3'UTRs and enhancers. Additionally, we identified 266 significant sex-associated differentially methylated regions overlapping genes, which have previously been shown to exhibit epigenetic sex differences, and novel genes. Transcription factor binding site enrichment revealed enrichment of transcription factors related to critical developmental processes and sex determination such as SRY and ESR1. CONCLUSION: Our study reports a reliable catalogue of sex-associated CpG sites and elucidates several characteristics of these sites using large-scale discovery and validation data sets. This resource will benefit future studies aiming to investigate sex specific epigenetic signatures and further our understanding of the role of DNA methylation in sex differences in human whole blood.
Assuntos
Metilação de DNA , Caracteres Sexuais , Ilhas de CpG , Epigênese Genética , Epigenômica , Feminino , Humanos , MasculinoRESUMO
Breast cancers of the luminal B subtype are frequent tumors with high proliferation and poor prognosis. Epigenetic alterations have been found in breast tumors and in biological fluids. We aimed to profile the cell-free DNA (cfDNA) methylome of metastatic luminal B breast cancer (LBBC) patients using an epigenomic approach to discover potential noninvasive biomarkers. Plasma cfDNA was analyzed using the Infinium MethylationEpic array in a cohort of 14 women, including metastatic LBBC patients and nontumor controls. The methylation levels of cfDNA and tissue samples were validated with droplet digital PCR. The methylation and gene expression data of 582 primary luminal breast tumors and 79 nontumor tissues were obtained from The Cancer Genome Atlas (TCGA). We found an episignature of 1,467 differentially methylated CpGs that clearly identified patients with LBBC. Among the genes identified, the promoter hypermethylation of WNT1 was validated in cfDNA, showing an area under the ROC curve (AUC) of 0.86 for the noninvasive detection of metastatic LBBC. Both paired cfDNA and primary/metastatic breast tumor samples showed hypermethylation of WNT1. TCGA analysis revealed significant WNT1 hypermethylation in the primary tumors of luminal breast cancer patients, with a negative association between WNT1 methylation and gene expression. In this proof-of-principle study, we discovered an episignature associated with metastatic LBBC using a genome-wide cfDNA methylation approach. We also identified the promoter hypermethylation of WNT1 in cfDNA as a potential noninvasive biomarker for luminal breast cancer. Our results support the use of EPIC arrays to identify new epigenetic noninvasive biomarkers in breast cancer.