RESUMO
Empirical models of sequence evolution have spurred progress in the field of evolutionary genetics for decades. We are now realizing the importance and complexity of the eukaryotic epigenome. While epigenome analysis has been applied to genomes from single-cell eukaryotes to human, comparative analyses are still relatively few and computational algorithms to quantify epigenome evolution remain scarce. Accordingly, a quantitative model of epigenome evolution remains to be established. We review here the comparative epigenomics literature and synthesize its overarching themes. We also suggest one mechanism, transcription factor binding site (TFBS) turnover, which relates sequence evolution to epigenetic conservation or divergence. Lastly, we propose a framework for how the field can move forward to build a coherent quantitative model of epigenome evolution.
Assuntos
Evolução Molecular , Genoma , Genômica , Vertebrados/genética , Algoritmos , Animais , Biologia Computacional , Humanos , FilogeniaRESUMO
BACKGROUND: Uncovering mechanisms of epigenome evolution is an essential step towards understanding the evolution of different cellular phenotypes. While studies have confirmed DNA methylation as a conserved epigenetic mechanism in mammalian development, little is known about the conservation of tissue-specific genome-wide DNA methylation patterns. RESULTS: Using a comparative epigenomics approach, we identified and compared the tissue-specific DNA methylation patterns of rat against those of mouse and human across three shared tissue types. We confirmed that tissue-specific differentially methylated regions are strongly associated with tissue-specific regulatory elements. Comparisons between species revealed that at a minimum 11-37% of tissue-specific DNA methylation patterns are conserved, a phenomenon that we define as epigenetic conservation. Conserved DNA methylation is accompanied by conservation of other epigenetic marks including histone modifications. Although a significant amount of locus-specific methylation is epigenetically conserved, the majority of tissue-specific DNA methylation is not conserved across the species and tissue types that we investigated. Examination of the genetic underpinning of epigenetic conservation suggests that primary sequence conservation is a driving force behind epigenetic conservation. In contrast, evolutionary dynamics of tissue-specific DNA methylation are best explained by the maintenance or turnover of binding sites for important transcription factors. CONCLUSIONS: Our study extends the limited literature of comparative epigenomics and suggests a new paradigm for epigenetic conservation without genetic conservation through analysis of transcription factor binding sites.
Assuntos
Sequência Conservada , Metilação de DNA/genética , Animais , Sítios de Ligação , Epigenômica , Evolução Molecular , Humanos , Camundongos , Especificidade de Órgãos , Ratos , Fatores de Transcrição/metabolismoRESUMO
DNA methylation plays key roles in diverse biological processes such as X chromosome inactivation, transposable element repression, genomic imprinting, and tissue-specific gene expression. Sequencing-based DNA methylation profiling provides an unprecedented opportunity to map and compare complete DNA methylomes. This includes one of the most widely applied technologies for measuring DNA methylation: methylated DNA immunoprecipitation followed by sequencing (MeDIP-seq), coupled with a complementary method, methylation-sensitive restriction enzyme sequencing (MRE-seq). A computational approach that integrates data from these two different but complementary assays and predicts methylation differences between samples has been unavailable. Here, we present a novel integrative statistical framework M&M (for integration of MeDIP-seq and MRE-seq) that dynamically scales, normalizes, and combines MeDIP-seq and MRE-seq data to detect differentially methylated regions. Using sample-matched whole-genome bisulfite sequencing (WGBS) as a gold standard, we demonstrate superior accuracy and reproducibility of M&M compared to existing analytical methods for MeDIP-seq data alone. M&M leverages the complementary nature of MeDIP-seq and MRE-seq data to allow rapid comparative analysis between whole methylomes at a fraction of the cost of WGBS. Comprehensive analysis of nineteen human DNA methylomes with M&M reveals distinct DNA methylation patterns among different tissue types, cell types, and individuals, potentially underscoring divergent epigenetic regulation at different scales of phenotypic diversity. We find that differential DNA methylation at enhancer elements, with concurrent changes in histone modifications and transcription factor binding, is common at the cell, tissue, and individual levels, whereas promoter methylation is more prominent in reinforcing fundamental tissue identities.
Assuntos
Algoritmos , Metilação de DNA , Genoma Humano , Análise de Sequência de DNA/métodos , Interpretação Estatística de Dados , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Especificidade de ÓrgãosRESUMO
SUMMARY: We present methylC track, an efficient mechanism for visualizing single-base resolution DNA methylation data on a genome browser. The methylC track dynamically integrates the level of methylation, the position and context of the methylated cytosine (i.e. CG, CHG and CHH), strand and confidence level (e.g. read coverage depth in the case of whole-genome bisulfite sequencing data). Investigators can access and integrate these information visually at specific locus or at the genome-wide level on the WashU EpiGenome Browser in the context of other rich epigenomic datasets. AVAILABILITY AND IMPLEMENTATION: The methylC track is part of the WashU EpiGenome Browser, which is open source and freely available at http://epigenomegateway.wustl.edu/browser/. The most up-to-date instructions and tools for preparing methylC track are available at http://epigenomegateway.wustl.edu/+/cmtk. CONTACT: twang@genetics.wustl.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Metilação de DNA , Epigenômica/métodos , Navegador , Metilação de DNA/efeitos dos fármacos , Bases de Dados Genéticas , Genoma Humano/genética , Humanos , Análise de Sequência de DNA , Sulfitos/farmacologiaRESUMO
BACKGROUND: Aberrant DNA methylation is a hallmark of many cancers. Classically there are two types of endometrial cancer, endometrioid adenocarcinoma (EAC), or Type I, and uterine papillary serous carcinoma (UPSC), or Type II. However, the whole genome DNA methylation changes in these two classical types of endometrial cancer is still unknown. RESULTS: Here we described complete genome-wide DNA methylome maps of EAC, UPSC, and normal endometrium by applying a combined strategy of methylated DNA immunoprecipitation sequencing (MeDIP-seq) and methylation-sensitive restriction enzyme digestion sequencing (MRE-seq). We discovered distinct genome-wide DNA methylation patterns in EAC and UPSC: 27,009 and 15,676 recurrent differentially methylated regions (DMRs) were identified respectively, compared with normal endometrium. Over 80% of DMRs were in intergenic and intronic regions. The majority of these DMRs were not interrogated on the commonly used Infinium 450K array platform. Large-scale demethylation of chromosome X was detected in UPSC, accompanied by decreased XIST expression. Importantly, we discovered that the majority of the DMRs harbored promoter or enhancer functions and are specifically associated with genes related to uterine development and disease. Among these, abnormal methylation of transposable elements (TEs) may provide a novel mechanism to deregulate normal endometrium-specific enhancers derived from specific TEs. CONCLUSIONS: DNA methylation changes are an important signature of endometrial cancer and regulate gene expression by affecting not only proximal promoters but also distal enhancers.
Assuntos
Neoplasias do Endométrio/genética , Neoplasias do Endométrio/fisiopatologia , Elementos Facilitadores Genéticos/genética , Regiões Promotoras Genéticas/genética , Neoplasias Uterinas/genética , Neoplasias Uterinas/fisiopatologia , Proteínas Adaptadoras de Transdução de Sinal/genética , Família Aldeído Desidrogenase 1 , Carcinoma Papilar/genética , Carcinoma Papilar/metabolismo , Cromossomos Humanos X , Ilhas de CpG , DNA (Citosina-5-)-Metiltransferases/genética , DNA (Citosina-5-)-Metiltransferases/metabolismo , Metilação de DNA , Elementos de DNA Transponíveis/genética , Feminino , Humanos , Fator 4 Semelhante a Kruppel , Fatores de Transcrição Kruppel-Like/genética , Proteína 1 Homóloga a MutL , Proteínas Nucleares/genética , Polimorfismo de Nucleotídeo Único , RNA Longo não Codificante/genética , Retinal Desidrogenase/genética , Análise de Sequência de DNARESUMO
BACKGROUND: Zebrafish pigment cell differentiation provides an attractive model for studying cell fate progression as a neural crest progenitor engenders diverse cell types, including two morphologically distinct pigment cells: black melanophores and reflective iridophores. Nontrivial classical genetic and transcriptomic approaches have revealed essential molecular mechanisms and gene regulatory circuits that drive neural crest-derived cell fate decisions. However, how the epigenetic landscape contributes to pigment cell differentiation, especially in the context of iridophore cell fate, is poorly understood. RESULTS: We chart the global changes in the epigenetic landscape, including DNA methylation and chromatin accessibility, during neural crest differentiation into melanophores and iridophores to identify epigenetic determinants shaping cell type-specific gene expression. Motif enrichment in the epigenetically dynamic regions reveals putative transcription factors that might be responsible for driving pigment cell identity. Through this effort, in the relatively uncharacterized iridophores, we validate alx4a as a necessary and sufficient transcription factor for iridophore differentiation and present evidence on alx4a's potential regulatory role in guanine synthesis pathway. CONCLUSIONS: Pigment cell fate is marked by substantial DNA demethylation events coupled with dynamic chromatin accessibility to potentiate gene regulation through cis-regulatory control. Here, we provide a multi-omic resource for neural crest differentiation into melanophores and iridophores. This work led to the discovery and validation of iridophore-specific alx4a transcription factor.
Assuntos
Diferenciação Celular/genética , Cromatóforos/metabolismo , Epigênese Genética , Melanóforos/metabolismo , Peixe-Zebra/genética , Animais , Cromatina/metabolismo , Ilhas de CpG , Metilação de DNA , Redes Reguladoras de Genes , Crista Neural/citologia , Crista Neural/metabolismo , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismo , Fatores de Transcrição/fisiologia , Transcrição Gênica , Peixe-Zebra/metabolismo , Proteínas de Peixe-Zebra/metabolismo , Proteínas de Peixe-Zebra/fisiologiaRESUMO
The human placenta and its specialized cytotrophoblasts rapidly develop, have a compressed lifespan, govern pregnancy outcomes, and program the offspring's health. Understanding the molecular underpinnings of these behaviors informs development and disease. Profiling the extraembryonic epigenome and transcriptome during the 2nd and 3rd trimesters revealed H3K9 trimethylation overlapping deeply DNA hypomethylated domains with reduced gene expression and compartment-specific patterns that illuminated their functions. Cytotrophoblast DNA methylation increased, and several key histone modifications decreased across the genome as pregnancy advanced. Cytotrophoblasts from severe preeclampsia had substantially increased H3K27 acetylation globally and at genes that are normally downregulated at term but upregulated in this syndrome. In addition, some cases had an immature pattern of H3K27ac peaks, and others showed evidence of accelerated aging, suggesting subtype-specific alterations in severe preeclampsia. Thus, the cytotrophoblast epigenome dramatically reprograms during pregnancy, placental disease is associated with failures in this process, and H3K27 hyperacetylation is a feature of severe preeclampsia.
Assuntos
Epigenoma , Doenças Placentárias/genética , Doenças Placentárias/patologia , Trofoblastos/metabolismo , Trofoblastos/patologia , Acetilação , Metilação de DNA/genética , Elementos Facilitadores Genéticos/genética , Feminino , Regulação da Expressão Gênica no Desenvolvimento , Idade Gestacional , Histonas/metabolismo , Humanos , Lisina/metabolismo , Pré-Eclâmpsia/genética , Gravidez , Processamento de Proteína Pós-TraducionalRESUMO
Evidence that noncoding mutation can result in cancer driver events is mounting. However, it is more difficult to assign molecular biological consequences to noncoding mutations than to coding mutations, and a typical cancer genome contains many more noncoding mutations than protein-coding mutations. Accordingly, parsing functional noncoding mutation signal from noise remains an important challenge. Here we use an empirical approach to identify putatively functional noncoding somatic single nucleotide variants (SNVs) from liver cancer genomes. Annotation of candidate variants by publicly available epigenome datasets finds that 40.5% of SNVs fall in regulatory elements. When assigned to specific regulatory elements, we find that the distribution of regulatory element mutation mirrors that of nonsynonymous coding mutation, where few regulatory elements are recurrently mutated in a patient population but many are singly mutated. We find potential gain-of-binding site events among candidate SNVs, suggesting a mechanism of action for these variants. When aggregating noncoding somatic mutation in promoters, we find that genes in the ERBB signaling and MAPK signaling pathways are significantly enriched for promoter mutations. Altogether, our results suggest that functional somatic SNVs in cancer are sporadic, but occasionally occur in regulatory elements and may affect phenotype by creating binding sites for transcriptional regulators. Accordingly, we propose that noncoding mutation should be formally accounted for when determining gene- and pathway-mutation burden in cancer.
Assuntos
Epigenômica , Neoplasias Hepáticas/genética , Polimorfismo de Nucleotídeo Único , Cromatina/metabolismo , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Neoplasias Hepáticas/metabolismo , Modelos Genéticos , Polimorfismo de Nucleotídeo Único/genética , Elementos Reguladores de Transcrição/genéticaRESUMO
DNA methylation undergoes dynamic changes during development and cell differentiation. Recent genome-wide studies discovered that tissue-specific differentially methylated regions (DMRs) often overlap tissue-specific distal cis-regulatory elements. However, developmental DNA methylation dynamics of the majority of the genomic CpGs outside gene promoters and CpG islands has not been extensively characterized. Here, we generate and compare comprehensive DNA methylome maps of zebrafish developing embryos. From these maps, we identify thousands of developmental stage-specific DMRs (dsDMRs) across zebrafish developmental stages. The dsDMRs contain evolutionarily conserved sequences, are associated with developmental genes and are marked with active enhancer histone posttranslational modifications. Their methylation pattern correlates much stronger than promoter methylation with expression of putative target genes. When tested in vivo using a transgenic zebrafish assay, 20 out of 20 selected candidate dsDMRs exhibit functional enhancer activities. Our data suggest that developmental enhancers are a major target of DNA methylation changes during embryogenesis.
Assuntos
Metilação de DNA , Elementos Facilitadores Genéticos , Regulação da Expressão Gênica no Desenvolvimento , Peixe-Zebra/embriologia , Animais , Ilhas de CpG , DNA/metabolismo , Desenvolvimento Embrionário/genética , Epigênese Genética , Feminino , Biblioteca Gênica , Redes Reguladoras de Genes , Proteínas de Fluorescência Verde/metabolismo , Histonas/química , Masculino , Regiões Promotoras Genéticas , Análise de Sequência de DNA , Transdução de SinaisRESUMO
Developmental history shapes the epigenome and biological function of differentiated cells. Epigenomic patterns have been broadly attributed to the three embryonic germ layers. Here we investigate how developmental origin influences epigenomes. We compare key epigenomes of cell types derived from surface ectoderm (SE), including keratinocytes and breast luminal and myoepithelial cells, against neural crest-derived melanocytes and mesoderm-derived dermal fibroblasts, to identify SE differentially methylated regions (SE-DMRs). DNA methylomes of neonatal keratinocytes share many more DMRs with adult breast luminal and myoepithelial cells than with melanocytes and fibroblasts from the same neonatal skin. This suggests that SE origin contributes to DNA methylation patterning, while shared skin tissue environment has limited effect on epidermal keratinocytes. Hypomethylated SE-DMRs are in proximity to genes with SE relevant functions. They are also enriched for enhancer- and promoter-associated histone modifications in SE-derived cells, and for binding motifs of transcription factors important in keratinocyte and mammary gland biology. Thus, epigenomic analysis of cell types with common developmental origin reveals an epigenetic signature that underlies a shared gene regulatory network.
Assuntos
Ectoderma/metabolismo , Epigênese Genética , Redes Reguladoras de Genes , Diferenciação Celular , Células Cultivadas , Metilação de DNA , Ectoderma/citologia , Fibroblastos/citologia , Fibroblastos/metabolismo , Humanos , Queratinócitos/citologia , Queratinócitos/metabolismo , Melanócitos/citologia , Melanócitos/metabolismo , Regiões Promotoras Genéticas , Especificidade da EspécieRESUMO
Transposable element (TE)-derived sequences comprise half of the human genome and DNA methylome and are presumed to be densely methylated and inactive. Examination of genome-wide DNA methylation status within 928 TE subfamilies in human embryonic and adult tissues identified unexpected tissue-specific and subfamily-specific hypomethylation signatures. Genes proximal to tissue-specific hypomethylated TE sequences were enriched for functions important for the relevant tissue type, and their expression correlated strongly with hypomethylation within the TEs. When hypomethylated, these TE sequences gained tissue-specific enhancer marks, including monomethylation of histone H3 at lysine 4 (H3K4me1) and occupancy by p300, and a majority exhibited enhancer activity in reporter gene assays. Many such TEs also harbored binding sites for transcription factors that are important for tissue-specific functions and showed evidence of evolutionary selection. These data suggest that sequences derived from TEs may be responsible for wiring tissue type-specific regulatory networks and may have acquired tissue-specific epigenetic regulation.