Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 33
Filtrar
1.
Cell ; 186(7): 1493-1511.e40, 2023 03 30.
Artigo em Inglês | MEDLINE | ID: mdl-37001506

RESUMO

Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (∼30 tissues × âˆ¼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.


Assuntos
Epigenoma , Locos de Características Quantitativas , Estudo de Associação Genômica Ampla , Genômica , Fenótipo , Polimorfismo de Nucleotídeo Único
2.
Cell ; 175(5): 1228-1243.e20, 2018 11 15.
Artigo em Inglês | MEDLINE | ID: mdl-30392959

RESUMO

Genetic drivers of cancer can be dysregulated through epigenetic modifications of DNA. Although the critical role of DNA 5-methylcytosine (5mC) in the regulation of transcription is recognized, the functions of other non-canonical DNA modifications remain obscure. Here, we report the identification of novel N6-methyladenine (N6-mA) DNA modifications in human tissues and implicate this epigenetic mark in human disease, specifically the highly malignant brain cancer glioblastoma. Glioblastoma markedly upregulated N6-mA levels, which co-localized with heterochromatic histone modifications, predominantly H3K9me3. N6-mA levels were dynamically regulated by the DNA demethylase ALKBH1, depletion of which led to transcriptional silencing of oncogenic pathways through decreasing chromatin accessibility. Targeting the N6-mA regulator ALKBH1 in patient-derived human glioblastoma models inhibited tumor cell proliferation and extended the survival of tumor-bearing mice, supporting this novel DNA modification as a potential therapeutic target for glioblastoma. Collectively, our results uncover a novel epigenetic node in cancer through the DNA modification N6-mA.


Assuntos
Adenina/análogos & derivados , Neoplasias Encefálicas/patologia , Metilação de DNA , Glioblastoma/patologia , Adenina/análise , Adenina/química , Adulto , Idoso , Homólogo AlkB 1 da Histona H2a Dioxigenase/antagonistas & inibidores , Homólogo AlkB 1 da Histona H2a Dioxigenase/genética , Homólogo AlkB 1 da Histona H2a Dioxigenase/metabolismo , Animais , Astrócitos/citologia , Astrócitos/metabolismo , Neoplasias Encefálicas/metabolismo , Neoplasias Encefálicas/mortalidade , Hipóxia Celular , Criança , Epigenômica , Feminino , Glioblastoma/metabolismo , Glioblastoma/mortalidade , Heterocromatina/metabolismo , Histonas/metabolismo , Humanos , Estimativa de Kaplan-Meier , Masculino , Camundongos , Pessoa de Meia-Idade , Células-Tronco Neoplásicas/citologia , Células-Tronco Neoplásicas/metabolismo , Interferência de RNA , RNA Interferente Pequeno/metabolismo , Proteína Supressora de Tumor p53/metabolismo
3.
Cell ; 162(4): 900-10, 2015 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-26276636

RESUMO

CTCF and the associated cohesin complex play a central role in insulator function and higher-order chromatin organization of mammalian genomes. Recent studies identified a correlation between the orientation of CTCF-binding sites (CBSs) and chromatin loops. To test the functional significance of this observation, we combined CRISPR/Cas9-based genomic-DNA-fragment editing with chromosome-conformation-capture experiments to show that the location and relative orientations of CBSs determine the specificity of long-range chromatin looping in mammalian genomes, using protocadherin (Pcdh) and ß-globin as model genes. Inversion of CBS elements within the Pcdh enhancer reconfigures the topology of chromatin loops between the distal enhancer and target promoters and alters gene-expression patterns. Thus, although enhancers can function in an orientation-independent manner in reporter assays, in the native chromosome context, the orientation of at least some enhancers carrying CBSs can determine both the architecture of topological chromatin domains and enhancer/promoter specificity. These findings reveal how 3D chromosome architecture can be encoded by linear genome sequences.


Assuntos
Cromossomos/metabolismo , Técnicas Genéticas , Proteínas Repressoras/metabolismo , Animais , Sítios de Ligação , Fator de Ligação a CCCTC , Caderinas/genética , Proteínas de Ciclo Celular/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Cromossomos/química , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , DNA/química , Elementos Facilitadores Genéticos , Expressão Gênica , Genoma Humano , Humanos , Células K562 , Camundongos , Regiões Promotoras Genéticas , Globinas beta/genética , Coesinas
4.
Cell ; 155(5): 1022-33, 2013 Nov 21.
Artigo em Inglês | MEDLINE | ID: mdl-24267888

RESUMO

Sequence polymorphisms linked to human diseases and phenotypes in genome-wide association studies often affect noncoding regions. A SNP within an intron of the gene encoding Interferon Regulatory Factor 4 (IRF4), a transcription factor with no known role in melanocyte biology, is strongly associated with sensitivity of skin to sun exposure, freckles, blue eyes, and brown hair color. Here, we demonstrate that this SNP lies within an enhancer of IRF4 transcription in melanocytes. The allele associated with this pigmentation phenotype impairs binding of the TFAP2A transcription factor that, together with the melanocyte master regulator MITF, regulates activity of the enhancer. Assays in zebrafish and mice reveal that IRF4 cooperates with MITF to activate expression of Tyrosinase (TYR), an essential enzyme in melanin synthesis. Our findings provide a clear example of a noncoding polymorphism that affects a phenotype by modulating a developmental gene regulatory network.


Assuntos
Fatores Reguladores de Interferon/metabolismo , Polimorfismo de Nucleotídeo Único , Animais , Sequência de Bases , Elementos Facilitadores Genéticos , Humanos , Fatores Reguladores de Interferon/química , Fatores Reguladores de Interferon/genética , Melanócitos/metabolismo , Camundongos , Dados de Sequência Molecular , Pigmentação , Transdução de Sinais , Fator de Transcrição AP-2/química , Fator de Transcrição AP-2/metabolismo , Peixe-Zebra
5.
Nature ; 594(7863): 398-402, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-34012112

RESUMO

Genetic risk variants that have been identified in genome-wide association studies of complex diseases are primarily non-coding1. Translating these risk variants into mechanistic insights requires detailed maps of gene regulation in disease-relevant cell types2. Here we combined two approaches: a genome-wide association study of type 1 diabetes (T1D) using 520,580 samples, and the identification of candidate cis-regulatory elements (cCREs) in pancreas and peripheral blood mononuclear cells using single-nucleus assay for transposase-accessible chromatin with sequencing (snATAC-seq) of 131,554 nuclei. Risk variants for T1D were enriched in cCREs that were active in T cells and other cell types, including acinar and ductal cells of the exocrine pancreas. Risk variants at multiple T1D signals overlapped with exocrine-specific cCREs that were linked to genes with exocrine-specific expression. At the CFTR locus, the T1D risk variant rs7795896 mapped to a ductal-specific cCRE that regulated CFTR; the risk allele reduced transcription factor binding, enhancer activity and CFTR expression in ductal cells. These findings support a role for the exocrine pancreas in the pathogenesis of T1D and highlight the power of large-scale genome-wide association studies and single-cell epigenomics for understanding the cellular origins of complex disease.


Assuntos
Diabetes Mellitus Tipo 1/genética , Epigenômica , Predisposição Genética para Doença , Análise de Célula Única , Cromatina/genética , Regulador de Condutância Transmembrana em Fibrose Cística/genética , Feminino , Regulação da Expressão Gênica , Estudo de Associação Genômica Ampla , Humanos , Imunidade/genética , Masculino , Ductos Pancreáticos/metabolismo , Ductos Pancreáticos/patologia
6.
Nature ; 598(7879): 129-136, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34616068

RESUMO

The mammalian cerebrum performs high-level sensory perception, motor control and cognitive functions through highly specialized cortical and subcortical structures1. Recent surveys of mouse and human brains with single-cell transcriptomics2-6 and high-throughput imaging technologies7,8 have uncovered hundreds of neural cell types distributed in different brain regions, but the transcriptional regulatory programs that are responsible for the unique identity and function of each cell type remain unknown. Here we probe the accessible chromatin in more than 800,000 individual nuclei from 45 regions that span the adult mouse isocortex, olfactory bulb, hippocampus and cerebral nuclei, and use the resulting data to map the state of 491,818 candidate cis-regulatory DNA elements in 160 distinct cell types. We find high specificity of spatial distribution for not only excitatory neurons, but also most classes of inhibitory neurons and a subset of glial cell types. We characterize the gene regulatory sequences associated with the regional specificity within these cell types. We further link a considerable fraction of the cis-regulatory elements to putative target genes expressed in diverse cerebral cell types and predict transcriptional regulators that are involved in a broad spectrum of molecular and cellular pathways in different neuronal and glial cell populations. Our results provide a foundation for comprehensive analysis of gene regulatory programs of the mammalian brain and assist in the interpretation of noncoding risk variants associated with various neurological diseases and traits in humans.


Assuntos
Cérebro/citologia , Cérebro/metabolismo , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Atlas como Assunto , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Montagem e Desmontagem da Cromatina , Regulação da Expressão Gênica , Predisposição Genética para Doença/genética , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Doenças do Sistema Nervoso/genética , Neuroglia/classificação , Neuroglia/metabolismo , Neurônios/classificação , Neurônios/metabolismo , Análise de Sequência de DNA , Análise de Célula Única
7.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38711367

RESUMO

Hi-C data are commonly normalized using single sample processing methods, with focus on comparisons between regions within a given contact map. Here, we aim to compare contact maps across different samples. We demonstrate that unwanted variation, of likely technical origin, is present in Hi-C data with replicates from different individuals, and that properties of this unwanted variation change across the contact map. We present band-wise normalization and batch correction, a method for normalization and batch correction of Hi-C data and show that it substantially improves comparisons across samples, including in a quantitative trait loci analysis as well as differential enrichment across cell types.


Assuntos
Locos de Características Quantitativas , Humanos , Biologia Computacional
8.
Nature ; 583(7818): 752-759, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32728242

RESUMO

Cytosine DNA methylation is essential for mammalian development but understanding of its spatiotemporal distribution in the developing embryo remains limited1,2. Here, as part of the mouse Encyclopedia of DNA Elements (ENCODE) project, we profiled 168 methylomes from 12 mouse tissues or organs at 9 developmental stages from embryogenesis to adulthood. We identified 1,808,810 genomic regions that showed variations in CG methylation by comparing the methylomes of different tissues or organs from different developmental stages. These DNA elements predominantly lose CG methylation during fetal development, whereas the trend is reversed after birth. During late stages of fetal development, non-CG methylation accumulated within the bodies of key developmental transcription factor genes, coinciding with their transcriptional repression. Integration of genome-wide DNA methylation, histone modification and chromatin accessibility data enabled us to predict 461,141 putative developmental tissue-specific enhancers, the human orthologues of which were enriched for disease-associated genetic variants. These spatiotemporal epigenome maps provide a resource for studies of gene regulation during tissue or organ progression, and a starting point for investigating regulatory elements that are involved in human developmental disorders.


Assuntos
Metilação de DNA , Epigenoma , Feto/embriologia , Feto/metabolismo , Animais , Animais Recém-Nascidos , Cromatina/genética , Cromatina/metabolismo , Doença/genética , Regulação para Baixo , Elementos Facilitadores Genéticos/genética , Repressão Epigenética , Feminino , Inativação Gênica , Humanos , Camundongos , Camundongos Endogâmicos C57BL , Modelos Animais , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Análise Espaço-Temporal
9.
Nature ; 583(7818): 744-751, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32728240

RESUMO

The Encyclopedia of DNA Elements (ENCODE) project has established a genomic resource for mammalian development, profiling a diverse panel of mouse tissues at 8 developmental stages from 10.5 days after conception until birth, including transcriptomes, methylomes and chromatin states. Here we systematically examined the state and accessibility of chromatin in the developing mouse fetus. In total we performed 1,128 chromatin immunoprecipitation with sequencing (ChIP-seq) assays for histone modifications and 132 assay for transposase-accessible chromatin using sequencing (ATAC-seq) assays for chromatin accessibility across 72 distinct tissue-stages. We used integrative analysis to develop a unified set of chromatin state annotations, infer the identities of dynamic enhancers and key transcriptional regulators, and characterize the relationship between chromatin state and accessibility during developmental gene regulation. We also leveraged these data to link enhancers to putative target genes and demonstrate tissue-specific enrichments of sequence variants associated with disease in humans. The mouse ENCODE data sets provide a compendium of resources for biomedical researchers and achieve, to our knowledge, the most comprehensive view of chromatin dynamics during mammalian fetal development to date.


Assuntos
Cromatina/genética , Cromatina/metabolismo , Conjuntos de Dados como Assunto , Desenvolvimento Fetal/genética , Histonas/metabolismo , Anotação de Sequência Molecular , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Cromatina/química , Sequenciamento de Cromatina por Imunoprecipitação , Doença/genética , Elementos Facilitadores Genéticos/genética , Feminino , Regulação da Expressão Gênica no Desenvolvimento/genética , Variação Genética , Histonas/química , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Especificidade de Órgãos/genética , Reprodutibilidade dos Testes , Transposases/metabolismo
11.
Nature ; 583(7818): 699-710, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32728249

RESUMO

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.


Assuntos
DNA/genética , Bases de Dados Genéticas , Genoma/genética , Genômica , Anotação de Sequência Molecular , Sistema de Registros , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Cromatina/genética , Cromatina/metabolismo , DNA/química , Pegada de DNA , Metilação de DNA/genética , Período de Replicação do DNA , Desoxirribonuclease I/metabolismo , Genoma Humano , Histonas/metabolismo , Humanos , Camundongos , Camundongos Transgênicos , Proteínas de Ligação a RNA/genética , Transcrição Gênica/genética , Transposases/metabolismo
12.
PLoS Genet ; 19(6): e1010759, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37289818

RESUMO

Gene regulation is highly cell type-specific and understanding the function of non-coding genetic variants associated with complex traits requires molecular phenotyping at cell type resolution. In this study we performed single nucleus ATAC-seq (snATAC-seq) and genotyping in peripheral blood mononuclear cells from 13 individuals. Clustering chromatin accessibility profiles of 96,002 total nuclei identified 17 immune cell types and sub-types. We mapped chromatin accessibility QTLs (caQTLs) in each immune cell type and sub-type using individuals of European ancestry which identified 6,901 caQTLs at FDR < .10 and 4,220 caQTLs at FDR < .05, including those obscured from assays of bulk tissue such as with divergent effects on different cell types. For 3,941 caQTLs we further annotated putative target genes of variant activity using single cell co-accessibility, and caQTL variants were significantly correlated with the accessibility level of linked gene promoters. We fine-mapped loci associated with 16 complex immune traits and identified immune cell caQTLs at 622 candidate causal variants, including those with cell type-specific effects. At the 6q15 locus associated with type 1 diabetes, in line with previous reports, variant rs72928038 was a naïve CD4+ T cell caQTL linked to BACH2 and we validated the allelic effects of this variant on regulatory activity in Jurkat T cells. These results highlight the utility of snATAC-seq for mapping genetic effects on accessible chromatin in specific cell types.


Assuntos
Sequenciamento de Cromatina por Imunoprecipitação , Cromatina , Humanos , Cromatina/genética , Herança Multifatorial , Leucócitos Mononucleares , Locos de Características Quantitativas/genética
13.
Mol Cell ; 62(5): 668-80, 2016 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-27259200

RESUMO

How eukaryotic chromosomes fold inside the nucleus is an age-old question that remains unanswered today. Early biochemical and microscopic studies revealed the existence of chromatin domains and loops as a pervasive feature of interphase chromosomes, but the biological implications of such organizational features were obscure. Genome-wide analysis of pair-wise chromatin interactions using chromatin conformation capture (3C)-based techniques has shed new light on the organization of chromosomes in interphase nuclei. Particularly, the finding of cell-type invariant, evolutionarily conserved topologically associating domains (TADs) in a broad spectrum of cell types has provided a new molecular framework for the study of animal development and human diseases. Here, we review recent progress in characterization of such chromatin domains and delineation of mechanisms of their formation in animal cells.


Assuntos
Núcleo Celular/metabolismo , Montagem e Desmontagem da Cromatina , Cromatina/metabolismo , Cromossomos/metabolismo , Genoma , Interfase , Animais , Cromatina/química , Cromatina/genética , Cromossomos/química , Cromossomos/genética , Regulação da Expressão Gênica , Genômica/métodos , Humanos , Conformação de Ácido Nucleico , Conformação Proteica , Relação Estrutura-Atividade
15.
Genomics ; 114(3): 110330, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-35278615

RESUMO

Primary hepatocytes are widely used in the pharmaceutical industry to screen drug candidates for hepatotoxicity, but hepatocytes quickly dedifferentiate and lose their mature metabolic function in culture. Attempts have been made to better recapitulate the in vivo liver environment in culture, but the full spectrum of signals required to maintain hepatocyte function ex vivo remains elusive. To elucidate molecular changes that accompany, and may contribute to dedifferentiation of hepatocytes ex vivo, we performed lineage tracing and comprehensive profiling of alterations in their gene expression profiles and chromatin landscape during culture. First, using genetically tagged hepatocytes we demonstrate that expression of the fetal gene alpha-fetoprotein in cultured hepatocytes comes from cells that previously expressed the mature gene albumin, and not from a population of albumin-negative precursor cells, proving mature hepatocytes undergo true dedifferentiation in culture. Next we studied the dedifferentiation process in detail through bulk RNA-sequencing of hepatocytes cultured over an extended period. We identified three distinct phases of dedifferentiation: an early phase, where mature hepatocyte genes are rapidly downregulated in a matter of hours; a middle phase, where fetal genes are activated; and a late phase, where initially rare contaminating non-parenchymal cells proliferate, taking over the culture. Lastly, to better understand the signaling events that result in the rapid downregulation of mature genes in hepatocytes, we examined changes in chromatin accessibility in these cells during the first 24 h of culture using Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq). We find that drastic and rapid changes in chromatin accessibility occur immediately upon the start of culture. Using binding motif analysis of the areas of open chromatin sharing similar temporal profiles, we identify several candidate transcription factors potentially involved in the dedifferentiation of primary hepatocytes in culture.


Assuntos
Hepatócitos , Fígado , Células Cultivadas , Hepatócitos/metabolismo , Albuminas , Cromatina/genética
16.
Hum Genomics ; 15(1): 44, 2021 07 13.
Artigo em Inglês | MEDLINE | ID: mdl-34256850

RESUMO

BACKGROUND: Previous research in autism and other neurodevelopmental disorders (NDDs) has indicated an important contribution of protein-coding (coding) de novo variants (DNVs) within specific genes. The role of de novo noncoding variation has been observable as a general increase in genetic burden but has yet to be resolved to individual functional elements. In this study, we assessed whole-genome sequencing data in 2671 families with autism (discovery cohort of 516 families, replication cohort of 2155 families). We focused on DNVs in enhancers with characterized in vivo activity in the brain and identified an excess of DNVs in an enhancer named hs737. RESULTS: We adapted the fitDNM statistical model to work in noncoding regions and tested enhancers for excess of DNVs in families with autism. We found only one enhancer (hs737) with nominal significance in the discovery (p = 0.0172), replication (p = 2.5 × 10-3), and combined dataset (p = 1.1 × 10-4). Each individual with a DNV in hs737 had shared phenotypes including being male, intact cognitive function, and hypotonia or motor delay. Our in vitro assessment of the DNVs showed they all reduce enhancer activity in a neuronal cell line. By epigenomic analyses, we found that hs737 is brain-specific and targets the transcription factor gene EBF3 in human fetal brain. EBF3 is genome-wide significant for coding DNVs in NDDs (missense p = 8.12 × 10-35, loss-of-function p = 2.26 × 10-13) and is widely expressed in the body. Through characterization of promoters bound by EBF3 in neuronal cells, we saw enrichment for binding to NDD genes (p = 7.43 × 10-6, OR = 1.87) involved in gene regulation. Individuals with coding DNVs have greater phenotypic severity (hypotonia, ataxia, and delayed development syndrome [HADDS]) in comparison to individuals with noncoding DNVs that have autism and hypotonia. CONCLUSIONS: In this study, we identify DNVs in the hs737 enhancer in individuals with autism. Through multiple approaches, we find hs737 targets the gene EBF3 that is genome-wide significant in NDDs. By assessment of noncoding variation and the genes they affect, we are beginning to understand their impact on gene regulatory networks in NDDs.


Assuntos
Transtorno Autístico/genética , Predisposição Genética para Doença , Hipotonia Muscular/genética , Transtornos do Neurodesenvolvimento/genética , Fatores de Transcrição/genética , Transtorno Autístico/epidemiologia , Transtorno Autístico/patologia , Elementos Facilitadores Genéticos/genética , Exoma/genética , Feminino , Redes Reguladoras de Genes/genética , Humanos , Masculino , Hipotonia Muscular/epidemiologia , Hipotonia Muscular/patologia , Mutação/genética , Transtornos do Neurodesenvolvimento/epidemiologia , Transtornos do Neurodesenvolvimento/patologia , Neurônios/metabolismo , Neurônios/patologia
18.
Proc Natl Acad Sci U S A ; 114(9): E1633-E1640, 2017 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-28193886

RESUMO

Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulatory element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared with existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types. REPTILE is available at https://github.com/yupenghe/REPTILE/.


Assuntos
Elementos Facilitadores Genéticos/genética , Epigênese Genética/genética , Regulação da Expressão Gênica/genética , Algoritmos , Biologia Computacional/métodos , Metilação de DNA/genética , Epigenômica/métodos , Genômica/métodos , Código das Histonas/genética , Histonas/genética , Humanos , Transcrição Gênica/genética
19.
Hum Mol Genet ; 24(19): 5433-50, 2015 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-26206884

RESUMO

SOX10 is required for melanocyte development and maintenance, and has been linked to melanoma initiation and progression. However, the molecular mechanisms by which SOX10 guides the appropriate gene expression programs necessary to promote the melanocyte lineage are not fully understood. Here we employ genetic and epigenomic analysis approaches to uncover novel genomic targets and previously unappreciated molecular roles of SOX10 in melanocytes. Through global analysis of SOX10-binding sites and epigenetic characteristics of chromatin states, we uncover an extensive catalog of SOX10 targets genome-wide. Our findings reveal that SOX10 predominantly engages 'open' chromatin regions and binds to distal regulatory elements, including novel and previously known melanocyte enhancers. Integrated chromatin occupancy and transcriptome analysis suggest a role for SOX10 in both transcriptional activation and repression to regulate functionally distinct classes of genes. We demonstrate that distinct epigenetic signatures and cis-regulatory sequence motifs predicted to bind putative co-regulatory transcription factors define SOX10-activated and SOX10-repressed target genes. Collectively, these findings uncover a central role of SOX10 as a global regulator of gene expression in the melanocyte lineage by targeting diverse regulatory pathways.


Assuntos
Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Melanócitos/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Fatores de Transcrição SOXE/metabolismo , Animais , Sítios de Ligação , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Epigenômica/métodos , Melanócitos/citologia , Camundongos , Fatores de Transcrição SOXE/química , Fatores de Transcrição SOXE/genética
20.
Genome Res ; 22(11): 2290-301, 2012 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-23019145

RESUMO

We take a comprehensive approach to the study of regulatory control of gene expression in melanocytes that proceeds from large-scale enhancer discovery facilitated by ChIP-seq; to rigorous validation in silico, in vitro, and in vivo; and finally to the use of machine learning to elucidate a regulatory vocabulary with genome-wide predictive power. We identify 2489 putative melanocyte enhancer loci in the mouse genome by ChIP-seq for EP300 and H3K4me1. We demonstrate that these putative enhancers are evolutionarily constrained, enriched for sequence motifs predicted to bind key melanocyte transcription factors, located near genes relevant to melanocyte biology, and capable of driving reporter gene expression in melanocytes in culture (86%; 43/50) and in transgenic zebrafish (70%; 7/10). Next, using the sequences of these putative enhancers as a training set for a supervised machine learning algorithm, we develop a vocabulary of 6-mers predictive of melanocyte enhancer function. Lastly, we demonstrate that this vocabulary has genome-wide predictive power in both the mouse and human genomes. This study provides deep insight into the regulation of gene expression in melanocytes and demonstrates a powerful approach to the investigation of regulatory sequences that can be applied to other cell types.


Assuntos
Inteligência Artificial , Imunoprecipitação da Cromatina/métodos , Elementos Facilitadores Genéticos , Melanócitos/metabolismo , Algoritmos , Animais , Proteína p300 Associada a E1A/genética , Proteína p300 Associada a E1A/metabolismo , Evolução Molecular , Regulação da Expressão Gênica , Genes Reporter , Genoma Humano , Histonas/metabolismo , Humanos , Camundongos , Análise de Sequência de DNA/métodos , Fatores de Transcrição/metabolismo , Peixe-Zebra
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA