ABSTRACT
Higher-order chromosomal organization for transcription regulation is poorly understood in eukaryotes. Using genome-wide Chromatin Interaction Analysis with Paired-End-Tag sequencing (ChIA-PET), we mapped long-range chromatin interactions associated with RNA polymerase II in human cells and uncovered widespread promoter-centered intragenic, extragenic, and intergenic interactions. These interactions further aggregated into higher-order clusters, wherein proximal and distal genes were engaged through promoter-promoter interactions. Most genes with promoter-promoter interactions were active and transcribed cooperatively, and some interacting promoters could influence each other, implying combinatorial complexity of transcriptional controls. Comparative analyses of different cell lines showed that cell-specific chromatin interactions could provide structural frameworks for cell-specific transcription, and suggested significant enrichment of enhancer-promoter interactions for cell-specific functions. Furthermore, genetically identified disease-associated noncoding elements were found to be spatially engaged with corresponding genes through long-range interactions. Overall, our study provides insights into transcription regulation by three-dimensional chromatin interactions for both housekeeping and cell-specific genes in human cells.
Subjects
Chromatin/metabolism , Gene Expression Regulation , Promoter Regions, Genetic , RNA Polymerase II/metabolism , Transcription, Genetic , Cell Line, Tumor , Chromatin Immunoprecipitation , Enhancer Elements, Genetic , Genome-Wide Association Study , Humans
ABSTRACT
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher-order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Subjects
Genome/genetics , Genomics , Mice/genetics , Molecular Sequence Annotation , Animals , Cell Lineage/genetics , Chromatin/genetics , Chromatin/metabolism , Conserved Sequence/genetics , DNA Replication/genetics , Deoxyribonuclease I/metabolism , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Genome-Wide Association Study , Humans , RNA/genetics , Regulatory Sequences, Nucleic Acid/genetics , Species Specificity , Transcription Factors/metabolism , Transcriptome/genetics
ABSTRACT
Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding and histone modifications in living cells. Despite its widespread use, there are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. These practices affect the quality and utility of any global ChIP experiment. Through our experience in performing ChIP-seq experiments, the ENCODE and modENCODE consortia have developed a set of working standards and guidelines for ChIP experiments that are updated routinely. The current guidelines address antibody validation, experimental replication, sequencing depth, data and metadata reporting, and data quality assessment. We discuss how ChIP quality, assessed in these ways, affects different uses of ChIP-seq data. All data sets used in the analysis have been deposited for public viewing and downloading at the ENCODE (http://encodeproject.org/ENCODE/) and modENCODE (http://www.modencode.org/) portals.
Subjects
Chromatin Immunoprecipitation/methods , Databases, Genetic , High-Throughput Nucleotide Sequencing/methods , Animals , Genome/genetics , Genomics/methods , Guidelines as Topic , Histones/metabolism , Humans , Internet , Transcription Factors/metabolism
ABSTRACT
Cis-regulatory modules (CRMs) function by binding sequence-specific transcription factors, but the relationship between in vivo physical binding and the regulatory capacity of factor-bound DNA elements remains uncertain. We investigate this relationship for the well-studied Twist factor in Drosophila melanogaster embryos by analyzing genome-wide factor occupancy and testing the functional significance of Twist-occupied regions and motifs within regions. Twist ChIP-seq data efficiently identified previously studied Twist-dependent CRMs and robustly predicted new CRM activity in transgenesis, with newly identified Twist-occupied regions supporting diverse spatiotemporal patterns (>74% positive, n = 31). Some, but not all, candidate CRMs require Twist for proper expression in the embryo. The Twist motifs most favored in genome ChIP data (in vivo) differed from those most favored by Systematic Evolution of Ligands by EXponential enrichment (SELEX) (in vitro). Furthermore, the majority of ChIP-seq signals could be parsimoniously explained by a CABVTG motif located within 50 bp of the ChIP summit and, of these, CACATG was most prevalent. Mutagenesis experiments demonstrated that different Twist E-box motif types are not fully interchangeable, suggesting that the ChIP-derived consensus (CABVTG) includes sites having distinct regulatory outputs. Further analysis of position, frequency of occurrence, and sequence conservation revealed significant enrichment and conservation of CABVTG E-box motifs near Twist ChIP-seq signal summits, preferential conservation of ±150 bp surrounding Twist-occupied summits, and enrichment of GA- and CA-repeat sequences near Twist-occupied summits. Our results show that high-resolution in vivo occupancy data can be used to drive efficient discovery and dissection of global and local cis-regulatory logic.
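As an illustration of the motif criterion in the abstract above (a CABVTG site within 50 bp of a ChIP summit), the following is a minimal Python sketch, not the authors' pipeline. The function names, the 0-based coordinates, and the rule that the whole 6-bp site must fall inside the window are assumptions made here for the example; the abstract does not say how distance was measured. The IUPAC codes themselves are standard (B = C, G or T; V = A, C or G).

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def iupac_to_regex(motif):
    """Translate an IUPAC motif such as 'CABVTG' into a regex string."""
    return "".join(IUPAC[base] for base in motif.upper())


def motif_hits_near_summit(seq, summit, motif="CABVTG", window=50):
    """Return (start, site) pairs for motif sites lying within `window` bp
    of `summit` (hypothetical helper; 0-based coordinates on `seq`).

    Assumption: a site counts only if it lies entirely inside
    [summit - window, summit + window].
    CABVTG equals its own reverse complement as an IUPAC pattern,
    so scanning one strand finds sites on both strands.
    """
    # A lookahead makes the scan report overlapping sites as well.
    pattern = re.compile("(?=(" + iupac_to_regex(motif) + "))")
    hits = []
    for match in pattern.finditer(seq.upper()):
        start = match.start()
        end = start + len(motif)
        if start >= summit - window and end <= summit + window:
            hits.append((start, match.group(1)))
    return hits


if __name__ == "__main__":
    # Toy sequence: one CACATG near the summit, one CAGCTG far from it.
    toy = "GG" + "CACATG" + "A" * 100 + "CAGCTG"
    print(motif_hits_near_summit(toy, summit=5))
```

On real data, `seq` would be the reference sequence around each called peak and `summit` the peak caller's summit position expressed in the same coordinates.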
Subjects
DNA/genetics , Drosophila/embryology , Drosophila/genetics , Evolution, Molecular , Twist-Related Protein 1/genetics , Twist-Related Protein 1/metabolism , Animals , Base Composition , Base Sequence , Binding Sites/genetics , Computational Biology , Consensus Sequence/genetics , Conserved Sequence , Gene Expression Regulation, Developmental , Molecular Sequence Data , Regulatory Elements, Transcriptional/genetics
ABSTRACT
BACKGROUND: Hundreds of genes, including muscle creatine kinase (MCK), are differentially expressed in fast- and slow-twitch muscle fibers, but the fiber type-specific regulatory mechanisms are not well understood. RESULTS: Modulatory region 1 (MR1) is a 1-kb regulatory region within MCK intron 1 that is highly active in terminally differentiating skeletal myocytes in vitro. An MCK small intronic enhancer (MCK-SIE) containing a paired E-box/myocyte enhancer factor 2 (MEF2) regulatory motif resides within MR1. The SIE's transcriptional activity equals that of the extensively characterized 206-bp MCK 5'-enhancer, but the MCK-SIE is flanked by regions that can repress its activity via the individual and combined effects of about 15 different but highly conserved 9- to 24-bp sequences. ChIP and ChIP-Seq analyses indicate that the SIE and the MCK 5'-enhancer are occupied by MyoD, myogenin and MEF2. Many other E-boxes located within or immediately adjacent to intron 1 are not occupied by MyoD or myogenin. Transgenic analysis of a 6.5-kb MCK genomic fragment containing the 5'-enhancer and proximal promoter plus the 3.2-kb intron 1, with and without MR1, indicates that MR1 is critical for MCK expression in slow- and intermediate-twitch muscle fibers (types I and IIa, respectively), but is not required for expression in fast-twitch muscle fibers (types IIb and IId). CONCLUSIONS: In this study, we discovered that MR1 is critical for MCK expression in slow- and intermediate-twitch muscle fibers and that MR1's positive transcriptional activity depends on a paired E-box/MEF2 site motif within an SIE. This is the first study to delineate the DNA controls for MCK expression in different skeletal muscle fiber types.