RESUMO
In higher eukaryotes, many genes are regulated by enhancers that are 104-106 base pairs (bp) away from the promoter. Enhancers contain transcription-factor-binding sites (which are typically around 7-22 bp), and physical contact between the promoters and enhancers is thought to be required to modulate gene expression. Although chromatin architecture has been mapped extensively at resolutions of 1 kilobase and above; it has not been possible to define physical contacts at the scale of the proteins that determine gene expression. Here we define these interactions in detail using a chromosome conformation capture method (Micro-Capture-C) that enables the physical contacts between different classes of regulatory elements to be determined at base-pair resolution. We find that highly punctate contacts occur between enhancers, promoters and CCCTC-binding factor (CTCF) sites and we show that transcription factors have an important role in the maintenance of the contacts between enhancers and promoters. Our data show that interactions between CTCF sites are increased when active promoters and enhancers are located within the intervening chromatin. This supports a model in which chromatin loop extrusion1 is dependent on cohesin loading at active promoters and enhancers, which explains the formation of tissue-specific chromatin domains without changes in CTCF binding.
Assuntos
Pareamento de Bases/genética , Genoma/genética , Animais , Sítios de Ligação , Fator de Ligação a CCCTC/metabolismo , Proteínas de Ciclo Celular/metabolismo , Células Cultivadas , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Elementos Facilitadores Genéticos/genética , Células Eritroides/citologia , Células Eritroides/metabolismo , Regulação da Expressão Gênica , Camundongos , Camundongos Endogâmicos C57BL , Especificidade de Órgãos , Regiões Promotoras Genéticas/genética , alfa-Globinas/genética , CoesinasRESUMO
The human genome contains â¼30,000 CpG islands (CGIs). While CGIs associated with promoters nearly always remain unmethylated, many of the â¼9,000 CGIs lying within gene bodies become methylated during development and differentiation. Both promoter and intragenic CGIs may also become abnormally methylated as a result of genome rearrangements and in malignancy. The epigenetic mechanisms by which some CGIs become methylated but others, in the same cell, remain unmethylated in these situations are poorly understood. Analyzing specific loci and using a genome-wide analysis, we show that transcription running across CGIs, associated with specific chromatin modifications, is required for DNA methyltransferase 3B (DNMT3B)-mediated DNA methylation of many naturally occurring intragenic CGIs. Importantly, we also show that a subgroup of intragenic CGIs is not sensitive to this process of transcription-mediated methylation and that this correlates with their individual intrinsic capacity to initiate transcription in vivo. We propose a general model of how transcription could act as a primary determinant of the patterns of CGI methylation in normal development and differentiation, and in human disease.
Assuntos
Diferenciação Celular/genética , Ilhas de CpG/genética , Metilação de DNA/genética , Transcrição Gênica/genética , Animais , Linhagem Celular , Epigênese Genética/genética , Genoma Humano/genética , Humanos , Camundongos , Regiões Promotoras Genéticas/genética , Análise de Sequência de DNA/métodosRESUMO
The expression profile of a gene is controlled by DNA sequences called cis-regulatory modules (CRMs). CRMs can function over large genomic distances and can be located many kilobases away from their target promoters. hes1 is a key developmental gene that is overexpressed in certain cancers and is a primary target of NOTCH signaling. Despite this, analysis of hes1 transcriptional control has been limited solely to its promoter. Here, we identify seven conserved DNA sequence blocks, representing the hes1 promoter and six novel CRMs, within 57 kb upstream of the mouse hes1 gene. We identify 12 binding sites for the RBP-Jκ NOTCH effector and a single M-CAT motif within these regions. We validate RBP-Jκ and TEAD family occupancy in cells in culture and test the response of each of these CRMs to active NOTCH. We show that two regions, CRM5 and CRM7, function as enhancers, and four can repress transcription. A pair of RBP-Jκ motifs arranged in a tail-tail configuration in CRM5 and the M-CAT motif in CRM7 are necessary for enhancer function. Furthermore, these enhancers are occupied by transcriptional co-activators and loop onto the hes1 promoter within the endogenous hes1 locus. This work demonstrates the power of combining computational genomics and experimental methodologies to identify novel CRMs and characterize their function.
Assuntos
Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Regulação da Expressão Gênica/genética , Proteínas de Homeodomínio/genética , Mioblastos/metabolismo , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Linhagem Celular , Biologia Computacional , Sequência Conservada/genética , Elementos Facilitadores Genéticos/genética , Proteína de Ligação a Sequências Sinal de Recombinação J de Imunoglobina/metabolismo , Camundongos , Mioblastos/citologia , Motivos de Nucleotídeos/genética , Regiões Promotoras Genéticas/genética , Receptores Notch/metabolismo , Transdução de Sinais , Fatores de Transcrição HES-1 , Sítio de Iniciação de TranscriçãoRESUMO
Mouse embryonic stem cells (mESCs) can be manipulated in vitro to recapitulate the process of erythropoiesis, during which multipotent cells undergo lineage specification, differentiation and maturation to produce erythroid cells. Although useful for identifying specific progenitors and precursors, this system has not been fully exploited as a source of cells to analyse erythropoiesis. Here, we establish a protocol in which characterised erythroblasts can be isolated in a scalable manner from differentiated embryoid bodies (EBs). Using transcriptional and epigenetic analysis, we demonstrate that this system faithfully recapitulates normal primitive erythropoiesis and fully reproduces the effects of natural and engineered mutations seen in primary cells obtained from mouse models. We anticipate this system to be of great value in reducing the time and costs of generating and maintaining mouse lines in a number of research scenarios.
Assuntos
Diferenciação Celular , Corpos Embrioides/metabolismo , Eritroblastos/metabolismo , Eritropoese , Modelos Biológicos , Células-Tronco Embrionárias Murinas/metabolismo , Animais , Linhagem Celular , Corpos Embrioides/citologia , Eritroblastos/citologia , Camundongos , Células-Tronco Embrionárias Murinas/citologiaRESUMO
The transcription factor RUNX1 is a critical regulator of developmental hematopoiesis and is frequently disrupted in leukemia. Runx1 is a large, complex gene that is expressed from two alternative promoters under the spatiotemporal control of multiple hematopoietic enhancers. To dissect the dynamic regulation of Runx1 in hematopoietic development, we analyzed its three-dimensional chromatin conformation in mouse embryonic stem cell (ESC) differentiation cultures. Runx1 resides in a 1.1 Mb topologically associating domain (TAD) demarcated by convergent CTCF motifs. As ESCs differentiate to mesoderm, chromatin accessibility, Runx1 enhancer-promoter (E-P) interactions, and CTCF-CTCF interactions increase in the TAD, along with initiation of Runx1 expression from the P2 promoter. Differentiation to hematopoietic progenitor cells is associated with the formation of tissue-specific sub-TADs over Runx1, a shift in E-P interactions, P1 promoter demethylation, and robust expression from both Runx1 promoters. Deletion of promoter-proximal CTCF sites at the sub-TAD boundaries has no obvious effects on E-P interactions but leads to partial loss of domain structure, mildly affects gene expression, and delays hematopoietic development. Together, our analysis of gene regulation at a large multi-promoter developmental gene reveals that dynamic sub-TAD chromatin boundaries play a role in establishing TAD structure and coordinated gene expression.
Assuntos
Cromatina/metabolismo , Subunidade alfa 2 de Fator de Ligação ao Core/genética , Subunidade alfa 2 de Fator de Ligação ao Core/metabolismo , Expressão Gênica , Animais , Proteínas de Ciclo Celular/metabolismo , Diferenciação Celular , DNA/química , Regulação da Expressão Gênica no Desenvolvimento , Células-Tronco Hematopoéticas/metabolismo , Mesoderma/metabolismo , Camundongos , Conformação de Ácido Nucleico , Regiões Promotoras GenéticasRESUMO
The genomic instructions used to regulate development are encoded within a set of functional DNA elements called cis-regulatory modules (CRMs). These elements determine the precise patterns of temporal and spatial gene expression. Here we summarize recent progress made towards cataloguing and characterizing the complete repertoire of CRMs. We describe CRMs as genomic information processing devices containing clusters of transcription factor binding sites and we position CRMs as nodes within large gene regulatory networks. We define CRM architecture and describe how these genomic elements process the information they encode to their target genes. Furthermore, we present an overview describing high-throughput techniques to identify CRMs genome wide and experimental methodologies to validate their function on a large scale. This review emphasizes the advantages and power of a systems biology approach which integrates computational and experimental technologies to further our understanding of CRM function.
Assuntos
Redes Reguladoras de Genes , Biologia de Sistemas/métodos , Animais , Biotecnologia , Variação Genética , Humanos , Modelos GenéticosRESUMO
The α- and ß-globin loci harbor developmentally expressed genes, which are silenced throughout post-natal life. Reactivation of these genes may offer therapeutic approaches for the hemoglobinopathies, the most common single gene disorders. Here, we address mechanisms regulating the embryonically expressed α-like globin, termed ζ-globin. We show that in embryonic erythroid cells, the ζ-gene lies within a ~65 kb sub-TAD (topologically associating domain) of open, acetylated chromatin and interacts with the α-globin super-enhancer. By contrast, in adult erythroid cells, the ζ-gene is packaged within a small (~10 kb) sub-domain of hypoacetylated, facultative heterochromatin within the acetylated sub-TAD and that it no longer interacts with its enhancers. The ζ-gene can be partially re-activated by acetylation and inhibition of histone de-acetylases. In addition to suggesting therapies for severe α-thalassemia, these findings illustrate the general principles by which reactivation of developmental genes may rescue abnormalities arising from mutations in their adult paralogues.
Assuntos
Regulação da Expressão Gênica no Desenvolvimento , Inativação Gênica , Ativação Transcricional , Globinas zeta/genética , Acetilação , Animais , Cromatina/metabolismo , Proteínas de Ligação a DNA/metabolismo , Elementos Facilitadores Genéticos , Células Eritroides/metabolismo , Regulação da Expressão Gênica no Desenvolvimento/efeitos dos fármacos , Inativação Gênica/efeitos dos fármacos , Inibidores de Histona Desacetilases/farmacologia , Humanos , Camundongos , Proteínas Repressoras/metabolismo , Fatores de Transcrição/metabolismo , Ativação Transcricional/efeitos dos fármacos , alfa-Globinas/genéticaRESUMO
The extraction of fluorescence time course data is a major bottleneck in high-throughput live-cell microscopy. Here we present an extendible framework based on the open-source image analysis software ImageJ, which aims in particular at analyzing the expression of fluorescent reporters through cell divisions. The ability to track individual cell lineages is essential for the analysis of gene regulatory factors involved in the control of cell fate and identity decisions. In our approach, cell nuclei are identified using Hoechst, and a characteristic drop in Hoechst fluorescence helps to detect dividing cells. We first compare the efficiency and accuracy of different segmentation methods and then present a statistical scoring algorithm for cell tracking, which draws on the combination of various features, such as nuclear intensity, area or shape, and importantly, dynamic changes thereof. Principal component analysis is used to determine the most significant features, and a global parameter search is performed to determine the weighting of individual features. Our algorithm has been optimized to cope with large cell movements, and we were able to semi-automatically extract cell trajectories across three cell generations. Based on the MTrackJ plugin for ImageJ, we have developed tools to efficiently validate tracks and manually correct them by connecting broken trajectories and reassigning falsely connected cell positions. A gold standard consisting of two time-series with 15,000 validated positions will be released as a valuable resource for benchmarking. We demonstrate how our method can be applied to analyze fluorescence distributions generated from mouse stem cells transfected with reporter constructs containing transcriptional control elements of the Msx1 gene, a regulator of pluripotency, in mother and daughter cells. Furthermore, we show by tracking zebrafish PAC2 cells expressing FUCCI cell cycle markers, our framework can be easily adapted to different cell types and fluorescent markers.