RESUMEN
Gene transcription occurs via a cycle of linked events, including initiation, promoter-proximal pausing, and elongation of RNA polymerase II (Pol II). A key question is how transcriptional enhancers influence these events to control gene expression. Here, we present an approach that evaluates the level and change in promoter-proximal transcription (initiation and pausing) in the context of differential gene expression, genome-wide. This combinatorial approach shows that in primary cells, control of gene expression during differentiation is achieved predominantly via changes in transcription initiation rather than via release of Pol II pausing. Using genetically engineered mouse models, deleted for functionally validated enhancers of the α- and ß-globin loci, we confirm that these elements regulate Pol II recruitment and/or initiation to modulate gene expression. Together, our data show that gene expression during differentiation is regulated predominantly at the level of initiation and that enhancers are key effectors of this process.
Asunto(s)
Elementos de Facilitación Genéticos , Regiones Promotoras Genéticas , ARN Polimerasa II/genética , Iniciación de la Transcripción Genética , Globinas alfa/genética , Globinas beta/genética , Animales , Diferenciación Celular , Exones , Feto , Regulación de la Expresión Génica , Biblioteca de Genes , Proteínas HSP70 de Choque Térmico/genética , Proteínas HSP70 de Choque Térmico/metabolismo , Humanos , Intrones , Células K562 , Hígado/citología , Hígado/metabolismo , Ratones , Ratones Noqueados , Proteínas Proto-Oncogénicas c-myc/genética , Proteínas Proto-Oncogénicas c-myc/metabolismo , ARN Polimerasa II/metabolismo , Transducción de Señal , Globinas alfa/deficiencia , Globinas beta/deficienciaRESUMEN
Dynamic cellular processes such as differentiation are driven by changes in the abundances of transcription factors (TFs). However, despite years of studies, our knowledge about the protein copy number of TFs in the nucleus is limited. Here, by determining the absolute abundances of 103 TFs and co-factors during the course of human erythropoiesis, we provide a dynamic and quantitative scale for TFs in the nucleus. Furthermore, we establish the first gene regulatory network of cell fate commitment that integrates temporal protein stoichiometry data with mRNA measurements. The model revealed quantitative imbalances in TFs' cross-antagonistic relationships that underlie lineage determination. Finally, we made the surprising discovery that, in the nucleus, co-repressors are dramatically more abundant than co-activators at the protein level, but not at the RNA level, with profound implications for understanding transcriptional regulation. These analyses provide a unique quantitative framework to understand transcriptional regulation of cell differentiation in a dynamic context.
Asunto(s)
Eritropoyesis/genética , Redes Reguladoras de Genes/genética , Factores de Transcripción/genética , Bases de Datos Factuales , Regulación de la Expresión Génica/genética , Hematopoyesis/genética , Humanos , Proteómica/métodos , Factores de Transcripción/análisis , Factores de Transcripción/metabolismoRESUMEN
Knowledge of locations and activities of cis-regulatory elements (CREs) is needed to decipher basic mechanisms of gene regulation and to understand the impact of genetic variants on complex traits. Previous studies identified candidate CREs (cCREs) using epigenetic features in one species, making comparisons difficult between species. In contrast, we conducted an interspecies study defining epigenetic states and identifying cCREs in blood cell types to generate regulatory maps that are comparable between species, using integrative modeling of eight epigenetic features jointly in human and mouse in our Validated Systematic Integration (VISION) Project. The resulting catalogs of cCREs are useful resources for further studies of gene regulation in blood cells, indicated by high overlap with known functional elements and strong enrichment for human genetic variants associated with blood cell phenotypes. The contribution of each epigenetic state in cCREs to gene regulation, inferred from a multivariate regression, was used to estimate epigenetic state regulatory potential (esRP) scores for each cCRE in each cell type, which were used to categorize dynamic changes in cCREs. Groups of cCREs displaying similar patterns of regulatory activity in human and mouse cell types, obtained by joint clustering on esRP scores, harbor distinctive transcription factor binding motifs that are similar between species. An interspecies comparison of cCREs revealed both conserved and species-specific patterns of epigenetic evolution. Finally, we show that comparisons of the epigenetic landscape between species can reveal elements with similar roles in regulation, even in the absence of genomic sequence alignment.
Asunto(s)
Epigénesis Genética , Epigenoma , Especificidad de la Especie , Animales , Ratones , Humanos , Células Sanguíneas/metabolismo , Secuencias Reguladoras de Ácidos Nucleicos , Regulación de la Expresión Génica , Epigenómica/métodosRESUMEN
Neurocristopathies such as CHARGE syndrome result from aberrant neural crest development. A large proportion of CHARGE cases are attributed to pathogenic variants in the gene encoding CHD7, chromodomain helicase DNA binding protein 7, which remodels chromatin. While the role for CHD7 in neural crest development is well documented, how this factor is specifically up-regulated in neural crest cells is not understood. Here, we use epigenomic profiling of chick and human neural crest to identify a cohort of enhancers regulating Chd7 expression in neural crest cells and other tissues. We functionally validate upstream transcription factor binding at candidate enhancers, revealing novel epistatic relationships between neural crest master regulators and Chd7, showing tissue-specific regulation of a globally acting chromatin remodeller. Furthermore, we find conserved enhancer features in human embryonic epigenomic data and validate the activity of the human equivalent CHD7 enhancers in the chick embryo. Our findings embed Chd7 in the neural crest gene regulatory network and offer potentially clinically relevant elements for interpreting CHARGE syndrome cases without causative allocation.
Asunto(s)
Proteínas de Unión al ADN , Regulación del Desarrollo de la Expresión Génica , Cresta Neural , Factores de Transcripción , Cresta Neural/metabolismo , Cresta Neural/embriología , Animales , Humanos , Embrión de Pollo , Proteínas de Unión al ADN/metabolismo , Proteínas de Unión al ADN/genética , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , ADN Helicasas/metabolismo , ADN Helicasas/genética , Ensamble y Desensamble de Cromatina/genética , Elementos de Facilitación Genéticos/genética , Síndrome CHARGE/genética , Síndrome CHARGE/metabolismo , Redes Reguladoras de Genes , Especificidad de Órganos/genéticaRESUMEN
In higher eukaryotes, many genes are regulated by enhancers that are 104-106 base pairs (bp) away from the promoter. Enhancers contain transcription-factor-binding sites (which are typically around 7-22 bp), and physical contact between the promoters and enhancers is thought to be required to modulate gene expression. Although chromatin architecture has been mapped extensively at resolutions of 1 kilobase and above; it has not been possible to define physical contacts at the scale of the proteins that determine gene expression. Here we define these interactions in detail using a chromosome conformation capture method (Micro-Capture-C) that enables the physical contacts between different classes of regulatory elements to be determined at base-pair resolution. We find that highly punctate contacts occur between enhancers, promoters and CCCTC-binding factor (CTCF) sites and we show that transcription factors have an important role in the maintenance of the contacts between enhancers and promoters. Our data show that interactions between CTCF sites are increased when active promoters and enhancers are located within the intervening chromatin. This supports a model in which chromatin loop extrusion1 is dependent on cohesin loading at active promoters and enhancers, which explains the formation of tissue-specific chromatin domains without changes in CTCF binding.
Asunto(s)
Emparejamiento Base/genética , Genoma/genética , Animales , Sitios de Unión , Factor de Unión a CCCTC/metabolismo , Proteínas de Ciclo Celular/metabolismo , Células Cultivadas , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Proteínas Cromosómicas no Histona/metabolismo , Elementos de Facilitación Genéticos/genética , Células Eritroides/citología , Células Eritroides/metabolismo , Regulación de la Expresión Génica , Ratones , Ratones Endogámicos C57BL , Especificidad de Órganos , Regiones Promotoras Genéticas/genética , Globinas alfa/genética , CohesinasRESUMEN
Oncogenes can be activated in cis through multiple mechanisms including enhancer hijacking events and noncoding mutations that create enhancers or promoters de novo. These paradigms have helped parse somatic variation of noncoding cancer genomes, thereby providing a rationale to identify noncanonical mechanisms of gene activation. Here we describe a novel mechanism of oncogene activation whereby focal copy number loss of an intronic element within the FTO gene leads to aberrant expression of IRX3, an oncogene in T cell acute lymphoblastic leukemia (T-ALL). Loss of this CTCF bound element downstream to IRX3 (+224 kb) leads to enhancer hijack of an upstream developmentally active super-enhancer of the CRNDE long noncoding RNA (-644 kb). Unexpectedly, the CRNDE super-enhancer interacts with the IRX3 promoter with no transcriptional output until it is untethered from the FTO intronic site. We propose that 'promoter tethering' of oncogenes to inert regions of the genome is a previously unappreciated biological mechanism preventing tumorigenesis.
RESUMEN
Deciphering the process by which hundreds of distinct cell types emerge from a single zygote to form a complex multicellular organism remains one of the greatest challenges in biological research. Enhancers are known to be central to cell type-specific gene expression, yet many questions regarding how these genomic elements interact both temporally and spatially with other cis- and trans-acting factors to control transcriptional activity during differentiation and development remain unanswered. Here, we review our current understanding of the role of enhancers and their interactions in this context and highlight recent progress achieved with experimental methods of unprecedented resolution.
Asunto(s)
Elementos de Facilitación Genéticos , Transactivadores , Diferenciación Celular/genética , Regiones Promotoras Genéticas , Transactivadores/genéticaRESUMEN
The successful development and ongoing functioning of complex organisms depend on the faithful execution of the genetic code. A critical step in this process is the correct spatial and temporal expression of genes. The highly orchestrated transcription of genes is controlled primarily by cis-regulatory elements: promoters, enhancers, and insulators. The medical importance of this key biological process can be seen by the frequency with which mutations and inherited variants that alter cis-regulatory elements lead to monogenic and complex diseases and cancer. Here, we provide an overview of the methods available to characterize and perturb gene regulatory circuits. We then highlight mechanisms through which regulatory rewiring contributes to disease, and conclude with a perspective on how our understanding of gene regulation can be used to improve human health.
Asunto(s)
Regulación de la Expresión Génica , Redes Reguladoras de Genes , Elementos de Facilitación Genéticos , Humanos , Mutación , Regiones Promotoras GenéticasRESUMEN
ATRX is an X-linked gene of the SWI/SNF family, mutations in which cause syndromal mental retardation and downregulation of α-globin expression. Here we show that ATRX binds to tandem repeat (TR) sequences in both telomeres and euchromatin. Genes associated with these TRs can be dysregulated when ATRX is mutated, and the change in expression is determined by the size of the TR, producing skewed allelic expression. This reveals the characteristics of the affected genes, explains the variable phenotypes seen with identical ATRX mutations, and illustrates a new mechanism underlying variable penetrance. Many of the TRs are G rich and predicted to form non-B DNA structures (including G-quadruplex) in vivo. We show that ATRX binds G-quadruplex structures in vitro, suggesting a mechanism by which ATRX may play a role in various nuclear processes and how this is perturbed when ATRX is mutated.
Asunto(s)
ADN Helicasas/metabolismo , Proteínas Nucleares/metabolismo , Animales , Células Cultivadas , Inmunoprecipitación de Cromatina , Cromosomas de los Mamíferos/metabolismo , Islas de CpG , ADN Helicasas/genética , ADN Ribosómico/metabolismo , G-Cuádruplex , Expresión Génica , Estudio de Asociación del Genoma Completo , Histonas/metabolismo , Humanos , Ratones , Repeticiones de Minisatélite , Mutación , Proteínas Nucleares/genética , Telómero/metabolismo , Proteína Nuclear Ligada al Cromosoma XRESUMEN
Central nervous system-expressed long non-coding RNAs (lncRNAs) are often located in the genome close to protein coding genes involved in transcriptional control. Such lncRNA-protein coding gene pairs are frequently temporally and spatially co-expressed in the nervous system and are predicted to act together to regulate neuronal development and function. Although some of these lncRNAs also bind and modulate the activity of the encoded transcription factors, the regulatory mechanisms controlling co-expression of neighbouring lncRNA-protein coding genes remain unclear. Here, we used high resolution NG Capture-C to map the cis-regulatory interaction landscape of the key neuro-developmental Paupar-Pax6 lncRNA-mRNA locus. The results define chromatin architecture changes associated with high Paupar-Pax6 expression in neurons and identify both promoter selective as well as shared cis-regulatory-promoter interactions involved in regulating Paupar-Pax6 co-expression. We discovered that the TCF7L2 transcription factor, a regulator of chromatin architecture and major effector of the Wnt signalling pathway, binds to a subset of these candidate cis-regulatory elements to coordinate Paupar and Pax6 co-expression. We describe distinct roles for Paupar in Pax6 expression control and show that the Paupar DNA locus contains a TCF7L2 bound transcriptional silencer whilst the Paupar transcript can act as an activator of Pax6. Our work provides important insights into the chromatin interactions, signalling pathways and transcription factors controlling co-expression of adjacent lncRNAs and protein coding genes in the brain.
Asunto(s)
ARN Largo no Codificante , Cromatina/genética , Neuronas/metabolismo , Factor de Transcripción PAX6/genética , Factor de Transcripción PAX6/metabolismo , ARN Largo no Codificante/genética , ARN Largo no Codificante/metabolismo , Factores de Transcripción/genéticaRESUMEN
Chromatin structure is tightly intertwined with transcription regulation. Here we compared the chromosomal architectures of fetal and adult human erythroblasts and found that, globally, chromatin structures and compartments A/B are highly similar at both developmental stages. At a finer scale, we detected distinct folding patterns at the developmentally controlled ß-globin locus. Specifically, new fetal stage-specific contacts were uncovered between a region separating the fetal (γ) and adult (δ and ß) globin genes (encompassing the HBBP1 and BGLT3 noncoding genes) and two distal chromosomal sites (HS5 and 3'HS1) that flank the locus. In contrast, in adult cells, the HBBP1-BGLT3 region contacts the embryonic ε-globin gene, physically separating the fetal globin genes from the enhancer (locus control region [LCR]). Deletion of the HBBP1 region in adult cells alters contact landscapes in ways more closely resembling those of fetal cells, including increased LCR-γ-globin contacts. These changes are accompanied by strong increases in γ-globin transcription. Notably, the effects of HBBP1 removal on chromatin architecture and gene expression closely mimic those of deleting the fetal globin repressor BCL11A, implicating BCL11A in the function of the HBBP1 region. Our results uncover a new critical regulatory region as a potential target for therapeutic genome editing for hemoglobinopathies and highlight the power of chromosome conformation analysis in discovering new cis control elements.
Asunto(s)
Cromatina/química , Eritroblastos/metabolismo , Regulación del Desarrollo de la Expresión Génica , Elementos Reguladores de la Transcripción , Globinas beta/genética , Adulto , Proteínas Portadoras/genética , Feto , Silenciador del Gen , Humanos , Región de Control de Posición , Proteínas Nucleares/genética , Seudogenes , Proteínas Represoras , Transcriptoma , gamma-Globinas/genéticaRESUMEN
Long non-coding (lnc) RNAs can regulate gene expression and protein functions. However, the proportion of lncRNAs with biological activities among the thousands expressed in mammalian cells is controversial. We studied Lockd (lncRNA downstream of Cdkn1b), a 434-nt polyadenylated lncRNA originating 4 kb 3' to the Cdkn1b gene. Deletion of the 25-kb Lockd locus reduced Cdkn1b transcription by approximately 70% in an erythroid cell line. In contrast, homozygous insertion of a polyadenylation cassette 80 bp downstream of the Lockd transcription start site reduced the entire lncRNA transcript level by >90% with no effect on Cdkn1b transcription. The Lockd promoter contains a DNase-hypersensitive site, binds numerous transcription factors, and physically associates with the Cdkn1b promoter in chromosomal conformation capture studies. Therefore, the Lockd gene positively regulates Cdkn1b transcription through an enhancer-like cis element, whereas the lncRNA itself is dispensable, which may be the case for other lncRNAs.
Asunto(s)
Inhibidor p27 de las Quinasas Dependientes de la Ciclina/genética , Elementos de Facilitación Genéticos , ARN Largo no Codificante/genética , Animales , Línea Celular , Regulación de la Expresión Génica , Ratones , Poli A/metabolismo , Regiones Promotoras Genéticas , Transcripción GenéticaRESUMEN
Predicting the impact of noncoding genetic variation requires interpreting it in the context of three-dimensional genome architecture. We have developed deepC, a transfer-learning-based deep neural network that accurately predicts genome folding from megabase-scale DNA sequence. DeepC predicts domain boundaries at high resolution, learns the sequence determinants of genome folding and predicts the impact of both large-scale structural and single base-pair variations.
Asunto(s)
Genoma Humano/genética , Genómica/métodos , Modelos Genéticos , Redes Neurales de la Computación , Secuencia de Bases , Factor de Unión a CCCTC/genética , Cromatina/genética , Simulación por Computador , Variación Estructural del Genoma , HumanosRESUMEN
MOTIVATION: Genome sequencing experiments have revolutionized molecular biology by allowing researchers to identify important DNA-encoded elements genome wide. Regions where these elements are found appear as peaks in the analog signal of an assay's coverage track, and despite the ease with which humans can visually categorize these patterns, the size of many genomes necessitates algorithmic implementations. Commonly used methods focus on statistical tests to classify peaks, discounting that the background signal does not completely follow any known probability distribution and reducing the information-dense peak shapes to simply maximum height. Deep learning has been shown to be highly accurate for many pattern recognition tasks, on par or even exceeding human capabilities, providing an opportunity to reimagine and improve peak calling. RESULTS: We present the peak calling framework LanceOtron, which combines deep learning for recognizing peak shape with multifaceted enrichment calculations for assessing significance. In benchmarking ATAC-seq, ChIP-seq and DNase-seq, LanceOtron outperforms long-standing, gold-standard peak callers through its improved selectivity and near-perfect sensitivity. AVAILABILITY AND IMPLEMENTATION: A fully featured web application is freely available from LanceOtron.molbiol.ox.ac.uk, command line interface via python is pip installable from PyPI at https://pypi.org/project/lanceotron/, and source code and benchmarking tests are available at https://github.com/LHentges/LanceOtron. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Aprendizaje Profundo , Humanos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Secuenciación de Inmunoprecipitación de Cromatina , Secuencia de Bases , Secuenciación de Nucleótidos de Alto Rendimiento/métodosRESUMEN
The ubiquitous family of dimeric transcription factors AP-1 is made up of Fos and Jun family proteins. It has long been thought to operate principally at gene promoters and how it controls transcription is still ill-understood. The Fos family protein Fra-1 is overexpressed in triple negative breast cancers (TNBCs) where it contributes to tumor aggressiveness. To address its transcriptional actions in TNBCs, we combined transcriptomics, ChIP-seqs, machine learning and NG Capture-C. Additionally, we studied its Fos family kin Fra-2 also expressed in TNBCs, albeit much less. Consistently with their pleiotropic effects, Fra-1 and Fra-2 up- and downregulate individually, together or redundantly many genes associated with a wide range of biological processes. Target gene regulation is principally due to binding of Fra-1 and Fra-2 at regulatory elements located distantly from cognate promoters where Fra-1 modulates the recruitment of the transcriptional co-regulator p300/CBP and where differences in AP-1 variant motif recognition can underlie preferential Fra-1- or Fra-2 bindings. Our work also shows no major role for Fra-1 in chromatin architecture control at target gene loci, but suggests collaboration between Fra-1-bound and -unbound enhancers within chromatin hubs sometimes including promoters for other Fra-1-regulated genes. Our work impacts our view of AP-1.
Asunto(s)
Elementos de Facilitación Genéticos , Regulación Neoplásica de la Expresión Génica , Proteínas Proto-Oncogénicas c-fos/metabolismo , Neoplasias de la Mama Triple Negativas/genética , Sitios de Unión , Línea Celular Tumoral , Cromatina/química , Cromatina/metabolismo , Epigénesis Genética , Antígeno 2 Relacionado con Fos/metabolismo , Humanos , Motivos de Nucleótidos , Regiones Promotoras Genéticas , Proteínas Proto-Oncogénicas c-fos/fisiología , Factor de Transcripción AP-1/metabolismo , Neoplasias de la Mama Triple Negativas/metabolismo , Factores de Transcripción p300-CBP/metabolismoRESUMEN
BACKGROUND: Congenital dyserythropoietic anaemia type I (CDA-I) is a hereditary anaemia caused by biallelic mutations in the widely expressed genes CDAN1 and C15orf41. Little is understood about either protein and it is unclear in which cellular pathways they participate. METHODS: Genetic analysis of a cohort of patients with CDA-I identifies novel pathogenic variants in both known causative genes. We analyse the mutation distribution and the predicted structural positioning of amino acids affected in Codanin-1, the protein encoded by CDAN1. Using western blotting, immunoprecipitation and immunofluorescence, we determine the effect of particular mutations on both proteins and interrogate protein interaction, stability and subcellular localisation. RESULTS: We identify six novel CDAN1 mutations and one novel mutation in C15orf41 and uncover evidence of further genetic heterogeneity in CDA-I. Additionally, population genetics suggests that CDA-I is more common than currently predicted. Mutations are enriched in six clusters in Codanin-1 and tend to affect buried residues. Many missense and in-frame mutations do not destabilise the entire protein. Rather C15orf41 relies on Codanin-1 for stability and both proteins, which are enriched in the nucleolus, interact to form an obligate complex in cells. CONCLUSION: Stability and interaction data suggest that C15orf41 may be the key determinant of CDA-I and offer insight into the mechanism underlying this disease. Both proteins share a common pathway likely to be present in a wide variety of cell types; however, nucleolar enrichment may provide a clue as to the erythroid specific nature of CDA-I. The surprisingly high predicted incidence of CDA-I suggests that better ascertainment would lead to improved patient care.
Asunto(s)
Anemia Diseritropoyética Congénita/genética , Predisposición Genética a la Enfermedad , Glicoproteínas/genética , Proteínas Nucleares/genética , Factores de Transcripción/genética , Anemia Diseritropoyética Congénita/patología , Femenino , Regulación de la Expresión Génica/genética , Pruebas Genéticas , Genética de Población , Humanos , Masculino , Complejos Multiproteicos/genética , Mutación/genéticaRESUMEN
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome.
Asunto(s)
Huella de ADN/métodos , Desoxirribonucleasas/química , Células Eritroides/metabolismo , Motivos de Nucleótidos , Polimorfismo de Nucleótido Simple , Elementos de Respuesta , Análisis de Secuencia de ADN/métodos , Factores de Transcripción/metabolismo , Humanos , Valor Predictivo de las PruebasRESUMEN
The T-box transcription factor (TF) Eomes is a key regulator of cell fate decisions during early mouse development. The cis-acting regulatory elements that direct expression in the anterior visceral endoderm (AVE), primitive streak (PS) and definitive endoderm (DE) have yet to be defined. Here, we identified three gene-proximal enhancer-like sequences (PSE_a, PSE_b and VPE) that faithfully activate tissue-specific expression in transgenic embryos. However, targeted deletion experiments demonstrate that PSE_a and PSE_b are dispensable, and only VPE is required for optimal Eomes expression in vivo Embryos lacking this enhancer display variably penetrant defects in anterior-posterior axis orientation and DE formation. Chromosome conformation capture experiments reveal VPE-promoter interactions in embryonic stem cells (ESCs), prior to gene activation. The locus resides in a large (500â kb) pre-formed compartment in ESCs and activation during DE differentiation occurs in the absence of 3D structural changes. ATAC-seq analysis reveals that VPE, PSE_a and four additional putative enhancers display increased chromatin accessibility in DE that is associated with Smad2/3 binding coincident with transcriptional activation. By contrast, activation of the Eomes target genes Foxa2 and Lhx1 is associated with higher order chromatin reorganisation. Thus, diverse regulatory mechanisms govern activation of lineage specifying TFs during early development.
Asunto(s)
Embrión de Mamíferos/metabolismo , Regulación del Desarrollo de la Expresión Génica , Secuencias Reguladoras de Ácidos Nucleicos/genética , Proteínas de Dominio T Box/genética , Animales , Diferenciación Celular/genética , Cromatina/metabolismo , Endodermo/metabolismo , Elementos de Facilitación Genéticos , Femenino , Factores de Transcripción Forkhead/metabolismo , Gastrulación/genética , Eliminación de Gen , Marcación de Gen , Genes Reporteros , Genotipo , Ratones Endogámicos C57BL , Modelos Biológicos , Proteínas del Grupo Polycomb/metabolismo , Transducción de Señal/genética , Proteína Smad2/metabolismo , Proteínas de Dominio T Box/metabolismoRESUMEN
Chromosome conformation capture (3C) methods are central to understanding the link between nuclear structure and function, and the physical interactions between distal regulatory elements and promoters. However, no one method is appropriate to address all biological questions, as each variant differs markedly in resolution, reproducibility, throughput and biases. A thorough appreciation of the strengths and weaknesses of each technique is critical when choosing the correct method for a specific application or for gauging how best to interpret different sources of data. In addition, the analysis method must be carefully considered, as this choice can profoundly affect the output. In this Review, we describe and compare the different available 3C-based approaches, with a focus on the analysis of mammalian genomes.
Asunto(s)
Cromosomas , Técnicas Genéticas , Animales , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Mapeo Cromosómico , Cromosomas/química , Cromosomas/genética , Biblioteca de Genes , Ensayos Analíticos de Alto Rendimiento/métodos , Humanos , Hibridación Fluorescente in Situ , Células K562 , Ratones , Reacción en Cadena de la Polimerasa/métodos , Factores de Transcripción SOXB1/genética , Globinas alfa/genéticaRESUMEN
A substantial amount of organismal complexity is thought to be encoded by enhancers which specify the location, timing, and levels of gene expression. In mammals there are more enhancers than promoters which are distributed both between and within genes. Here we show that activated, intragenic enhancers frequently act as alternative tissue-specific promoters producing a class of abundant, spliced, multiexonic poly(A)(+) RNAs (meRNAs) which reflect the host gene's structure. meRNAs make a substantial and unanticipated contribution to the complexity of the transcriptome, appearing as alternative isoforms of the host gene. The low protein-coding potential of meRNAs suggests that many meRNAs may be byproducts of enhancer activation or underlie as-yet-unidentified RNA-encoded functions. Distinguishing between meRNAs and mRNAs will transform our interpretation of dynamic changes in transcription both at the level of individual genes and of the genome as a whole.