ABSTRACT
Previous studies have identified topologically associating domains (TADs) as basic units of genome organization. We present evidence of a previously unreported level of genome folding, where distant TAD pairs, megabases apart, interact to form meta-domains. Within meta-domains, gene promoters and structural intergenic elements present in distant TADs are specifically paired. The associated genes encode neuronal determinants, including those engaged in axonal guidance and adhesion. These long-range associations occur in a large fraction of neurons but support transcription in only a subset of neurons. Meta-domains are formed by diverse transcription factors that are able to pair over long and flexible distances. We present evidence that two such factors, GAF and CTCF, play direct roles in this process. The relative simplicity of higher-order meta-domain interactions in Drosophila, compared with those previously described in mammals, allowed the demonstration that genomes can fold into highly specialized cell-type-specific scaffolds that enable megabase-scale regulatory associations.
Subject(s)
Chromosomes, Insect , Drosophila , Animals , Chromatin/genetics , DNA Packaging , Drosophila/genetics , Mammals/genetics , Neurogenesis , Neurons , Transcription Factors , Drosophila Proteins , Genome, Insect , Gene Expression RegulationABSTRACT
Chromatin loops between gene pairs have been observed in diverse contexts in both flies and vertebrates. Combining high-resolution Capture-C, DNA fluorescence in situ hybridization, and genetic perturbations, we dissect the functional role of three loops between genes with related function during Drosophila embryogenesis. By mutating the loop anchor (but not the gene) or the gene (but not loop anchor), we disentangle loop formation and gene expression and show that the 3D proximity of paralogous gene loci supports their co-regulation. Breaking the loop leads to either an attenuation or enhancement of expression and perturbs their relative levels of expression and cross-regulation. Although many loops appear constitutive across embryogenesis, their function can change in different developmental contexts. Taken together, our results indicate that chromatin gene-gene loops act as architectural scaffolds that can be used in different ways in different contexts to fine-tune the coordinated expression of genes with related functions and sustain their cross-regulation.
Subject(s)
Chromatin , Chromosomes , Animals , In Situ Hybridization, Fluorescence , Chromatin/genetics , Drosophila/geneticsABSTRACT
BACKGROUND: Microglia (MG) are myeloid cells of the central nervous system that support homeostasis and instigate neuroinflammation in pathologies. Single-cell RNA sequencing (scRNA-seq) revealed the functional heterogeneity of MG in mouse brains. Microglia are self-renewing cells and inhibition of colony-stimulating factor 1 receptor (CSF1R) signaling depletes microglia which rapidly repopulate. The functions of repopulated microglia are poorly known. METHODS: We combined scRNA-seq, bulk RNA-seq, immunofluorescence, and confocal imaging to study the functionalities and morphology of repopulated microglia. RESULTS: A CSRF1R inhibitor (BLZ-945) depleted microglia within 21 days and a number of microglia was fully restored within 7 days, as confirmed by TMEM119 staining and flow cytometry. ScRNA-seq and computational analyses demonstrate that repopulated microglia originated from preexisting progenitors and reconstituted functional clusters but upregulated inflammatory genes. Percentages of proliferating, immature microglia displaying inflammatory gene expression increased in aging mice. Morphometric analysis of MG cell body and branching revealed a distinct morphology of repopulated MG, particularly in brains of old mice. We demonstrate that with aging some repopulated MG fail to reach the homeostatic phenotype. These differences may contribute to the deterioration of MG protective functions with age.
Subject(s)
Aging , Mice, Inbred C57BL , Microglia , Animals , Microglia/metabolism , Mice , Aging/physiology , Brain/cytology , Brain/metabolism , Male , Receptors, Granulocyte-Macrophage Colony-Stimulating Factor/metabolism , Receptors, Granulocyte-Macrophage Colony-Stimulating Factor/genetics , Single-Cell AnalysisABSTRACT
Herein, we report on a large Polish family presenting with a classical triphalangeal thumb-polysyndactyly syndrome (TPT-PS). This rare congenital limb anomaly is generally caused by microduplications encompassing the Sonic Hedgehog (SHH) limb enhancer, termed the zone of polarizing activity (ZPA) regulatory sequence (ZRS). Recently, a pathogenic variant in the pre-ZRS (pZRS), a conserved sequence located near the ZRS, has been described in a TPT-PS Dutch family. We performed targeted ZRS sequencing, array comparative genomic hybridization, and whole-exome sequencing. Next, we sequenced the recently described pZRS region. Finally, we performed a circular chromatin conformation capture-sequencing (4C-seq) assay on skin fibroblasts of one affected family member and control samples to examine potential alterations in the SHH regulatory domain and functionally characterize the identified variant. We found that all affected individuals shared a recently identified pathogenic point mutation in the pZRS region: NC_000007.14:g.156792782C>G (GRCh38/hg38), which is the same as in the Dutch family. The results of 4C-seq experiments revealed increased interactions within the whole SHH regulatory domain (SHH-LMBR1 TAD) in the patient compared to controls. Our study expands the number of TPT-PS families carrying a pathogenic alteration of the pZRS and underlines the importance of routine pZRS sequencing in the genetic diagnostics of patients with TPT-PS or similar phenotypes. The pathogenic mutation causative for TPT-PS in our patient gave rise to increased interactions within the SHH regulatory domain in yet unknown mechanism.
Subject(s)
Congenital Abnormalities , Hedgehog Proteins , Mandibulofacial Dysostosis , Polydactyly , Comparative Genomic Hybridization , Congenital Abnormalities/genetics , Enhancer Elements, Genetic , Hedgehog Proteins/genetics , Humans , Mandibulofacial Dysostosis/genetics , Mutation , Pedigree , ThumbABSTRACT
FOXA1 is a transcription factor capable to bind silenced chromatin to direct context-dependent cell fate conversion. Here, we demonstrate that a compact palindromic DNA element (termed 'DIV' for its diverging half-sites) induces the homodimerization of FOXA1 with strongly positive cooperativity. Alternative structural models are consistent with either an indirect DNA-mediated cooperativity or a direct protein-protein interaction. The cooperative homodimer formation is strictly constrained by precise half-site spacing. Re-analysis of chromatin immunoprecipitation sequencing data indicates that the DIV is effectively targeted by FOXA1 in the context of chromatin. Reporter assays show that FOXA1-dependent transcriptional activity declines when homodimeric binding is disrupted. In response to phosphatidylinositol-3 kinase inhibition DIV sites pre-bound by FOXA1 such as at the PVT1/MYC locus exhibit a strong increase in accessibility suggesting a role of the DIV configuration in the chromatin closed-open dynamics. Moreover, several disease-associated single nucleotide polymorphisms map to DIV elements and show allelic differences in FOXA1 homodimerization, reporter gene expression and are annotated as quantitative trait loci. This includes the rs541455835 variant at the MAPT locus encoding the Tau protein associated with Parkinson's disease. Collectively, the DIV guides chromatin engagement and regulation by FOXA1 and its perturbation could be linked to disease etiologies.
Subject(s)
DNA/genetics , Enhancer Elements, Genetic/genetics , Gene Expression Regulation/genetics , Hepatocyte Nuclear Factor 3-alpha/metabolism , Inverted Repeat Sequences/genetics , Cell Line, Tumor , Chromatin/metabolism , Dimerization , HCT116 Cells , Humans , MCF-7 Cells , Phosphoinositide-3 Kinase Inhibitors , Polymorphism, Single Nucleotide/genetics , Quantitative Trait Loci/genetics , Thiazoles/pharmacologyABSTRACT
MOTIVATION: Computational prediction of transcription factor (TF) binding sites in the genome remains a challenging task. Here, we present Romulus, a novel computational method for identifying individual TF binding sites from genome sequence information and cell-type-specific experimental data, such as DNase-seq. It combines the strengths of previous approaches, and improves robustness by reducing the number of free parameters in the model by an order of magnitude. RESULTS: We show that Romulus significantly outperforms existing methods across three sources of DNase-seq data, by assessing the performance of these tools against ChIP-seq profiles. The difference was particularly significant when applied to binding site prediction for low-information-content motifs. Our method is capable of inferring multiple binding modes for a single TF, which differ in their DNase I cut profile. Finally, using the model learned by Romulus and ChIP-seq data, we introduce Binding in Closed Chromatin (BCC) as a quantitative measure of TF pioneer factor activity. Uniquely, our measure quantifies a defining feature of pioneer factors, namely their ability to bind closed chromatin. AVAILABILITY AND IMPLEMENTATION: Romulus is freely available as an R package at http://github.com/ajank/Romulus CONTACT: ajank@mimuw.edu.pl SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subject(s)
Binding Sites , Computational Biology/methods , Protein Binding , Transcription Factors , Chromatin , Chromatin Immunoprecipitation , Sequence Analysis, DNAABSTRACT
The binding of transcription factors (TFs) to their specific motifs in genomic regulatory regions is commonly studied in isolation. However, in order to elucidate the mechanisms of transcriptional regulation, it is essential to determine which TFs bind DNA cooperatively as dimers and to infer the precise nature of these interactions. So far, only a small number of such dimeric complexes are known. Here, we present an algorithm for predicting cell-type-specific TF-TF dimerization on DNA on a large scale, using DNase I hypersensitivity data from 78 human cell lines. We represented the universe of possible TF complexes by their corresponding motif complexes, and analyzed their occurrence at cell-type-specific DNase I hypersensitive sites. Based on â¼1.4 billion tests for motif complex enrichment, we predicted 603 highly significant cell-type-specific TF dimers, the vast majority of which are novel. Our predictions included 76% (19/25) of the known dimeric complexes and showed significant overlap with an experimental database of protein-protein interactions. They were also independently supported by evolutionary conservation, as well as quantitative variation in DNase I digestion patterns. Notably, the known and predicted TF dimers were almost always highly compact and rigidly spaced, suggesting that TFs dimerize in close proximity to their partners, which results in strict constraints on the structure of the DNA-bound complex. Overall, our results indicate that chromatin openness profiles are highly predictive of cell-type-specific TF-TF interactions. Moreover, cooperative TF dimerization seems to be a widespread phenomenon, with multiple TF complexes predicted in most cell types.
Subject(s)
Hepatocyte Nuclear Factor 3-alpha/metabolism , Models, Biological , Algorithms , Base Sequence , Binding Sites , Cell Line, Tumor , Cluster Analysis , Computer Simulation , Consensus Sequence , DNA Cleavage , Deoxyribonuclease I/chemistry , Evolution, Molecular , Humans , Protein Binding , Protein Interaction Mapping , Protein Interaction Maps , Protein Multimerization , Transcription Factors/metabolismABSTRACT
BACKGROUND: Cooperative binding of transcription factor (TF) dimers to DNA is increasingly recognized as a major contributor to binding specificity. However, it is likely that the set of known TF dimers is highly incomplete, given that they were discovered using ad hoc approaches, or through computational analyses of limited datasets. RESULTS: Here, we present TACO (Transcription factor Association from Complex Overrepresentation), a general-purpose standalone software tool that takes as input any genome-wide set of regulatory elements and predicts cell-type-specific TF dimers based on enrichment of motif complexes. TACO is the first tool that can accommodate motif complexes composed of overlapping motifs, a characteristic feature of many known TF dimers. Our method comprehensively outperforms existing tools when benchmarked on a reference set of 29 known dimers. We demonstrate the utility and consistency of TACO by applying it to 152 DNase-seq datasets and 94 ChIP-seq datasets. CONCLUSIONS: Based on these results, we uncover a general principle governing the structure of TF-TF-DNA ternary complexes, namely that the flexibility of the complex is correlated with, and most likely a consequence of, inter-motif spacing.
Subject(s)
Algorithms , Software , Binding Sites , Chromatin Immunoprecipitation , DNA/chemistry , DNA/metabolism , Dimerization , Protein Binding , Sequence Analysis, DNA , Transcription Factors/chemistry , Transcription Factors/metabolismABSTRACT
To regulate expression, enhancers must come in proximity to their target gene. However, the relationship between the timing of enhancer-promoter (E-P) proximity and activity remains unclear, with examples of uncoupled, anticorrelated and correlated interactions. To assess this, we selected 600 characterized enhancers or promoters with tissue-specific activity in Drosophila embryos and performed Capture-C in FACS-purified myogenic or neurogenic cells during specification and tissue differentiation. This enabled direct comparison between E-P proximity and activity transitioning from OFF-to-ON and ON-to-OFF states across developmental conditions. This showed remarkably similar E-P topologies between specified muscle and neuronal cells, which are uncoupled from activity. During tissue differentiation, many new distal interactions emerge where changes in E-P proximity reflect changes in activity. The mode of E-P regulation therefore appears to change as embryogenesis proceeds, from largely permissive topologies during cell-fate specification to more instructive regulation during terminal tissue differentiation, when E-P proximity is coupled to activation.
Subject(s)
Enhancer Elements, Genetic , Gene Expression Regulation, Developmental , Animals , Enhancer Elements, Genetic/genetics , Gene Expression Regulation, Developmental/genetics , Promoter Regions, Genetic/genetics , Drosophila/genetics , Cell Differentiation/geneticsABSTRACT
Chromatin topology is intricately linked to gene expression, yet its functional requirement remains unclear. Here, we comprehensively assessed the interplay between genome topology and gene expression using highly rearranged chromosomes (balancers) spanning ~75% of the Drosophila genome. Using transheterozyte (balancer/wild-type) embryos, we measured allele-specific changes in topology and gene expression in cis, while minimizing trans effects. Through genome sequencing, we resolved eight large nested inversions, smaller inversions, duplications and thousands of deletions. These extensive rearrangements caused many changes to chromatin topology, disrupting long-range loops, topologically associating domains (TADs) and promoter interactions, yet these are not predictive of changes in expression. Gene expression is generally not altered around inversion breakpoints, indicating that mis-appropriate enhancer-promoter activation is a rare event. Similarly, shuffling or fusing TADs, changing intra-TAD connections and disrupting long-range inter-TAD loops does not alter expression for the majority of genes. Our results suggest that properties other than chromatin topology ensure productive enhancer-promoter interactions.
Subject(s)
Chromatin/genetics , Chromosomes, Insect/genetics , Drosophila melanogaster/genetics , Enhancer Elements, Genetic , Gene Expression Regulation , Gene Rearrangement , Genome, Insect , Animals , Chromosome Mapping , Female , Male , Promoter Regions, GeneticABSTRACT
BACKGROUND: The enhanceosome is an enhancer located upstream of the human interferon ß gene, bound by transcription factor (TF) complex of extremely rigid structure. Within these rigid constraints, even a slight change of distances between transcription factor binding sites (TFBS) results in loss of functionality of the enhanceosome. We hypothesized that smaller subunits of the enhanceosome may entail TF complex formation in other regulatory regions. RESULTS: In order to verify this hypothesis we systematically searched for dimerization preferences of the TFs that have TFBS in the enhanceosome. For this we utilized our recently developed tool, TACO. We performed this computational experiment in a cell-type-specific manner by utilizing cell-type-specific DNase-seq data for 105 human cell types. We also used 20 TRANSFAC motifs comprising not only the usual TFs constituting the enhanceosome but also the architectural proteins of High Mobility Group I(Y) (HMG I). A similar experiment used 42 DNase-seq data sets for mouse cell types. We found 137 statistically significant dimer predictions in the human genome, and 37 predictions in the mouse genome, that matched the positioning on the enhanceosome with ±2 bp tolerance. To characterize these predicted TF dimers, we performed functional analysis (Gene Ontology enrichment) for sets of genes which were in the neighbourhood of predicted dimer instances. A notable feature of these instances is that (1) most of them are located in introns of genes, (2) they are enriched in regulatory states, and (3) those instances that are located near transcription start sites are enriched for inclusion in computationally predicted enhancers. We also investigated similarity of dimer predictions between human and mouse. CONCLUSIONS: It follows from our experiments that, except for homodimer formed by IRF proteins, the rest of the dimers were formed exclusively between one of the transcriptional activators (ATF-2/c-Jun and IRF) and a HMG I protein. NF- κB did not participate in forming dimers with other proteins. Dimers predicted in mouse were fully contained in those predicted in human, with exactly the same spacing and orientation. Intriguingly, in most of the cases the enhanceosome motifs have 1 bp wider spacing than the corresponding dimers predicted genome-wide, which is likely caused by the overall 3D structure constraints of the enhanceosome-bound complex.
Subject(s)
Computational Biology , High Mobility Group Proteins/chemistry , Protein Multimerization , Transcription Factors/chemistry , Animals , Base Sequence , Binding Sites , High Mobility Group Proteins/genetics , Humans , Interferon-beta/genetics , Introns/genetics , Mice , Protein Structure, Quaternary , Transcription Factors/geneticsABSTRACT
The SOXE transcription factors SOX8, SOX9 and SOX10 are master regulators of mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. We identified a set of palindromic SOX binding sites specifically enriched in regulatory regions of melanoma cells. SOXE proteins homodimerize on these sequences with high cooperativity. In contrast to other transcription factor dimers, which are typically rigidly spaced, SOXE group proteins can bind cooperatively at a wide range of dimer spacings. Using truncated forms of SOXE proteins, we show that a single dimerization (DIM) domain, that precedes the DNA binding high mobility group (HMG) domain, is sufficient for dimer formation, suggesting that DIM : HMG rather than DIM:DIM interactions mediate the dimerization. All SOXE members can also heterodimerize in this fashion, whereas SOXE heterodimers with SOX2, SOX4, SOX6 and SOX18 are not supported. We propose a structural model where SOXE-specific intramolecular DIM:HMG interactions are allosterically communicated to the HMG of juxtaposed molecules. Collectively, SOXE factors evolved a unique mode to combinatorially regulate their target genes that relies on a multifaceted interplay between the HMG and DIM domains. This property potentially extends further the diversity of target genes and cell-specific functions that are regulated by SOXE proteins.