Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 11 de 11
Filter
1.
Annu Rev Cell Dev Biol ; 35: 357-379, 2019 10 06.
Article in English | MEDLINE | ID: mdl-31283382

ABSTRACT

Eukaryotic transcription factors (TFs) from the same structural family tend to bind similar DNA sequences, despite the ability of these TFs to execute distinct functions in vivo. The cell partly resolves this specificity paradox through combinatorial strategies and the use of low-affinity binding sites, which are better able to distinguish between similar TFs. However, because these sites have low affinity, it is challenging to understand how TFs recognize them in vivo. Here, we summarize recent findings and technological advancements that allow for the quantification and mechanistic interpretation of TF recognition across a wide range of affinities. We propose a model that integrates insights from the fields of genetics and cell biology to provide further conceptual understanding of TF binding specificity. We argue that in eukaryotes, target specificity is driven by an inhomogeneous 3D nuclear distribution of TFs and by variation in DNA binding affinity such that locally elevated TF concentration allows low-affinity binding sites to be functional.


Subject(s)
Eukaryota/metabolism , Regulatory Sequences, Nucleic Acid , Transcription Factors/metabolism , Animals , Binding Sites , Gene Expression Regulation , Humans
2.
Mol Cell ; 78(1): 152-167.e11, 2020 04 02.
Article in English | MEDLINE | ID: mdl-32053778

ABSTRACT

Eukaryotic transcription factors (TFs) form complexes with various partner proteins to recognize their genomic target sites. Yet, how the DNA sequence determines which TF complex forms at any given site is poorly understood. Here, we demonstrate that high-throughput in vitro DNA binding assays coupled with unbiased computational analysis provide unprecedented insight into how different DNA sequences select distinct compositions and configurations of homeodomain TF complexes. Using inferred knowledge about minor groove width readout, we design targeted protein mutations that destabilize homeodomain binding both in vitro and in vivo in a complex-specific manner. By performing parallel systematic evolution of ligands by exponential enrichment sequencing (SELEX-seq), chromatin immunoprecipitation sequencing (ChIP-seq), RNA sequencing (RNA-seq), and Hi-C assays, we not only classify the majority of in vivo binding events in terms of complex composition but also infer complex-specific functions by perturbing the gene regulatory network controlled by a single complex.


Subject(s)
DNA/chemistry , Drosophila Proteins/metabolism , Gene Expression Regulation , Homeodomain Proteins/metabolism , Transcription Factors/metabolism , Animals , Base Sequence , Binding Sites , DNA/metabolism , Drosophila Proteins/chemistry , Drosophila Proteins/genetics , Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Homeodomain Proteins/chemistry , Homeodomain Proteins/genetics , Mutation , Nucleic Acid Conformation , Protein Binding , Transcription Factors/chemistry , Transcription Factors/genetics
3.
Trends Genet ; 39(2): 140-153, 2023 02.
Article in English | MEDLINE | ID: mdl-36549923

ABSTRACT

Regulation of gene expression is a complex but highly guided process. While genomic technologies and computational approaches have allowed high-throughput mapping of cis-regulatory elements (CREs) and their interactions in 3D, their precise role in regulating gene expression remains obscure. Recent complementary observations revealed that interactions between CREs frequently result in the formation of small-scale functional modules within topologically associating domains. Such chromatin modules likely emerge from a complex interplay between regulatory machineries assembled at CREs, including site-specific binding of transcription factors. Here, we review the methods that allow identifying chromatin modules, summarize possible mechanisms that steer CRE interactions within these modules, and discuss outstanding challenges to uncover how chromatin modules fit in our current understanding of the functional 3D genome.


Subject(s)
Chromatin , Gene Expression Regulation , Chromatin/genetics , Gene Expression Regulation/genetics , Genome/genetics , Genomics , Regulatory Sequences, Nucleic Acid/genetics
4.
Genome Res ; 28(1): 111-121, 2018 01.
Article in English | MEDLINE | ID: mdl-29196557

ABSTRACT

The DNA-binding interfaces of the androgen (AR) and glucocorticoid (GR) receptors are virtually identical, yet these transcription factors share only about a third of their genomic binding sites and regulate similarly distinct sets of target genes. To address this paradox, we determined the intrinsic specificities of the AR and GR DNA-binding domains using a refined version of SELEX-seq. We developed an algorithm, SelexGLM, that quantifies binding specificity over a large (31-bp) binding site by iteratively fitting a feature-based generalized linear model to SELEX probe counts. This analysis revealed that the DNA-binding preferences of AR and GR homodimers differ significantly, both within and outside the 15-bp core binding site. The relative preference between the two factors can be tuned over a wide range by changing the DNA sequence, with AR more sensitive to sequence changes than GR. The specificity of AR extends to the regions flanking the core 15-bp site, where isothermal calorimetry measurements reveal that affinity is augmented by enthalpy-driven readout of poly(A) sequences associated with narrowed minor groove width. We conclude that the increased specificity of AR is correlated with more enthalpy-driven binding than GR. The binding models help explain differences in AR and GR genomic binding and provide a biophysical rationale for how promiscuous binding by GR allows functional substitution for AR in some castration-resistant prostate cancers.


Subject(s)
Androgen Receptor Antagonists , Neoplasm Proteins , Prostatic Neoplasms, Castration-Resistant , Receptors, Androgen/metabolism , Receptors, Glucocorticoid , SELEX Aptamer Technique/methods , Androgen Receptor Antagonists/chemical synthesis , Androgen Receptor Antagonists/chemistry , Androgen Receptor Antagonists/pharmacology , Aptamers, Nucleotide/chemical synthesis , Aptamers, Nucleotide/chemistry , Aptamers, Nucleotide/pharmacology , Cell Line, Tumor , Humans , Male , Neoplasm Proteins/antagonists & inhibitors , Neoplasm Proteins/metabolism , Receptors, Glucocorticoid/antagonists & inhibitors , Receptors, Glucocorticoid/metabolism
5.
Proc Natl Acad Sci U S A ; 115(16): E3692-E3701, 2018 04 17.
Article in English | MEDLINE | ID: mdl-29610332

ABSTRACT

Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes.


Subject(s)
DNA Footprinting/methods , DNA-Binding Proteins/metabolism , DNA/metabolism , Animals , Binding Sites , Datasets as Topic , Drosophila Proteins/metabolism , Electrophoretic Mobility Shift Assay , Enhancer Elements, Genetic , Gene Library , Homeodomain Proteins/metabolism , Humans , Models, Molecular , Protein Binding , Protein Conformation , Recombinant Proteins/metabolism , Transcription Factors/metabolism , Tumor Suppressor Protein p53/metabolism
6.
Mol Syst Biol ; 14(2): e7902, 2018 02 22.
Article in English | MEDLINE | ID: mdl-29472273

ABSTRACT

Transcription factors (TFs) interpret DNA sequence by probing the chemical and structural properties of the nucleotide polymer. DNA shape is thought to enable a parsimonious representation of dependencies between nucleotide positions. Here, we propose a unified mathematical representation of the DNA sequence dependence of shape and TF binding, respectively, which simplifies and enhances analysis of shape readout. First, we demonstrate that linear models based on mononucleotide features alone account for 60-70% of the variance in minor groove width, roll, helix twist, and propeller twist. This explains why simple scoring matrices that ignore all dependencies between nucleotide positions can partially account for DNA shape readout by a TF Adding dinucleotide features as sequence-to-shape predictors to our model, we can almost perfectly explain the shape parameters. Building on this observation, we developed a post hoc analysis method that can be used to analyze any mechanism-agnostic protein-DNA binding model in terms of shape readout. Our insights provide an alternative strategy for using DNA shape information to enhance our understanding of how cis-regulatory codes are interpreted by the cellular machinery.


Subject(s)
Computational Biology/methods , DNA/chemistry , Transcription Factors/metabolism , Binding Sites , DNA/metabolism , Models, Molecular , Models, Theoretical , Nucleic Acid Conformation
7.
Nat Biotechnol ; 40(10): 1520-1527, 2022 Oct.
Article in English | MEDLINE | ID: mdl-35606422

ABSTRACT

Protein-ligand interactions are increasingly profiled at high throughput using affinity selection and massively parallel sequencing. However, these assays do not provide the biophysical parameters that most rigorously quantify molecular interactions. Here we describe a flexible machine learning method, called ProBound, that accurately defines sequence recognition in terms of equilibrium binding constants or kinetic rates. This is achieved using a multi-layered maximum-likelihood framework that models both the molecular interactions and the data generation process. We show that ProBound quantifies transcription factor (TF) behavior with models that predict binding affinity over a range exceeding that of previous resources; captures the impact of DNA modifications and conformational flexibility of multi-TF complexes; and infers specificity directly from in vivo data such as ChIP-seq without peak calling. When coupled with an assay called KD-seq, it determines the absolute affinity of protein-ligand interactions. We also apply ProBound to profile the kinetics of kinase-substrate interactions. ProBound opens new avenues for decoding biological networks and rationally engineering protein-ligand interactions.


Subject(s)
Machine Learning , Transcription Factors , Binding Sites , Chromatin Immunoprecipitation , DNA/genetics , Ligands , Protein Binding , Transcription Factors/metabolism
8.
Nat Commun ; 13(1): 2042, 2022 04 19.
Article in English | MEDLINE | ID: mdl-35440565

ABSTRACT

Non-coding variants coordinate transcription factor (TF) binding and chromatin mark enrichment changes over regions spanning >100 kb. These molecularly coordinated regions are named "variable chromatin modules" (VCMs), providing a conceptual framework of how regulatory variation might shape complex traits. To better understand the molecular mechanisms underlying VCM formation, here, we mechanistically dissect a VCM-modulating noncoding variant that is associated with reduced chronic lymphocytic leukemia (CLL) predisposition and disease progression. This common, germline variant constitutes a 5-bp indel that controls the activity of an AXIN2 gene-linked VCM by creating a MEF2 binding site, which, upon binding, activates a super-enhancer-like regulatory element. This triggers a large change in TF binding activity and chromatin state at an enhancer cluster spanning >150 kb, coinciding with subtle, long-range chromatin compaction and robust AXIN2 up-regulation. Our results support a model in which the indel acts as an AXIN2 VCM-activating TF nucleation event, which modulates CLL pathology.


Subject(s)
Chromatin , Leukemia, Lymphocytic, Chronic, B-Cell , Chromatin/genetics , Enhancer Elements, Genetic/genetics , Gene Expression Regulation , Germ Cells/metabolism , Humans , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Transcription Factors/metabolism
9.
J Mol Biol ; 432(6): 1801-1815, 2020 Mar 13.
Article in English | MEDLINE | ID: mdl-31689433

ABSTRACT

Epigenetic DNA modification impacts gene expression, but the underlying molecular mechanisms are only partly understood. Adding a methyl group to a cytosine base locally modifies the structural features of DNA in multiple ways, which may change the interaction with DNA-binding transcription factors (TFs) and trigger a cascade of downstream molecular events. Cells can be probed using various functional genomics assays, but it is difficult to disentangle the confounded effects of DNA modification on TF binding, chromatin accessibility, intranuclear variation in local TF concentration, and rate of transcription. Here we discuss how high-throughput in vitro profiling of protein-DNA interactions has enabled comprehensive characterization and quantification of the methylation sensitivity of TFs. Despite the limited structural data for DNA containing methylated cytosine, automated analysis of structural information in the Protein Data Bank (PDB) shows how 5-methylcytosine (5mC) can be recognized in various ways by amino acid side chains. We discuss how a context-dependent effect of methylation on DNA groove geometry can affect DNA binding by homeodomain proteins and how principled modeling of ChIP-seq data can overcome the confounding that makes the interpretation of in vivo data challenging. The emerging picture is that epigenetic modifications affect TF binding in a highly context-specific manner, with a direction and effect size that depend critically on their position within the TF binding site and the amino acid sequence of the TF. With this improved mechanistic knowledge, we have come closer to understanding how cells use DNA modification to acquire, retain, and change their identity.

10.
Epigenetics Chromatin ; 11(1): 6, 2018 02 06.
Article in English | MEDLINE | ID: mdl-29409522

ABSTRACT

BACKGROUND: DNA shape analysis has demonstrated the potential to reveal structure-based mechanisms of protein-DNA binding. However, information about the influence of chemical modification of DNA is limited. Cytosine methylation, the most frequent modification, represents the addition of a methyl group at the major groove edge of the cytosine base. In mammalian genomes, cytosine methylation most frequently occurs at CpG dinucleotides. In addition to changing the chemical signature of C/G base pairs, cytosine methylation can affect DNA structure. Since the original discovery of DNA methylation, major efforts have been made to understand its effect from a sequence perspective. Compared to unmethylated DNA, however, little structural information is available for methylated DNA, due to the limited number of experimentally determined structures. To achieve a better mechanistic understanding of the effect of CpG methylation on local DNA structure, we developed a high-throughput method, methyl-DNAshape, for predicting the effect of cytosine methylation on DNA shape. RESULTS: Using our new method, we found that CpG methylation significantly altered local DNA shape. Four DNA shape features-helix twist, minor groove width, propeller twist, and roll-were considered in this analysis. Distinct distributions of effect size were observed for different features. Roll and propeller twist were the DNA shape features most strongly affected by CpG methylation with an effect size depending on the local sequence context. Methylation-induced changes in DNA shape were predictive of the measured rate of cleavage by DNase I and suggest a possible mechanism for some of the methylation sensitivities that were recently observed for human Pbx-Hox complexes. CONCLUSIONS: CpG methylation is an important epigenetic mark in the mammalian genome. Understanding its role in protein-DNA recognition can further our knowledge of gene regulation. Our high-throughput methyl-DNAshape method can be used to predict the effect of cytosine methylation on DNA shape and its subsequent influence on protein-DNA interactions. This approach overcomes the limited availability of experimental DNA structures that contain 5-methylcytosine.


Subject(s)
DNA Methylation , DNA-Binding Proteins/metabolism , DNA/chemistry , Mammals/genetics , Animals , Base Sequence , CpG Islands , Cytosine/chemistry , DNA/metabolism , DNA-Binding Proteins/chemistry , Epigenesis, Genetic , Humans , Mammals/metabolism , Models, Molecular , Nucleic Acid Conformation
11.
Cell Rep ; 19(11): 2383-2395, 2017 06 13.
Article in English | MEDLINE | ID: mdl-28614722

ABSTRACT

Although DNA modifications play an important role in gene regulation, the underlying mechanisms remain elusive. We developed EpiSELEX-seq to probe the sensitivity of transcription factor binding to DNA modification in vitro using massively parallel sequencing. Feature-based modeling quantifies the effect of cytosine methylation (5mC) on binding free energy in a position-specific manner. Application to the human bZIP proteins ATF4 and C/EBPß and three different Pbx-Hox complexes shows that 5mCpG can both increase and decrease affinity, depending on where the modification occurs within the protein-DNA interface. The TF paralogs tested vary in their methylation sensitivity, for which we provide a structural rationale. We show that 5mCpG can also enhance in vitro p53 binding and provide evidence for increased in vivo p53 occupancy at methylated binding sites, correlating with primed enhancer histone marks. Our results establish a powerful strategy for dissecting the epigenomic modulation of protein-DNA interactions and their role in gene regulation.


Subject(s)
DNA Methylation/genetics , Epigenesis, Genetic/genetics , Epigenomics/methods , Transcription Factors/metabolism , Gene Expression Regulation , Humans , Protein Binding
SELECTION OF CITATIONS
SEARCH DETAIL