Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters











Publication year range
1.
Proc Natl Acad Sci U S A ; 120(4): e2205796120, 2023 01 24.
Article in English | MEDLINE | ID: mdl-36656856

ABSTRACT

DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these signatures enable proteins to form specific contacts between protein residues and bp, their study can provide mechanistic insights into protein-DNA binding. Existing experimental methods, such as X-ray crystallography, can reveal such mechanisms based on physicochemical interactions between proteins and their DNA target sites. However, the low throughput of structural biology methods limits mechanistic insights for selection of many genomic sites. High-throughput binding assays enable prediction of potential target sites by determining relative binding affinities of a protein to massive numbers of DNA sequences. Many currently available computational methods are based on the sequence of standard Watson-Crick bp. They assume that the contribution of overall binding affinity is independent for each base pair, or alternatively include dinucleotides or short k-mers. These methods cannot directly expand to physicochemical contacts, and they are not suitable to apply to DNA modifications or non-Watson-Crick bp. These variations include DNA methylation, and synthetic or mismatched bp. The proposed method, DeepRec, can predict relative binding affinities as function of physicochemical signatures and the effect of DNA methylation or other chemical modifications on binding. Sequence-based modeling methods are in comparison a coarse-grain description and cannot achieve such insights. Our chemistry-based modeling framework provides a path towards understanding genome function at a mechanistic level.


Subject(s)
DNA-Binding Proteins , DNA , Base Pairing , DNA/metabolism , Protein Binding , DNA-Binding Proteins/metabolism , Binding Sites
2.
Sci Adv ; 8(34): eabm4358, 2022 08 26.
Article in English | MEDLINE | ID: mdl-36001652

ABSTRACT

Genome-wide binding profiles of estrogen receptor (ER) and FOXA1 reflect cancer state in ER+ breast cancer. However, routine profiling of tumor transcription factor (TF) binding is impractical in the clinic. Here, we show that plasma cell-free DNA (cfDNA) contains high-resolution ER and FOXA1 tumor binding profiles for breast cancer. Enrichment of TF footprints in plasma reflects the binding strength of the TF in originating tissue. We defined pure in vivo tumor TF signatures in plasma using ER+ breast cancer xenografts, which can distinguish xenografts with distinct ER states. Furthermore, state-specific ER-binding signatures can partition human breast tumors into groups with significantly different ER expression and mortality. Last, TF footprints in human plasma samples can identify the presence of ER+ breast cancer. Thus, plasma TF footprints enable minimally invasive mapping of the regulatory landscape of breast cancer in humans and open vast possibilities for clinical applications across multiple tumor types.


Subject(s)
Breast Neoplasms , Cell-Free Nucleic Acids , Breast Neoplasms/pathology , Female , Gene Expression Regulation , Gene Expression Regulation, Neoplastic , Hepatocyte Nuclear Factor 3-alpha/genetics , Hepatocyte Nuclear Factor 3-alpha/metabolism , Humans , Nucleosomes , Receptors, Estrogen/genetics
3.
STAR Protoc ; 3(2): 101299, 2022 06 17.
Article in English | MEDLINE | ID: mdl-35463472

ABSTRACT

Here, we present a pipeline to map states of protein-binding DNA in vivo. Our pipeline infers as well as quantifies cooperative binding. Using dual-enzyme single-molecule footprinting (dSMF) data, we show how our workflow identifies binding states at an enhancer in Drosophila S2 cells. Data from cells lacking endogenous DNA methylation are a prerequisite for this pipeline. For complete details on the use and execution of this protocol, please refer to Rao et al. (2021) and Krebs et al. (2017).


Subject(s)
DNA Methylation , DNA , DNA/genetics , DNA Methylation/genetics , Protein Binding , Workflow
4.
Mol Cell ; 81(8): 1651-1665.e4, 2021 04 15.
Article in English | MEDLINE | ID: mdl-33705711

ABSTRACT

Enhancers harbor binding motifs that recruit transcription factors (TFs) for gene activation. While cooperative binding of TFs at enhancers is known to be critical for transcriptional activation of a handful of developmental enhancers, the extent of TF cooperativity genome-wide is unknown. Here, we couple high-resolution nuclease footprinting with single-molecule methylation profiling to characterize TF cooperativity at active enhancers in the Drosophila genome. Enrichment of short micrococcal nuclease (MNase)-protected DNA segments indicates that the majority of enhancers harbor two or more TF-binding sites, and we uncover protected fragments that correspond to co-bound sites in thousands of enhancers. From the analysis of co-binding, we find that cooperativity dominates TF binding in vivo at the majority of active enhancers. Cooperativity is highest between sites spaced 50 bp apart, indicating that cooperativity occurs without apparent protein-protein interactions. Our findings suggest nucleosomes promoting cooperativity because co-binding may effectively clear nucleosomes and promote enhancer function.


Subject(s)
Enhancer Elements, Genetic/genetics , Protein Binding/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , Animals , Binding Sites/genetics , Cell Line , Drosophila/genetics , Drosophila/metabolism , Genome/genetics , Micrococcal Nuclease/genetics , Nucleosomes/genetics , Nucleosomes/metabolism , Protein Interaction Maps/genetics , Transcriptional Activation/genetics
5.
Biochemistry ; 59(48): 4523-4532, 2020 12 08.
Article in English | MEDLINE | ID: mdl-33205945

ABSTRACT

We demonstrate here that the α subunit C-terminal domain of Escherichia coli RNA polymerase (αCTD) recognizes the upstream promoter (UP) DNA element via its characteristic minor groove shape and electrostatic potential. In two compositionally distinct crystallized assemblies, a pair of αCTD subunits bind in tandem to the UP element consensus A-tract that is 6 bp in length (A6-tract), each with their arginine 265 guanidinium group inserted into the minor groove. The A6-tract minor groove is significantly narrowed in these crystal structures, as well as in computationally predicted structures of free and bound DNA duplexes derived by Monte Carlo and molecular dynamics simulations, respectively. The negative electrostatic potential of free A6-tract DNA is substantially enhanced compared to that of generic DNA. Shortening the A-tract by 1 bp is shown to "knock out" binding of the second αCTD through widening of the minor groove. Furthermore, in computationally derived structures with arginine 265 mutated to alanine in either αCTD, either with or without the "knockout" DNA mutation, contact with the DNA is perturbed, highlighting the importance of arginine 265 in achieving αCTD-DNA binding. These results demonstrate that the importance of the DNA shape in sequence-dependent recognition of DNA by RNA polymerase is comparable to that of certain transcription factors.


Subject(s)
DNA, Bacterial/chemistry , DNA, Bacterial/metabolism , DNA-Directed RNA Polymerases/chemistry , DNA-Directed RNA Polymerases/metabolism , Escherichia coli Proteins/chemistry , Escherichia coli Proteins/metabolism , Binding Sites , Crystallography, X-Ray , Cyclic AMP Receptor Protein/chemistry , Cyclic AMP Receptor Protein/genetics , Cyclic AMP Receptor Protein/metabolism , DNA, Bacterial/genetics , DNA-Directed RNA Polymerases/genetics , Escherichia coli/genetics , Escherichia coli/metabolism , Escherichia coli Proteins/genetics , Gene Knockout Techniques , Genes, Bacterial , Models, Molecular , Mutation , Nucleic Acid Conformation , Promoter Regions, Genetic , Protein Domains , Static Electricity
6.
Open Biol ; 10(9): 200119, 2020 09.
Article in English | MEDLINE | ID: mdl-32873154

ABSTRACT

Cell-free DNA (cfDNA) has the potential to enable non-invasive detection of disease states and progression. Beyond its sequence, cfDNA also represents the nucleosomal landscape of cell(s)-of-origin and captures the dynamics of the epigenome. In this review, we highlight the emergence of cfDNA epigenomic methods that assess disease beyond the scope of mutant tumour genotyping. Detection of tumour mutations is the gold standard for sequencing methods in clinical oncology. However, limitations inherent to mutation targeting in cfDNA, and the possibilities of uncovering molecular mechanisms underlying disease, have made epigenomics of cfDNA an exciting alternative. We discuss the epigenomic information revealed by cfDNA, and how epigenomic methods exploit cfDNA to detect and characterize cancer. Future applications of cfDNA epigenomic methods to act complementarily and orthogonally to current clinical practices has the potential to transform cancer management and improve cancer patient outcomes.


Subject(s)
Biomarkers , Cell-Free Nucleic Acids , Phenotype , Chromatin/chemistry , Chromatin/genetics , Chromatin Assembly and Disassembly , Chromatin Immunoprecipitation , Clinical Decision-Making , DNA Methylation , Disease Management , Epigenesis, Genetic , Epigenomics/methods , Gene Expression , Histones/metabolism , Humans , Nucleosomes/genetics , Nucleosomes/metabolism , Outcome Assessment, Health Care , Prognosis
7.
Epigenetics Chromatin ; 11(1): 6, 2018 02 06.
Article in English | MEDLINE | ID: mdl-29409522

ABSTRACT

BACKGROUND: DNA shape analysis has demonstrated the potential to reveal structure-based mechanisms of protein-DNA binding. However, information about the influence of chemical modification of DNA is limited. Cytosine methylation, the most frequent modification, represents the addition of a methyl group at the major groove edge of the cytosine base. In mammalian genomes, cytosine methylation most frequently occurs at CpG dinucleotides. In addition to changing the chemical signature of C/G base pairs, cytosine methylation can affect DNA structure. Since the original discovery of DNA methylation, major efforts have been made to understand its effect from a sequence perspective. Compared to unmethylated DNA, however, little structural information is available for methylated DNA, due to the limited number of experimentally determined structures. To achieve a better mechanistic understanding of the effect of CpG methylation on local DNA structure, we developed a high-throughput method, methyl-DNAshape, for predicting the effect of cytosine methylation on DNA shape. RESULTS: Using our new method, we found that CpG methylation significantly altered local DNA shape. Four DNA shape features-helix twist, minor groove width, propeller twist, and roll-were considered in this analysis. Distinct distributions of effect size were observed for different features. Roll and propeller twist were the DNA shape features most strongly affected by CpG methylation with an effect size depending on the local sequence context. Methylation-induced changes in DNA shape were predictive of the measured rate of cleavage by DNase I and suggest a possible mechanism for some of the methylation sensitivities that were recently observed for human Pbx-Hox complexes. CONCLUSIONS: CpG methylation is an important epigenetic mark in the mammalian genome. Understanding its role in protein-DNA recognition can further our knowledge of gene regulation. Our high-throughput methyl-DNAshape method can be used to predict the effect of cytosine methylation on DNA shape and its subsequent influence on protein-DNA interactions. This approach overcomes the limited availability of experimental DNA structures that contain 5-methylcytosine.


Subject(s)
DNA Methylation , DNA-Binding Proteins/metabolism , DNA/chemistry , Mammals/genetics , Animals , Base Sequence , CpG Islands , Cytosine/chemistry , DNA/metabolism , DNA-Binding Proteins/chemistry , Epigenesis, Genetic , Humans , Mammals/metabolism , Models, Molecular , Nucleic Acid Conformation
8.
Nucleic Acids Res ; 45(21): 12565-12576, 2017 Dec 01.
Article in English | MEDLINE | ID: mdl-29040720

ABSTRACT

Protein-DNA binding is a fundamental component of gene regulatory processes, but it is still not completely understood how proteins recognize their target sites in the genome. Besides hydrogen bonding in the major groove (base readout), proteins recognize minor-groove geometry using positively charged amino acids (shape readout). The underlying mechanism of DNA shape readout involves the correlation between minor-groove width and electrostatic potential (EP). To probe this biophysical effect directly, rather than using minor-groove width as an indirect measure for shape readout, we developed a methodology, DNAphi, for predicting EP in the minor groove and confirmed the direct role of EP in protein-DNA binding using massive sequencing data. The DNAphi method uses a sliding-window approach to mine results from non-linear Poisson-Boltzmann (NLPB) calculations on DNA structures derived from all-atom Monte Carlo simulations. We validated this approach, which only requires nucleotide sequence as input, based on direct comparison with NLPB calculations for available crystal structures. Using statistical machine-learning approaches, we showed that adding EP as a biophysical feature can improve the predictive power of quantitative binding specificity models across 27 transcription factor families. High-throughput prediction of EP offers a novel way to integrate biophysical and genomic studies of protein-DNA binding.


Subject(s)
DNA-Binding Proteins/metabolism , DNA/chemistry , Transcription Factors/metabolism , Binding Sites , DNA/metabolism , DNA-Binding Proteins/chemistry , Escherichia coli Proteins/metabolism , Factor For Inversion Stimulation Protein/metabolism , Genome , Genomics , Homeodomain Proteins/metabolism , Machine Learning , Models, Molecular , Monte Carlo Method , Nucleic Acid Conformation , Phosphates/chemistry , Protein Binding , Static Electricity , Transcription Factors/chemistry
9.
Brief Funct Genomics ; 14(1): 61-73, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25319759

ABSTRACT

Many anecdotal observations exist of a regulatory effect of DNA methylation on gene expression. However, in general, the underlying mechanisms of this effect are poorly understood. In this review, we summarize what is currently known about how this important, but mysterious, epigenetic mark impacts cellular functions. Cytosine methylation can abrogate or enhance interactions with DNA-binding proteins, or it may have no effect, depending on the context. Despite being only a small chemical change, the addition of a methyl group to cytosine can affect base readout via hydrophobic contacts in the major groove and shape readout via electrostatic contacts in the minor groove. We discuss the recent discovery that CpG methylation increases DNase I cleavage at adjacent positions by an order of magnitude through altering the local 3D DNA shape and the possible implications of this structural insight for understanding the methylation sensitivity of transcription factors (TFs). Additionally, 5-methylcytosines change the stability of nucleosomes and, thus, affect the local chromatin structure and access of TFs to genomic DNA. Given these complexities, it seems unlikely that the influence of DNA methylation on protein-DNA binding can be captured in a small set of general rules. Hence, data-driven approaches may be essential to gain a better understanding of these mechanisms.


Subject(s)
Cytosine/metabolism , DNA Methylation , Animals , Base Sequence , Humans , Models, Molecular , Nucleosomes/metabolism , Protein Binding , Transcription Factors/metabolism
10.
Biochim Biophys Acta ; 1834(8): 1520-31, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23665455

ABSTRACT

Specification of the three dimensional structure of a protein from its amino acid sequence, also called a "Grand Challenge" problem, has eluded a solution for over six decades. A modestly successful strategy has evolved over the last couple of decades based on development of scoring functions (e.g. mimicking free energy) that can capture native or native-like structures from an ensemble of decoys generated as plausible candidates for the native structure. A scoring function must be fast enough in discriminating the native from unfolded/misfolded structures, and requires validation on a large data set(s) to generate sufficient confidence in the score. Here we develop a scoring function called pcSM that detects true native structure in the top 5 with 93% accuracy from an ensemble of candidate structures. If we eliminate the native from ensemble of decoys then pcSM is able to capture near native structure (RMSD<=5Ǻ) in top 10 with 86% accuracy. The parameters considered in pcSM are a C-alpha Euclidean metric, secondary structural propensity, surface areas and an intramolecular energy function. pcSM has been tested on 415 systems consisting 142,698 decoys (public and CASP-largest reported hitherto in literature). The average rank for the native is 2.38, a significant improvement over that existing in literature. In-silico protein structure prediction requires robust scoring technique(s). Therefore, pcSM is easily amenable to integration into a successful protein structure prediction strategy. The tool is freely available at http://www.scfbio-iitd.res.in/software/pcsm.jsp.


Subject(s)
Caspase 9/chemistry , Caspases/chemistry , Computational Biology , Protein Folding , Software , Algorithms , Computer Simulation , Humans , Models, Molecular , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL