Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 161(2): 307-18, 2015 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-25843630

RESUMO

Protein-DNA binding is mediated by the recognition of the chemical signatures of the DNA bases and the 3D shape of the DNA molecule. Because DNA shape is a consequence of sequence, it is difficult to dissociate these modes of recognition. Here, we tease them apart in the context of Hox-DNA binding by mutating residues that, in a co-crystal structure, only recognize DNA shape. Complexes made with these mutants lose the preference to bind sequences with specific DNA shape features. Introducing shape-recognizing residues from one Hox protein to another swapped binding specificities in vitro and gene regulation in vivo. Statistical machine learning revealed that the accuracy of binding specificity predictions improves by adding shape features to a model that only depends on sequence, and feature selection identified shape features important for recognition. Thus, shape readout is a direct and independent component of binding site selection by Hox proteins.


Assuntos
DNA/química , DNA/metabolismo , Proteínas de Drosophila/química , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/metabolismo , Fatores de Transcrição/química , Fatores de Transcrição/metabolismo , Sequência de Aminoácidos , Animais , Cristalografia por Raios X , Proteínas de Homeodomínio/química , Proteínas de Homeodomínio/metabolismo , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Ligação Proteica , Alinhamento de Sequência
2.
Cell ; 147(6): 1270-82, 2011 Dec 09.
Artigo em Inglês | MEDLINE | ID: mdl-22153072

RESUMO

Members of transcription factor families typically have similar DNA binding specificities yet execute unique functions in vivo. Transcription factors often bind DNA as multiprotein complexes, raising the possibility that complex formation might modify their DNA binding specificities. To test this hypothesis, we developed an experimental and computational platform, SELEX-seq, that can be used to determine the relative affinities to any DNA sequence for any transcription factor complex. Applying this method to all eight Drosophila Hox proteins, we show that they obtain novel recognition properties when they bind DNA with the dimeric cofactor Extradenticle-Homothorax (Exd). Exd-Hox specificities group into three main classes that obey Hox gene collinearity rules and DNA structure predictions suggest that anterior and posterior Hox proteins prefer DNA sequences with distinct minor groove topographies. Together, these data suggest that emergent DNA recognition properties revealed by interactions with cofactors contribute to transcription factor specificities in vivo.


Assuntos
DNA/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila/metabolismo , Proteínas de Homeodomínio/metabolismo , Multimerização Proteica , Fatores de Transcrição/metabolismo , Sequência de Aminoácidos , Animais , Proteínas de Drosophila/química , Técnicas Genéticas , Proteínas de Homeodomínio/química , Dados de Sequência Molecular , Estrutura Terciária de Proteína , Fatores de Transcrição/química
3.
Mol Biol Evol ; 35(8): 1958-1967, 2018 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-29850830

RESUMO

Noncoding DNA sequences, which play various roles in gene expression and regulation, are under evolutionary pressure. Gene regulation requires specific protein-DNA binding events, and our previous studies showed that both DNA sequence and shape readout are employed by transcription factors (TFs) to achieve DNA binding specificity. By investigating the shape-disrupting properties of single nucleotide polymorphisms (SNPs) in human regulatory regions, we established a link between disruptive local DNA shape changes and loss of specific TF binding. Furthermore, we described cases where disease-associated SNPs may alter TF binding through DNA shape changes. This link led us to hypothesize that local DNA shape within and around TF binding sites is under selection pressure. To verify this hypothesis, we analyzed SNP data derived from 216 natural strains of Drosophila melanogaster. Comparing SNPs located in functional and nonfunctional regions within experimentally validated cis-regulatory modules (CRMs) from D. melanogaster that are active in the blastoderm stage of development, we found that SNPs within functional regions tended to cause smaller DNA shape variations. Furthermore, SNPs with higher minor allele frequency were more likely to result in smaller DNA shape variations. The same analysis based on a large number of SNPs in putative CRMs of the D. melanogaster genome derived from DNase I accessibility data confirmed these observations. Taken together, our results indicate that common SNPs in functional regions tend to maintain DNA shape, whereas shape-disrupting SNPs are more likely to be eliminated through purifying selection.


Assuntos
DNA , Conformação de Ácido Nucleico , Polimorfismo de Nucleotídeo Único , Seleção Genética , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Drosophila melanogaster , Frequência do Gene , Genoma de Inseto , Humanos
4.
Trends Biochem Sci ; 39(9): 381-99, 2014 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25129887

RESUMO

Transcription factors (TFs) influence cell fate by interpreting the regulatory DNA within a genome. TFs recognize DNA in a specific manner; the mechanisms underlying this specificity have been identified for many TFs based on 3D structures of protein-DNA complexes. More recently, structural views have been complemented with data from high-throughput in vitro and in vivo explorations of the DNA-binding preferences of many TFs. Together, these approaches have greatly expanded our understanding of TF-DNA interactions. However, the mechanisms by which TFs select in vivo binding sites and alter gene expression remain unclear. Recent work has highlighted the many variables that influence TF-DNA binding, while demonstrating that a biophysical understanding of these many factors will be central to understanding TF function.


Assuntos
Fenômenos Biofísicos/genética , DNA/genética , Genoma/genética , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Biologia Computacional , DNA/metabolismo , Humanos , Ligação Proteica
5.
Proc Natl Acad Sci U S A ; 112(15): 4654-9, 2015 Apr 14.
Artigo em Inglês | MEDLINE | ID: mdl-25775564

RESUMO

DNA binding specificities of transcription factors (TFs) are a key component of gene regulatory processes. Underlying mechanisms that explain the highly specific binding of TFs to their genomic target sites are poorly understood. A better understanding of TF-DNA binding requires the ability to quantitatively model TF binding to accessible DNA as its basic step, before additional in vivo components can be considered. Traditionally, these models were built based on nucleotide sequence. Here, we integrated 3D DNA shape information derived with a high-throughput approach into the modeling of TF binding specificities. Using support vector regression, we trained quantitative models of TF binding specificity based on protein binding microarray (PBM) data for 68 mammalian TFs. The evaluation of our models included cross-validation on specific PBM array designs, testing across different PBM array designs, and using PBM-trained models to predict relative binding affinities derived from in vitro selection combined with deep sequencing (SELEX-seq). Our results showed that shape-augmented models compared favorably to sequence-based models. Although both k-mer and DNA shape features can encode interdependencies between nucleotide positions of the binding site, using DNA shape features reduced the dimensionality of the feature space. In addition, analyzing the feature weights of DNA shape-augmented models uncovered TF family-specific structural readout mechanisms that were not revealed by the DNA sequence. As such, this work combines knowledge from structural biology and genomics, and suggests a new path toward understanding TF binding and genome function.


Assuntos
DNA/química , DNA/metabolismo , Conformação de Ácido Nucleico , Fatores de Transcrição/metabolismo , Algoritmos , Animais , Sequência de Bases , Sítios de Ligação/genética , Biologia Computacional/métodos , DNA/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Cinética , Camundongos , Modelos Genéticos , Análise Serial de Proteínas , Ligação Proteica , Fatores de Transcrição/genética
6.
Bioinformatics ; 32(8): 1211-3, 2016 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-26668005

RESUMO

UNLABELLED: DNAshapeR predicts DNA shape features in an ultra-fast, high-throughput manner from genomic sequencing data. The package takes either nucleotide sequence or genomic coordinates as input and generates various graphical representations for visualization and further analysis. DNAshapeR further encodes DNA sequence and shape features as user-defined combinations of k-mer and DNA shape features. The resulting feature matrices can be readily used as input of various machine learning software packages for further modeling studies. AVAILABILITY AND IMPLEMENTATION: The DNAshapeR software package was implemented in the statistical programming language R and is freely available through the Bioconductor project at https://www.bioconductor.org/packages/devel/bioc/html/DNAshapeR.html and at the GitHub developer site, http://tsupeichiu.github.io/DNAshapeR/ CONTACT: rohs@usc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
DNA , Genômica , Software , Genoma , Linguagens de Programação
7.
Nucleic Acids Res ; 43(Database issue): D103-9, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25326329

RESUMO

Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species.


Assuntos
DNA/química , Bases de Dados de Ácidos Nucleicos , Genoma , Anotação de Sequência Molecular , Navegador , Animais , Sítios de Ligação , Humanos , Conformação de Ácido Nucleico , Nucleossomos/metabolismo , Sítio de Iniciação de Transcrição
8.
Nucleic Acids Res ; 42(1): 430-41, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24078250

RESUMO

Protein-DNA recognition is a critical component of gene regulatory processes but the underlying molecular mechanisms are not yet completely understood. Whereas the DNA binding preferences of transcription factors (TFs) are commonly described using nucleotide sequences, the 3D DNA structure is recognized by proteins and is crucial for achieving binding specificity. However, the ability to analyze DNA shape in a high-throughput manner made it only recently feasible to integrate structural information into studies of protein-DNA binding. Here we focused on the homeodomain family of TFs and analyzed the DNA shape of thousands of their DNA binding sites, investigating the covariation between the protein sequence and the sequence and shape of their DNA targets. We found distinct homeodomain regions that were more correlated with either the nucleotide sequence or the DNA shape of their preferred binding sites, demonstrating different readout mechanisms through which homeodomains attain DNA binding specificity. We identified specific homeodomain residues that likely play key roles in DNA recognition via shape readout. Finally, we showed that adding DNA shape information when characterizing binding sites improved the prediction accuracy of homeodomain binding specificities. Taken together, our findings indicate that DNA shape information can generally provide new mechanistic insights into TF binding.


Assuntos
DNA/química , Proteínas de Homeodomínio/química , Fatores de Transcrição/química , Sequência de Aminoácidos , Animais , Sequência de Bases , Sítios de Ligação , DNA/metabolismo , Proteínas de Drosophila/química , Proteínas de Homeodomínio/metabolismo , Camundongos , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Ligação Proteica , Conformação Proteica , Fatores de Transcrição/metabolismo
9.
Nucleic Acids Res ; 42(Database issue): D148-55, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24214955

RESUMO

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.


Assuntos
DNA/química , Bases de Dados Genéticas , Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo , Animais , Fatores de Transcrição Hélice-Alça-Hélice Básicos/metabolismo , Sítios de Ligação , Proteínas de Homeodomínio/metabolismo , Humanos , Internet , Camundongos , Conformação de Ácido Nucleico , Motivos de Nucleotídeos
10.
Proc Natl Acad Sci U S A ; 110(16): 6376-81, 2013 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-23576721

RESUMO

DNA binding proteins find their cognate sequences within genomic DNA through recognition of specific chemical and structural features. Here we demonstrate that high-resolution DNase I cleavage profiles can provide detailed information about the shape and chemical modification status of genomic DNA. Analyzing millions of DNA backbone hydrolysis events on naked genomic DNA, we show that the intrinsic rate of cleavage by DNase I closely tracks the width of the minor groove. Integration of these DNase I cleavage data with bisulfite sequencing data for the same cell type's genome reveals that cleavage directly adjacent to cytosine-phosphate-guanine (CpG) dinucleotides is enhanced at least eightfold by cytosine methylation. This phenomenon we show to be attributable to methylation-induced narrowing of the minor groove. Furthermore, we demonstrate that it enables simultaneous mapping of DNase I hypersensitivity and regional DNA methylation levels using dense in vivo cleavage data. Taken together, our results suggest a general mechanism by which CpG methylation can modulate protein-DNA interaction strength via the remodeling of DNA shape.


Assuntos
Metilação de DNA/genética , DNA/química , Desoxirribonuclease I , Genômica/métodos , Modelos Moleculares , Conformação de Ácido Nucleico , Células Cultivadas , Ilhas de CpG/genética , DNA/metabolismo , Desoxirribonuclease I/metabolismo , Humanos , Modelos Genéticos , Análise de Sequência de DNA
11.
Nucleic Acids Res ; 41(Web Server issue): W56-62, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23703209

RESUMO

We present a method and web server for predicting DNA structural features in a high-throughput (HT) manner for massive sequence data. This approach provides the framework for the integration of DNA sequence and shape analyses in genome-wide studies. The HT methodology uses a sliding-window approach to mine DNA structural information obtained from Monte Carlo simulations. It requires only nucleotide sequence as input and instantly predicts multiple structural features of DNA (minor groove width, roll, propeller twist and helix twist). The results of rigorous validations of the HT predictions based on DNA structures solved by X-ray crystallography and NMR spectroscopy, hydroxyl radical cleavage data, statistical analysis and cross-validation, and molecular dynamics simulations provide strong confidence in this approach. The DNAshape web server is freely available at http://rohslab.cmb.usc.edu/DNAshape/.


Assuntos
DNA/química , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Software , Genômica , Internet , Conformação de Ácido Nucleico
12.
Brief Funct Genomics ; 14(1): 61-73, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25319759

RESUMO

Many anecdotal observations exist of a regulatory effect of DNA methylation on gene expression. However, in general, the underlying mechanisms of this effect are poorly understood. In this review, we summarize what is currently known about how this important, but mysterious, epigenetic mark impacts cellular functions. Cytosine methylation can abrogate or enhance interactions with DNA-binding proteins, or it may have no effect, depending on the context. Despite being only a small chemical change, the addition of a methyl group to cytosine can affect base readout via hydrophobic contacts in the major groove and shape readout via electrostatic contacts in the minor groove. We discuss the recent discovery that CpG methylation increases DNase I cleavage at adjacent positions by an order of magnitude through altering the local 3D DNA shape and the possible implications of this structural insight for understanding the methylation sensitivity of transcription factors (TFs). Additionally, 5-methylcytosines change the stability of nucleosomes and, thus, affect the local chromatin structure and access of TFs to genomic DNA. Given these complexities, it seems unlikely that the influence of DNA methylation on protein-DNA binding can be captured in a small set of general rules. Hence, data-driven approaches may be essential to gain a better understanding of these mechanisms.


Assuntos
Citosina/metabolismo , Metilação de DNA , Animais , Sequência de Bases , Humanos , Modelos Moleculares , Nucleossomos/metabolismo , Ligação Proteica , Fatores de Transcrição/metabolismo
13.
Cell Rep ; 3(4): 1093-104, 2013 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-23562153

RESUMO

DNA sequence is a major determinant of the binding specificity of transcription factors (TFs) for their genomic targets. However, eukaryotic cells often express, at the same time, TFs with highly similar DNA binding motifs but distinct in vivo targets. Currently, it is not well understood how TFs with seemingly identical DNA motifs achieve unique specificities in vivo. Here, we used custom protein-binding microarrays to analyze TF specificity for putative binding sites in their genomic sequence context. Using yeast TFs Cbf1 and Tye7 as our case studies, we found that binding sites of these bHLH TFs (i.e., E-boxes) are bound differently in vitro and in vivo, depending on their genomic context. Computational analyses suggest that nucleotides outside E-box binding sites contribute to specificity by influencing the three-dimensional structure of DNA binding sites. Thus, the local shape of target sites might play a widespread role in achieving regulatory specificity within TF families.


Assuntos
Fatores de Transcrição de Zíper de Leucina e Hélice-Alça-Hélix Básicos/metabolismo , DNA/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética , Transativadores/metabolismo , Sequência de Aminoácidos , Fatores de Transcrição de Zíper de Leucina e Hélice-Alça-Hélix Básicos/química , Fatores de Transcrição de Zíper de Leucina e Hélice-Alça-Hélix Básicos/genética , Sítios de Ligação , DNA/química , Elementos E-Box , Genoma Fúngico , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Motivos de Nucleotídeos , Ligação Proteica , Estrutura Secundária de Proteína , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/genética , Transativadores/química , Transativadores/genética
14.
Comput Biol Chem ; 35(5): 319-22, 2011 Oct 12.
Artigo em Inglês | MEDLINE | ID: mdl-22000803

RESUMO

We employed an evolutionary genomics approach to detect genes under lineage-specific positive selection for the two closely related Mycobacterium tuberculosis strains, the virulent H37Rv and the avirulent H37Ra, with the clinical isolate CDC1551 as the outgroup. We found six H37Rv-specific and six H37Ra-specific positively selected genes, among which the former comprised a flavoprotein, a RNA polymerase sigma factor SigM, two PPE family proteins, as well as two hypothetical proteins, while the latter consisted of a dehydrogenase, a (3R)-hydroxyacyl-ACP dehydratase subunit HadA, a PPE family protein, and three PE-PGRS family proteins. Obviously, the PE/PPE/PE-PGRS family proteins were the main targets of positive selection. The functional discussion of our findings implied that those positively selected genes were highly involved in antigen variations and immune evasions of Mycobacterium tuberculosis.


Assuntos
Genes Bacterianos , Mycobacterium tuberculosis/genética , Seleção Genética , Antígenos de Bactérias/genética , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Estudos de Associação Genética , Genoma Bacteriano , Humanos , Modelos Genéticos , Mycobacterium tuberculosis/isolamento & purificação , Mycobacterium tuberculosis/patogenicidade , Tuberculose Pulmonar/microbiologia , Virulência/genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa