Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
Genome Res ; 23(6): 928-40, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23471540

RESUMO

Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have characterized the DNA-binding specificities of 129 zinc finger sets from Drosophila using a bacterial one-hybrid system. This data set contains the DNA-binding specificities for at least one encoded ZFP from 70 unique genes and 23 alternate splice isoforms representing the largest set of characterized ZFPs from any organism described to date. These recognition motifs can be used to predict genomic binding sites for these factors within the fruit fly genome. Subsets of fingers from these ZFPs were characterized to define their orientation and register on their recognition sequences, thereby allowing us to define the recognition diversity within this finger set. We find that the characterized fingers can specify 47 of the 64 possible DNA triplets. To confirm the utility of our finger recognition models, we employed subsets of Drosophila fingers in combination with an existing archive of artificial zinc finger modules to create ZFPs with novel DNA-binding specificity. These hybrids of natural and artificial fingers can be used to create functional zinc finger nucleases for editing vertebrate genomes.


Assuntos
Sítios de Ligação , Proteínas de Drosophila/genética , Drosophila/genética , Motivos de Nucleotídeos , Dedos de Zinco/genética , Processamento Alternativo , Animais , Sequência de Bases , Análise por Conglomerados , Biologia Computacional/métodos , Proteínas de Drosophila/química , Proteínas de Drosophila/classificação , Modelos Moleculares , Filogenia , Matrizes de Pontuação de Posição Específica , Ligação Proteica , Conformação Proteica
2.
Nucleic Acids Res ; 42(8): 4800-12, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24523353

RESUMO

Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities.


Assuntos
Proteínas de Ligação a DNA/metabolismo , Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo , Dedos de Zinco , Inteligência Artificial , Sítios de Ligação , DNA/química , Proteínas de Ligação a DNA/química , Modelos Biológicos , Motivos de Nucleotídeos , Fatores de Transcrição/química
3.
Bioinformatics ; 28(12): i84-9, 2012 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-22689783

RESUMO

MOTIVATION: Recognition models for protein-DNA interactions, which allow the prediction of specificity for a DNA-binding domain based only on its sequence or the alteration of specificity through rational design, have long been a goal of computational biology. There has been some progress in constructing useful models, especially for C(2)H(2) zinc finger proteins, but it remains a challenging problem with ample room for improvement. For most families of transcription factors the best available methods utilize k-nearest neighbor (KNN) algorithms to make specificity predictions based on the average of the specificities of the k most similar proteins with defined specificities. Homeodomain (HD) proteins are the second most abundant family of transcription factors, after zinc fingers, in most metazoan genomes, and as a consequence an effective recognition model for this family would facilitate predictive models of many transcriptional regulatory networks within these genomes. RESULTS: Using extensive experimental data, we have tested several machine learning approaches and find that both support vector machines and random forests (RFs) can produce recognition models for HD proteins that are significant improvements over KNN-based methods. Cross-validation analyses show that the resulting models are capable of predicting specificities with high accuracy. We have produced a web-based prediction tool, PreMoTF (Predicted Motifs for Transcription Factors) (http://stormo.wustl.edu/PreMoTF), for predicting position frequency matrices from protein sequence using a RF-based model.


Assuntos
Inteligência Artificial , Biologia Computacional/métodos , DNA/química , Proteínas de Homeodomínio/química , Algoritmos , Sequência de Aminoácidos , Animais , Sítios de Ligação , Drosophila , Humanos , Camundongos , Modelos Estatísticos , Alinhamento de Sequência , Máquina de Vetores de Suporte , Fatores de Transcrição/química , Dedos de Zinco
4.
Nucleic Acids Res ; 39(Database issue): D111-7, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21097781

RESUMO

FlyFactorSurvey (http://pgfe.umassmed.edu/TFDBS/) is a database of DNA binding specificities for Drosophila transcription factors (TFs) primarily determined using the bacterial one-hybrid system. The database provides community access to over 400 recognition motifs and position weight matrices for over 200 TFs, including many unpublished motifs. Search tools and flat file downloads are provided to retrieve binding site information (as sequences, matrices and sequence logos) for individual TFs, groups of TFs or for all TFs with characterized binding specificities. Linked analysis tools allow users to identify motifs within our database that share similarity to a query matrix or to view the distribution of occurrences of an individual motif throughout the Drosophila genome. Together, this database and its associated tools provide computational and experimental biologists with resources to predict interactions between Drosophila TFs and target cis-regulatory sequences.


Assuntos
Bases de Dados Genéticas , Proteínas de Drosophila/metabolismo , Drosophila/genética , Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo , Animais , Bactérias/genética , Sítios de Ligação , Software , Técnicas do Sistema de Duplo-Híbrido , Interface Usuário-Computador
5.
Sci Adv ; 6(36)2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32917609

RESUMO

Recent advances in single-cell techniques catalyze an emerging field of studying how cells convert from one phenotype to another, in a step-by-step process. Two grand technical challenges, however, impede further development of the field. Fixed cell-based approaches can provide snapshots of high-dimensional expression profiles but have fundamental limits on revealing temporal information, and fluorescence-based live-cell imaging approaches provide temporal information but are technically challenging for multiplex long-term imaging. We first developed a live-cell imaging platform that tracks cellular status change through combining endogenous fluorescent labeling that minimizes perturbation to cell physiology and/or live-cell imaging of high-dimensional cell morphological and texture features. With our platform and an A549 VIM-RFP epithelial-to-mesenchymal transition (EMT) reporter cell line, live-cell trajectories reveal parallel paths of EMT missing from snapshot data due to cell-cell dynamic heterogeneity. Our results emphasize the necessity of extracting dynamical information of phenotypic transitions from multiplex live-cell imaging.

6.
Proc Natl Acad Sci U S A ; 104(25): 10352-7, 2007 Jun 19.
Artigo em Inglês | MEDLINE | ID: mdl-17553968

RESUMO

A recent model for the mechanism of intrinsic transcription termination involves dissociation of the RNA from forward-translocated (hypertranslocated) states of the complex [Yarnell WS, Roberts JW (1999) Science, 284:611-615]. The current study demonstrates that halted elongation complexes of T7 RNA polymerase in the absence of termination signals can also dissociate via a forward-translocation mechanism. Shortening of the downstream DNA or the introduction of a stretch of mismatched DNA immediately downstream of the halt site reduces a barrier to forward translocation and correspondingly reduces the lifetime of halted complexes. Conversely, introduction of a cross-link downstream of the halt site increases the same barrier and leads to an increase in complex lifetime. Introduction of a mismatch within the bubble reduces a driving force for forward translocation and correspondingly increases the lifetime of the complex, but only for mismatches at the upstream edge of the bubble, as predicted by the model. Mismatching only the two most upstream of the eight bases in the bubble provides a maximal increase in complex stability, suggesting that dissociation occurs primarily from early forward-translocated states. Finally, addition in trans of an oligonucleotide complementary to the nascent RNA just beyond the hybrid complements the loss of driving force derived from placement of a mismatch within the bubble, confirming the expected additivity of effects. Thus, forward translocation is likely a general mechanism for dissociation of elongation complexes, both in the presence and absence of intrinsic termination signals.


Assuntos
RNA Polimerases Dirigidas por DNA/metabolismo , Proteínas Virais/metabolismo , Sequência de Bases , Transporte Biológico , DNA Viral/química , DNA Viral/genética , DNA Viral/metabolismo , RNA Polimerases Dirigidas por DNA/genética , RNA Polimerases Dirigidas por DNA/isolamento & purificação , Estabilidade Enzimática , Escherichia coli/enzimologia , Cinética , Modelos Genéticos , Mutação , Regiões Promotoras Genéticas , Moldes Genéticos , Transcrição Gênica , Proteínas Virais/genética , Proteínas Virais/isolamento & purificação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA