Your browser doesn't support javascript.
loading
TFpredict and SABINE: sequence-based prediction of structural and functional characteristics of transcription factors.
Eichner, Johannes; Topf, Florian; Dräger, Andreas; Wrzodek, Clemens; Wanke, Dierk; Zell, Andreas.
Afiliação
  • Eichner J; Center of Bioinformatics Tuebingen (ZBIT), University of Tuebingen, Tübingen, Germany.
  • Topf F; Center of Bioinformatics Tuebingen (ZBIT), University of Tuebingen, Tübingen, Germany.
  • Dräger A; Center of Bioinformatics Tuebingen (ZBIT), University of Tuebingen, Tübingen, Germany ; University of California San Diego, La Jolla, California, United States of America.
  • Wrzodek C; Center of Bioinformatics Tuebingen (ZBIT), University of Tuebingen, Tübingen, Germany.
  • Wanke D; Center for Plant Physiology Tuebingen (ZMBP), University of Tuebingen, Tübingen, Germany.
  • Zell A; Center of Bioinformatics Tuebingen (ZBIT), University of Tuebingen, Tübingen, Germany.
PLoS One ; 8(12): e82238, 2013.
Article em En | MEDLINE | ID: mdl-24349230
ABSTRACT
One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1) discriminates TFs from other proteins, (2) determines the structural superclass of TFs, (3) identifies the DNA-binding domains of TFs and (4) predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http//www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http//www.cogsys.cs.uni-tuebingen.de/software/SABINE/.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Fatores de Transcrição / Software / Biologia Computacional Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2013 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Fatores de Transcrição / Software / Biologia Computacional Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2013 Tipo de documento: Article