Search | Biblioteca Virtual en Salud Fronteriza

De novo motif discovery facilitates identification of interactions between transcription factors in Saccharomyces cerevisiae.

Chen, Mei-Ju May; Chou, Lih-Ching; Hsieh, Tsung-Ting; Lee, Ding-Dar; Liu, Kai-Wei; Yu, Chi-Yuan; Oyang, Yen-Jen; Tsai, Huai-Kuang; Chen, Chien-Yu.

Bioinformatics ; 28(5): 701-8, 2012 Mar 01.

Article in English | MEDLINE | ID: mdl-22238267

ABSTRACT

MOTIVATION: Gene regulation involves complicated mechanisms such as cooperativity between a set of transcription factors (TFs). Previous studies have used target genes shared by two TFs as a clue to infer TF-TF interactions. However, this task remains challenging because the target genes with low binding affinity are frequently omitted by experimental data, especially when a single strict threshold is employed. This article aims at improving the accuracy of inferring TF-TF interactions by incorporating motif discovery as a fundamental step when detecting overlapping targets of TFs based on ChIP-chip data.

RESULTS: The proposed method, simTFBS, outperforms three naïve methods that adopt fixed thresholds when inferring TF-TF interactions based on ChIP-chip data. In addition, simTFBS is compared with two advanced methods and demonstrates its advantages in predicting TF-TF interactions. By comparing simTFBS with predictions based on the set of available annotated yeast TF binding motifs, we demonstrate that the good performance of simTFBS is indeed coming from the additional motifs found by the proposed procedures.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
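The fixed-threshold idea that the abstract contrasts with simTFBS can be illustrated with a small sketch. This is not simTFBS and not any of the baselines evaluated in the paper; it is a hypothetical example that assumes each TF has a ChIP-chip binding p-value per gene, calls a gene a target when its p-value falls at or below a single threshold, and scores the shared targets with a hypergeometric tail probability (a common choice, assumed here). The gene names, p-values, and thresholds are invented for illustration.

```python
from math import comb


def targets_at_threshold(pvalues, threshold):
    """Return the genes whose binding p-value is at or below the threshold."""
    return {gene for gene, p in pvalues.items() if p <= threshold}


def overlap_pvalue(targets_a, targets_b, n_genes):
    """Hypergeometric tail P(X >= k), where k is the number of shared targets.

    n_genes is the size of the gene universe (an assumption of this sketch).
    """
    k = len(targets_a & targets_b)
    a, b = len(targets_a), len(targets_b)
    total = comb(n_genes, b)
    tail = sum(
        comb(a, i) * comb(n_genes - a, b - i)
        for i in range(k, min(a, b) + 1)
    )
    return tail / total


# Hypothetical ChIP-chip p-values for two TFs over a four-gene universe.
tf_a = {"g1": 0.0005, "g2": 0.0008, "g3": 0.004, "g4": 0.5}
tf_b = {"g1": 0.0002, "g2": 0.003, "g3": 0.0009, "g4": 0.7}

strict_shared = targets_at_threshold(tf_a, 0.001) & targets_at_threshold(tf_b, 0.001)
loose_shared = targets_at_threshold(tf_a, 0.005) & targets_at_threshold(tf_b, 0.005)
```

With the strict threshold only `g1` is shared, while the looser threshold recovers `g2` and `g3` as well. This is the kind of omission of low-affinity targets that the abstract attributes to a single strict threshold.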

Subject(s)

Gene Regulatory Networks; Saccharomyces cerevisiae Proteins/metabolism; Saccharomyces cerevisiae/metabolism; Transcription Factors/metabolism; Chromatin Immunoprecipitation; Gene Expression Regulation, Fungal; Oligonucleotide Array Sequence Analysis; Protein Binding; Saccharomyces cerevisiae Proteins/genetics

Predicting protein-protein interactions in unbalanced data using the primary structure of proteins.

Yu, Chi-Yuan; Chou, Lih-Ching; Chang, Darby Tien-Hao.

BMC Bioinformatics ; 11: 167, 2010 Apr 02.

Article in English | MEDLINE | ID: mdl-20361868

ABSTRACT

BACKGROUND: Elucidating protein-protein interactions (PPIs) is essential to constructing protein interaction networks and facilitating our understanding of the general principles of biological systems. Previous studies have revealed that interacting protein pairs can be predicted by their primary structure. Most of these approaches have achieved satisfactory performance on datasets comprising equal number of interacting and non-interacting protein pairs. However, this ratio is highly unbalanced in nature, and these techniques have not been comprehensively evaluated with respect to the effect of the large number of non-interacting pairs in realistic datasets. Moreover, since highly unbalanced distributions usually lead to large datasets, more efficient predictors are desired when handling such challenging tasks.

RESULTS: This study presents a method for PPI prediction based only on sequence information, which contributes in three aspects. First, we propose a probability-based mechanism for transforming protein sequences into feature vectors. Second, the proposed predictor is designed with an efficient classification algorithm, where the efficiency is essential for handling highly unbalanced datasets. Third, the proposed PPI predictor is assessed with several unbalanced datasets with different positive-to-negative ratios (from 1:1 to 1:15). This analysis provides solid evidence that the degree of dataset imbalance is important to PPI predictors.

CONCLUSIONS: Dealing with data imbalance is a key issue in PPI prediction since there are far fewer interacting protein pairs than non-interacting ones. This article provides a comprehensive study on this issue and develops a practical tool that achieves both good prediction performance and efficiency using only protein sequence information.
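Why the positive-to-negative ratio matters can be seen from simple arithmetic. The sketch below is not the predictor described in the abstract; it assumes a hypothetical classifier whose true-positive rate and false-positive rate stay fixed, and computes the precision expected at a ratio of 1:r. The rates 0.8 and 0.1 are invented for illustration and are not results from the paper.

```python
def expected_precision(tpr, fpr, negatives_per_positive):
    """Expected precision at a 1:r positive-to-negative ratio.

    For every positive pair there are r negative pairs, so per positive the
    classifier yields tpr true positives and fpr * r false positives.
    """
    true_pos = tpr
    false_pos = fpr * negatives_per_positive
    return true_pos / (true_pos + false_pos)


# Hypothetical rates; the ratios span the 1:1 to 1:15 range named in the abstract.
TPR, FPR = 0.8, 0.1
precision_by_ratio = {r: expected_precision(TPR, FPR, r) for r in (1, 5, 10, 15)}

for r, precision in precision_by_ratio.items():
    print(f"1:{r:<2d} precision = {precision:.3f}")
```

Under these assumed rates, precision falls from about 0.89 at 1:1 to about 0.35 at 1:15 even though the classifier itself is unchanged, which is one reason results on balanced datasets may not carry over to realistic ones.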

Subject(s)

Protein Interaction Mapping/methods; Proteins/chemistry; Proteomics/methods; Amino Acid Sequence; Binding Sites; Databases, Protein; Proteins/metabolism; Sequence Analysis, Protein
