Your browser doesn't support javascript.
loading
A partially function-to-topic model for protein function prediction.
Liu, Lin; Tang, Lin; Tang, Mingjing; Zhou, Wei.
Afiliación
  • Liu L; School of Information, Yunnan Normal University, Kunming, 650500, Yunnan, China.
  • Tang L; Key Laboratory of Educational Informatization for Nationalities Ministry of Education, Yunnan Normal University, Kunming, 650500, Yunnan, China. maitanweng2@163.com.
  • Tang M; President's Office, Yunnan Normal University, Kunming, 650500, Yunnan, China.
  • Zhou W; School of Software, Yunnan University, Kunming, 650091, Yunnan, China. zwei@ynu.edu.cn.
BMC Genomics ; 19(Suppl 10): 883, 2018 Dec 31.
Article en En | MEDLINE | ID: mdl-30598098
BACKGROUND: Proteins are a kind of macromolecules and the main component of a cell, and thus it is the most essential and versatile material of life. The research of protein functions is of great significance in decoding the secret of life. In recent years, researchers have introduced multi-label supervised topic model such as Labeled Latent Dirichlet Allocation (Labeled-LDA) into protein function prediction, which can obtain more accurate and explanatory prediction. However, the topic-label corresponding way of Labeled-LDA is associating each label (GO term) with a corresponding topic directly, which makes the latent topics to be completely degenerated, and ignores the differences between labels and latent topics. RESULT: To achieve more accurate probabilistic modeling of function label, we propose a Partially Function-to-Topic Prediction (PFTP) model for introducing the local topics subset corresponding to each function label. Meanwhile, PFTP not only supports latent topics subset within a given function label but also a background topic corresponding to a 'fake' function label, which represents common semantic of protein function. Related definitions and the topic modeling process of PFTP are described in this paper. In a 5-fold cross validation experiment on yeast and human datasets, PFTP significantly outperforms five widely adopted methods for protein function prediction. Meanwhile, the impact of model parameters on prediction performance and the latent topics discovered by PFTP are also discussed in this paper. CONCLUSION: All of the experimental results provide evidence that PFTP is effective and have potential value for predicting protein function. Based on its ability of discovering more-refined latent sub-structure of function label, we can anticipate that PFTP is a potential method to reveal a deeper biological explanation for protein functions.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Modelos Biológicos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2018 Tipo del documento: Article País de afiliación: China Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Modelos Biológicos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2018 Tipo del documento: Article País de afiliación: China Pais de publicación: Reino Unido