Your browser doesn't support javascript.
loading
Glypre: In Silico Prediction of Protein Glycation Sites by Fusing Multiple Features and Support Vector Machine.
Zhao, Xiaowei; Zhao, Xiaosa; Bao, Lingling; Zhang, Yonggang; Dai, Jiangyan; Yin, Minghao.
Afiliación
  • Zhao X; School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China. zhaoxw303@nenu.edu.cn.
  • Zhao X; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China. zhaoxw303@nenu.edu.cn.
  • Bao L; School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China. zhaoxw303@126.com.
  • Zhang Y; School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China. baoll601@nenu.edu.cn.
  • Dai J; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China. zhangygcq@163.com.
  • Yin M; School of Computer Engineering, Weifang University, Weifang 261061, China. longwind111@126.com.
Molecules ; 22(11)2017 Nov 03.
Article en En | MEDLINE | ID: mdl-29099805
ABSTRACT
Glycation is a non-enzymatic process occurring inside or outside the host body by attaching a sugar molecule to a protein or lipid molecule. It is an important form of post-translational modification (PTM), which impairs the function and changes the characteristics of the proteins so that the identification of the glycation sites may provide some useful guidelines to understand various biological functions of proteins. In this study, we proposed an accurate prediction tool, named Glypre, for lysine glycation. Firstly, we used multiple informative features to encode the peptides. These features included the position scoring function, secondary structure, AAindex, and the composition of k-spaced amino acid pairs. Secondly, the distribution of distinctive features of the residues surrounding the glycation and non-glycation sites was statistically analysed. Thirdly, based on the distribution of these features, we developed a new predictor by using different optimal window sizes for different properties and a two-step feature selection method, which utilized the maximum relevance minimum redundancy method followed by a greedy feature selection procedure. The performance of Glypre was measured with a sensitivity of 57.47%, a specificity of 90.78%, an accuracy of 79.68%, area under the receiver-operating characteristic (ROC) curve (AUC) of 0.86, and a Matthews's correlation coefficient (MCC) of 0.52 by 10-fold cross-validation. The detailed analysis results showed that our predictor may play a complementary role to other existing methods for identifying protein lysine glycation. The source code and datasets of the Glypre are available in the Supplementary File.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Simulación por Computador / Proteínas / Máquina de Vectores de Soporte / Aminoácidos Tipo de estudio: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Molecules Asunto de la revista: BIOLOGIA Año: 2017 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Simulación por Computador / Proteínas / Máquina de Vectores de Soporte / Aminoácidos Tipo de estudio: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Molecules Asunto de la revista: BIOLOGIA Año: 2017 Tipo del documento: Article País de afiliación: China
...