Your browser doesn't support javascript.
loading
Prediction of sumoylation sites in proteins using linear discriminant analysis.
Xu, Yan; Ding, Ya-Xin; Deng, Nai-Yang; Liu, Li-Ming.
Afiliación
  • Xu Y; Department of Information and Computer Science, University of Science and Technology Beijing, Beijing 100083, China.
  • Ding YX; Department of Information and Computer Science, University of Science and Technology Beijing, Beijing 100083, China.
  • Deng NY; College of Science, China Agricultural University, Beijing 100083, China.
  • Liu LM; School of Statistics, Capital University of Economics and Business, Beijing, 100070, China. Electronic address: llm5609@163.com.
Gene ; 576(1 Pt 1): 99-104, 2016 Jan 15.
Article en En | MEDLINE | ID: mdl-26432000
ABSTRACT
Sumoylation is a multifunctional post-translation modification (PTM) in proteins by the small ubiquitin-related modifiers (SUMOs), which have relations to ubiquitin in molecular structure. Sumoylation has been found to be involved in some cellular processes. It is very significant to identify the exact sumoylation sites in proteins for not only basic researches but also drug developments. Comparing with time exhausting experiment methods, it is highly desired to develop computational methods for prediction of sumoylation sites as a complement to experiment in the post-genomic age. In this work, three feature constructions (AAIndex, position-specific amino acid propensity and modification of composition of k-space amino acid pairs) and five different combinations of them were used to construct features. At last, 178 features were selected as the optimal features according to the Mathew's correlation coefficient values in 10-fold cross validation based on linear discriminant analysis. In 10-fold cross-validation on the benchmark dataset, the accuracy and Mathew's correlation coefficient were 86.92% and 0.6845. Comparing with those existing predictors, SUMO_LDA showed its better performance.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Análisis de Secuencia de Proteína / Bases de Datos de Proteínas / Sumoilación Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Gene Año: 2016 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Análisis de Secuencia de Proteína / Bases de Datos de Proteínas / Sumoilación Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Gene Año: 2016 Tipo del documento: Article País de afiliación: China