Your browser doesn't support javascript.
loading
A Transfer-Learning-Based Deep Convolutional Neural Network for Predicting Leukemia-Related Phosphorylation Sites from Protein Primary Sequences.
He, Jian; Wu, Yanling; Pu, Xuemei; Li, Menglong; Guo, Yanzhi.
Afiliación
  • He J; College of Chemistry, Sichuan University, Chengdu 610064, China.
  • Wu Y; College of Chemistry, Sichuan University, Chengdu 610064, China.
  • Pu X; College of Chemistry, Sichuan University, Chengdu 610064, China.
  • Li M; College of Chemistry, Sichuan University, Chengdu 610064, China.
  • Guo Y; College of Chemistry, Sichuan University, Chengdu 610064, China.
Int J Mol Sci ; 23(3)2022 Feb 03.
Article en En | MEDLINE | ID: mdl-35163663
ABSTRACT
As one of the most important post-translational modifications (PTMs), phosphorylation refers to the binding of a phosphate group with amino acid residues like Ser (S), Thr (T) and Tyr (Y) thus resulting in diverse functions at the molecular level. Abnormal phosphorylation has been proved to be closely related with human diseases. To our knowledge, no research has been reported describing specific disease-associated phosphorylation sites prediction which is of great significance for comprehensive understanding of disease mechanism. In this work, focusing on three types of leukemia, we aim to develop a reliable leukemia-related phosphorylation site prediction models by combing deep convolutional neural network (CNN) with transfer-learning. CNN could automatically discover complex representations of phosphorylation patterns from the raw sequences, and hence it provides a powerful tool for improvement of leukemia-related phosphorylation site prediction. With the largest dataset of myelogenous leukemia, the optimal models for S/T/Y phosphorylation sites give the AUC values of 0.8784, 0.8328 and 0.7716 respectively. When transferred learning on the small size datasets, the models for T-cell and lymphoid leukemia also give the promising performance by common sharing the optimal parameters. Compared with other five machine-learning methods, our CNN models reveal the superior performance. Finally, the leukemia-related pathogenesis analysis and distribution analysis on phosphorylated proteins along with K-means clustering analysis and position-specific conversation profiles on the phosphorylation site all indicate the strong practical feasibility of our easy-to-use CNN models.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Leucemia / Redes Neurales de la Computación / Aprendizaje Profundo Tipo de estudio: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Int J Mol Sci Año: 2022 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Leucemia / Redes Neurales de la Computación / Aprendizaje Profundo Tipo de estudio: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Int J Mol Sci Año: 2022 Tipo del documento: Article País de afiliación: China