Your browser doesn't support javascript.
loading
A representation transfer learning approach for enhanced prediction of growth hormone binding proteins.
Yadav, Amisha; Sahu, Roopshikha; Nath, Abhigyan.
Afiliação
  • Yadav A; Department of Biochemistry, Pt. Jawahar Lal Nehru Memorial Medical College, Raipur 492001, India.
  • Sahu R; Department of Biochemistry, Pt. Jawahar Lal Nehru Memorial Medical College, Raipur 492001, India.
  • Nath A; Department of Biochemistry, Pt. Jawahar Lal Nehru Memorial Medical College, Raipur 492001, India. Electronic address: abhigyannath01@gmail.com.
Comput Biol Chem ; 87: 107274, 2020 May 05.
Article em En | MEDLINE | ID: mdl-32416563
Growth hormone binding proteins (GHBPs) are soluble proteins that play an important role in the modulation of signaling pathways pertaining to growth hormones. GHBPs are selective and bind non-covalently with growth hormones, but their functions are still not fully understood. Identification and characterization of GHBPs are the preliminary steps for understanding their roles in various cellular processes. As wet lab based experimental methods involve high cost and labor, computational methods can facilitate in narrowing down the search space of putative GHBPs. Performance of machine learning algorithms largely depends on the quality of features that it feeds on. Informative and non-redundant features generally result in enhanced performance and for this purpose feature selection algorithms are commonly used. In the present work, a novel representation transfer learning approach is presented for prediction of GHBPs. For their accurate prediction, deep autoencoder based features were extracted and subsequently SMO-PolyK classifier is trained. The prediction model is evaluated by both leave one out cross validation (LOOCV) and hold out independent testing set. On LOOCV, the prediction model achieved 89.8%% accuracy, with 89.4% sensitivity and 90.2% specificity and accuracy of 93.5%, sensitivity of 90.2% and specificity of 96.8% is attained on the hold out testing set. Further a comparison was made between the full set of sequence-based features, top performing sequence features extracted using feature selection algorithm, deep autoencoder based features and generalized low rank model based features on the prediction accuracy. Principal component analysis of the representative features along with t-sne visualization demonstrated the effectiveness of deep features in prediction of GHBPs. The present method is robust and accurate and may complement other wet lab based methods for identification of novel GHBPs.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Comput Biol Chem Assunto da revista: BIOLOGIA / INFORMATICA MEDICA / QUIMICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Índia

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Comput Biol Chem Assunto da revista: BIOLOGIA / INFORMATICA MEDICA / QUIMICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Índia