Your browser doesn't support javascript.
loading
Prediction of serine phosphorylation sites mapping on Schizosaccharomyces Pombe by fusing three encoding schemes with the random forest classifier.
Tasmia, Samme Amena; Kibria, Md Kaderi; Tuly, Khanis Farhana; Islam, Md Ariful; Khatun, Mst Shamima; Hasan, Md Mehedi; Mollah, Md Nurul Haque.
Afiliação
  • Tasmia SA; Bioinformatics Laboratory, Department of Statistics, University of Rajshahi, Rajshahi, 6205, Bangladesh.
  • Kibria MK; Bioinformatics Laboratory, Department of Statistics, University of Rajshahi, Rajshahi, 6205, Bangladesh.
  • Tuly KF; Bioinformatics Laboratory, Department of Statistics, University of Rajshahi, Rajshahi, 6205, Bangladesh.
  • Islam MA; Bioinformatics Laboratory, Department of Statistics, University of Rajshahi, Rajshahi, 6205, Bangladesh.
  • Khatun MS; Department of Microbiology and Immunology, Tulane University School of Medicine, Tulane University, New Orleans, LA, 70112, USA.
  • Hasan MM; Tulane Center for Biomedical Informatics and Genomics, Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA.
  • Mollah MNH; Bioinformatics Laboratory, Department of Statistics, University of Rajshahi, Rajshahi, 6205, Bangladesh. mollah.stat.bio@ru.ac.bd.
Sci Rep ; 12(1): 2632, 2022 02 16.
Article em En | MEDLINE | ID: mdl-35173235
Serine phosphorylation is one type of protein post-translational modifications (PTMs), which plays an essential role in various cellular processes and disease pathogenesis. Numerous methods are used for the prediction of phosphorylation sites. However, the traditional wet-lab based experimental approaches are time-consuming, laborious, and expensive. In this work, a computational predictor was proposed to predict serine phosphorylation sites mapping on Schizosaccharomyces pombe (SP) by the fusion of three encoding schemes namely k-spaced amino acid pair composition (CKSAAP), binary and amino acid composition (AAC) with the random forest (RF) classifier. So far, the proposed method is firstly developed to predict serine phosphorylation sites for SP. Both the training and independent test performance scores were used to investigate the success of the proposed RF based fusion prediction model compared to others. We also investigated their performances by 5-fold cross-validation (CV). In all cases, it was observed that the recommended predictor achieves the largest scores of true positive rate (TPR), true negative rate (TNR), accuracy (ACC), Mathew coefficient of correlation (MCC), Area under the ROC curve (AUC) and pAUC (partial AUC) at false positive rate (FPR) = 0.20. Thus, the prediction performance as discussed in this paper indicates that the proposed approach may be a beneficial and motivating computational resource for predicting serine phosphorylation sites in the case of Fungi. The online interface of the software for the proposed prediction model is publicly available at http://mollah-bioinformaticslab-stat.ru.ac.bd/PredSPS/ .
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Schizosaccharomyces / Serina / Processamento de Proteína Pós-Traducional / Biologia Computacional Tipo de estudo: Clinical_trials / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Sci Rep Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Bangladesh

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Schizosaccharomyces / Serina / Processamento de Proteína Pós-Traducional / Biologia Computacional Tipo de estudo: Clinical_trials / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Sci Rep Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Bangladesh