Your browser doesn't support javascript.
loading
PPSNO: A Feature-Rich SNO Sites Predictor by Stacking Ensemble Strategy from Protein Sequence-Derived Information.
Zhu, Lun; Wang, Liuyang; Yang, Zexi; Xu, Piao; Yang, Sen.
Affiliation
  • Zhu L; School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China.
  • Wang L; School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China.
  • Yang Z; School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China.
  • Xu P; College of Economics and Management, Nanjing Forestry University, Nanjing, 210037, China.
  • Yang S; School of Computer Science and Artificial Intelligence Aliyun School of Big Data School of Software, Changzhou University, Changzhou, 213164, China. ys@cczu.edu.cn.
Interdiscip Sci ; 16(1): 192-217, 2024 Mar.
Article in En | MEDLINE | ID: mdl-38206557
ABSTRACT
The protein S-nitrosylation (SNO) is a significant post-translational modification that affects the stability, activity, cellular localization, and function of proteins. Therefore, highly accurate prediction of SNO sites aids in grasping biological function mechanisms. In this document, we have constructed a predictor, named PPSNO, forecasting protein SNO sites using stacked integrated learning. PPSNO integrates multiple machine learning techniques into an ensemble model, enhancing its predictive accuracy. First, we established benchmark datasets by collecting SNO sites from various sources, including literature, databases, and other predictors. Second, various techniques for feature extraction are applied to derive characteristics from protein sequences, which are subsequently amalgamated into the PPSNO predictor for training. Five-fold cross-validation experiments show that PPSNO outperformed existing predictors, such as PSNO, PreSNO, pCysMod, DeepNitro, RecSNO, and Mul-SNO. The PPSNO predictor achieved an impressive accuracy of 92.8%, an area under the curve (AUC) of 96.1%, a Matthews correlation coefficient (MCC) of 81.3%, an F1-score of 85.6%, an SN of 79.3%, an SP of 97.7%, and an average precision (AP) of 92.2%. We also employed ROC curves, PR curves, and radar plots to show the superior performance of PPSNO. Our study shows that fused protein sequence features and two-layer stacked ensemble models can improve the accuracy of predicting SNO sites, which can aid in comprehending cellular processes and disease mechanisms. The codes and data are available at https//github.com/serendipity-wly/PPSNO .
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteins / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Interdiscip Sci Journal subject: BIOLOGIA Year: 2024 Document type: Article Affiliation country: China Country of publication: Germany

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteins / Machine Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Interdiscip Sci Journal subject: BIOLOGIA Year: 2024 Document type: Article Affiliation country: China Country of publication: Germany