KbhbXG: A Machine learning architecture based on XGBoost for prediction of lysine β-Hydroxybutyrylation (Kbhb) modification sites.
Chen, Leqi; Liu, Liwen; Su, Haiyan; Xu, Yan.
Affiliation
  • Chen L; Department of Statistics, University of Science and Technology Beijing, Beijing 100083, China.
  • Liu L; The Open University of China, Beijing 100039, China.
  • Su H; School of Computing, Montclair State University, NJ 07043, USA.
  • Xu Y; Department of Statistics, University of Science and Technology Beijing, Beijing 100083, China. Electronic address: xuyan@ustb.edu.cn.
Methods; 227: 27-34, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38679187
ABSTRACT
Lysine β-hydroxybutyrylation is an important post-translational modification (PTM) involved in various physiological and biological processes. In this research, we introduce a novel predictor, KbhbXG, which uses XGBoost to identify β-hydroxybutyrylation modification sites based on protein sequence information. The traditional experimental methods for identifying β-hydroxybutyrylated sites with proteomic techniques are both costly and time-consuming. Thus, computational methods and predictors can play a crucial role in the rapid identification of β-hydroxybutyrylation sites. Our proposed KbhbXG model first utilizes the machine learning algorithm XGBoost to predict β-hydroxybutyrylation modification sites. On the independent test set, KbhbXG achieves an accuracy of 0.7457, a specificity of 0.7771, and an area under the curve (AUC) score of 0.8172. The high AUC score demonstrates the method's potential for effectively identifying novel β-hydroxybutyrylation sites, thereby facilitating further research into the β-hydroxybutyrylation process. In addition, functional analyses revealed that different organisms preferentially engage in distinct biological processes and pathways, which can provide valuable insights into the mechanism of β-hydroxybutyrylation and guide experimental verification. To promote transparency and reproducibility, we have made both the code and the dataset of KbhbXG publicly available. Researchers interested in using the proposed model can access these resources at https://github.com/Lab-Xu/KbhbXG.
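The three metrics reported in the abstract (accuracy, specificity, and AUC) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the toy labels and scores, and the 0.5 decision threshold are all assumptions chosen for illustration, and the numbers it produces are unrelated to the KbhbXG results.

```python
# Illustrative only: toy data and helper names are hypothetical,
# not taken from the KbhbXG repository.

def accuracy(y_true, y_pred):
    """Fraction of sites whose predicted label matches the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)

def specificity(y_true, y_pred):
    """True negatives divided by all actual negatives (TN / (TN + FP))."""
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tn / (tn + fp)

def auc(y_true, scores):
    """Probability that a random positive outscores a random negative.

    Ties count as half a win; this equals the area under the ROC curve.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Toy example: 1 = modified lysine site, 0 = unmodified site.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]  # hypothetical classifier scores
y_pred = [1 if s >= 0.5 else 0 for s in scores]    # assumed threshold of 0.5

print(accuracy(y_true, y_pred))     # 0.75
print(specificity(y_true, y_pred))  # 0.75
print(auc(y_true, scores))          # 0.9375
```

Accuracy and specificity depend on the chosen threshold, whereas AUC is computed from the raw scores and summarizes ranking quality across all thresholds.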

Full text: 1 Database: MEDLINE Main subject: Protein Processing, Post-Translational / Machine Learning / Lysine Limits: Humans Language: En Publication year: 2024 Document type: Article
