Your browser doesn't support javascript.
loading
AraPathogen2.0: An Improved Prediction of Plant-Pathogen Protein-Protein Interactions Empowered by the Natural Language Processing Technique.
Lei, Chenping; Zhou, Kewei; Zheng, Jingyan; Zhao, Miao; Huang, Yan; He, Huaqin; Yang, Shiping; Zhang, Ziding.
Afiliação
  • Lei C; State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
  • Zhou K; State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
  • Zheng J; State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
  • Zhao M; State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
  • Huang Y; State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
  • He H; College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China.
  • Yang S; State Key Laboratory of Plant Environmental Resilience, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
  • Zhang Z; State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China.
J Proteome Res ; 23(1): 494-499, 2024 01 05.
Article em En | MEDLINE | ID: mdl-38069805
Plant-pathogen protein-protein interactions (PPIs) play crucial roles in the arm race between plants and pathogens. Therefore, the identification of these interspecies PPIs is very important for the mechanistic understanding of pathogen infection and plant immunity. Computational prediction methods can complement experimental efforts, but their predictive performance still needs to be improved. Motivated by the rapid development of natural language processing and its successful applications in the field of protein bioinformatics, here we present an improved XGBoost-based plant-pathogen PPI predictor (i.e., AraPathogen2.0), in which sequence encodings from the pretrained protein language model ESM2 and Arabidopsis PPI network-related node representations from the graph embedding technique struc2vec are used as input. Stringent benchmark experiments showed that AraPathogen2.0 could achieve a better performance than its precedent version, especially for processing the test data set with novel proteins unseen in the training data.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Arabidopsis / Mapeamento de Interação de Proteínas Idioma: En Revista: J Proteome Res Assunto da revista: BIOQUIMICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Arabidopsis / Mapeamento de Interação de Proteínas Idioma: En Revista: J Proteome Res Assunto da revista: BIOQUIMICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China