Your browser doesn't support javascript.
loading
HNetGO: protein function prediction via heterogeneous network transformer.
Zhang, Xiaoshuai; Guo, Huannan; Zhang, Fan; Wang, Xuan; Wu, Kaitao; Qiu, Shizheng; Liu, Bo; Wang, Yadong; Hu, Yang; Li, Junyi.
Afiliação
  • Zhang X; School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong 518055, China.
  • Guo H; General Hospital of Heilongjiang Province Land Reclamation Bureau, Harbin 150086, China.
  • Zhang F; Center NHC Key Laboratory of Cell Transplantation, The First Affiliated Hospital of Harbin Medical University, Harbin 150086, China.
  • Wang X; School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong 518055, China.
  • Wu K; School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong 518055, China.
  • Qiu S; School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.
  • Liu B; School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.
  • Wang Y; School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong 518055, China.
  • Hu Y; School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.
  • Li J; School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China.
Brief Bioinform ; 24(6)2023 09 22.
Article em En | MEDLINE | ID: mdl-37861172
ABSTRACT
Protein function annotation is one of the most important research topics for revealing the essence of life at molecular level in the post-genome era. Current research shows that integrating multisource data can effectively improve the performance of protein function prediction models. However, the heavy reliance on complex feature engineering and model integration methods limits the development of existing methods. Besides, models based on deep learning only use labeled data in a certain dataset to extract sequence features, thus ignoring a large amount of existing unlabeled sequence data. Here, we propose an end-to-end protein function annotation model named HNetGO, which innovatively uses heterogeneous network to integrate protein sequence similarity and protein-protein interaction network information and combines the pretraining model to extract the semantic features of the protein sequence. In addition, we design an attention-based graph neural network model, which can effectively extract node-level features from heterogeneous networks and predict protein function by measuring the similarity between protein nodes and gene ontology term nodes. Comparative experiments on the human dataset show that HNetGO achieves state-of-the-art performance on cellular component and molecular function branches.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Mapas de Interação de Proteínas Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Mapas de Interação de Proteínas Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article