Your browser doesn't support javascript.
loading
DeepGraphGO: graph neural network for large-scale, multispecies protein function prediction.
You, Ronghui; Yao, Shuwei; Mamitsuka, Hiroshi; Zhu, Shanfeng.
Afiliación
  • You R; School of Computer Science, Fudan University, Shanghai 200433, China.
  • Yao S; School of Computer Science, Fudan University, Shanghai 200433, China.
  • Mamitsuka H; Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto Prefecture 611-0011, Japan.
  • Zhu S; Department of Computer Science, Aalto University, Espoo, Finland.
Bioinformatics ; 37(Suppl_1): i262-i271, 2021 07 12.
Article en En | MEDLINE | ID: mdl-34252926
ABSTRACT
MOTIVATION Automated function prediction (AFP) of proteins is a large-scale multi-label classification problem. Two limitations of most network-based methods for AFP are (i) a single model must be trained for each species and (ii) protein sequence information is totally ignored. These limitations cause weaker performance than sequence-based methods. Thus, the challenge is how to develop a powerful network-based method for AFP to overcome these limitations.

RESULTS:

We propose DeepGraphGO, an end-to-end, multispecies graph neural network-based method for AFP, which makes the most of both protein sequence and high-order protein network information. Our multispecies strategy allows one single model to be trained for all species, indicating a larger number of training samples than existing methods. Extensive experiments with a large-scale dataset show that DeepGraphGO outperforms a number of competing state-of-the-art methods significantly, including DeepGOPlus and three representative network-based

methods:

GeneMANIA, deepNF and clusDCA. We further confirm the effectiveness of our multispecies strategy and the advantage of DeepGraphGO over so-called difficult proteins. Finally, we integrate DeepGraphGO into the state-of-the-art ensemble method, NetGO, as a component and achieve a further performance improvement. AVAILABILITY AND IMPLEMENTATION https//github.com/yourh/DeepGraphGO. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Proteínas / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Proteínas / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: China