Your browser doesn't support javascript.
loading
GPSFun: geometry-aware protein sequence function predictions with language models.
Yuan, Qianmu; Tian, Chong; Song, Yidong; Ou, Peihua; Zhu, Mingming; Zhao, Huiying; Yang, Yuedong.
  • Yuan Q; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
  • Tian C; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
  • Song Y; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
  • Ou P; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
  • Zhu M; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
  • Zhao H; Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
  • Yang Y; School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510000, China.
Nucleic Acids Res ; 52(W1): W248-W255, 2024 Jul 05.
Article en En | MEDLINE | ID: mdl-38738636
ABSTRACT
Knowledge of protein function is essential for elucidating disease mechanisms and discovering new drug targets. However, there is a widening gap between the exponential growth of protein sequences and their limited function annotations. In our prior studies, we have developed a series of methods including GraphPPIS, GraphSite, LMetalSite and SPROF-GO for protein function annotations at residue or protein level. To further enhance their applicability and performance, we now present GPSFun, a versatile web server for Geometry-aware Protein Sequence Function annotations, which equips our previous tools with language models and geometric deep learning. Specifically, GPSFun employs large language models to efficiently predict 3D conformations of the input protein sequences and extract informative sequence embeddings. Subsequently, geometric graph neural networks are utilized to capture the sequence and structure patterns in the protein graphs, facilitating various downstream predictions including protein-ligand binding sites, gene ontologies, subcellular locations and protein solubility. Notably, GPSFun achieves superior performance to state-of-the-art methods across diverse tasks without requiring multiple sequence alignments or experimental protein structures. GPSFun is freely available to all users at https//bio-web1.nscc-gz.cn/app/GPSFun with user-friendly interfaces and rich visualizations.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Proteínas Límite: Humans Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Programas Informáticos / Proteínas Límite: Humans Idioma: En Año: 2024 Tipo del documento: Article