GPSFun: geometry-aware protein sequence function predictions with language models.
Nucleic Acids Res
; 52(W1): W248-W255, 2024 Jul 05.
Article
em En
| MEDLINE
| ID: mdl-38738636
ABSTRACT
Knowledge of protein function is essential for elucidating disease mechanisms and discovering new drug targets. However, there is a widening gap between the exponential growth of protein sequences and their limited function annotations. In our prior studies, we have developed a series of methods including GraphPPIS, GraphSite, LMetalSite and SPROF-GO for protein function annotations at residue or protein level. To further enhance their applicability and performance, we now present GPSFun, a versatile web server for Geometry-aware Protein Sequence Function annotations, which equips our previous tools with language models and geometric deep learning. Specifically, GPSFun employs large language models to efficiently predict 3D conformations of the input protein sequences and extract informative sequence embeddings. Subsequently, geometric graph neural networks are utilized to capture the sequence and structure patterns in the protein graphs, facilitating various downstream predictions including protein-ligand binding sites, gene ontologies, subcellular locations and protein solubility. Notably, GPSFun achieves superior performance to state-of-the-art methods across diverse tasks without requiring multiple sequence alignments or experimental protein structures. GPSFun is freely available to all users at https//bio-web1.nscc-gz.cn/app/GPSFun with user-friendly interfaces and rich visualizations.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Software
/
Proteínas
Limite:
Humans
Idioma:
En
Revista:
Nucleic Acids Res
Ano de publicação:
2024
Tipo de documento:
Article
País de afiliação:
China