AnnoPRO: a strategy for protein function annotation based on multi-scale protein representation and a hybrid deep learning of dual-path encoding.
Genome Biol
; 25(1): 41, 2024 02 01.
Article
en En
| MEDLINE
| ID: mdl-38303023
ABSTRACT
Protein function annotation has been one of the longstanding issues in biological sciences, and various computational methods have been developed. However, the existing methods suffer from a serious long-tail problem, with a large number of GO families containing few annotated proteins. Herein, an innovative strategy named AnnoPRO was therefore constructed by enabling sequence-based multi-scale protein representation, dual-path protein encoding using pre-training, and function annotation by long short-term memory-based decoding. A variety of case studies based on different benchmarks were conducted, which confirmed the superior performance of AnnoPRO among available methods. Source code and models have been made freely available at https//github.com/idrblab/AnnoPRO and https//zenodo.org/records/10012272.
Palabras clave
Texto completo:
1
Colección:
01-internacional
Banco de datos:
MEDLINE
Asunto principal:
Aprendizaje Profundo
Límite:
Humans
Idioma:
En
Revista:
Genome Biol
Asunto de la revista:
BIOLOGIA MOLECULAR
/
GENETICA
Año:
2024
Tipo del documento:
Article
País de afiliación:
China