NMPFamsDB: a database of novel protein families from microbial metagenomes and metatranscriptomes.
Nucleic Acids Res
; 52(D1): D502-D512, 2024 Jan 05.
Article
en En
| MEDLINE
| ID: mdl-37811892
ABSTRACT
The Novel Metagenome Protein Families Database (NMPFamsDB) is a database of metagenome- and metatranscriptome-derived protein families, whose members have no hits to proteins of reference genomes or Pfam domains. Each protein family is accompanied by multiple sequence alignments, Hidden Markov Models, taxonomic information, ecosystem and geolocation metadata, sequence and structure predictions, as well as 3D structure models predicted with AlphaFold2. In its current version, NMPFamsDB hosts over 100 000 protein families, each with at least 100 members. The reported protein families significantly expand (more than double) the number of known protein sequence clusters from reference genomes and reveal new insights into their habitat distribution, origins, functions and taxonomy. We expect NMPFamsDB to be a valuable resource for microbial proteome-wide analyses and for further discovery and characterization of novel functions. NMPFamsDB is publicly available in http//www.nmpfamsdb.org/ or https//bib.fleming.gr/NMPFamsDB.
Texto completo:
1
Banco de datos:
MEDLINE
Asunto principal:
Proteínas
/
Bases de Datos de Proteínas
/
Metagenoma
Tipo de estudio:
Prognostic_studies
Idioma:
En
Año:
2024
Tipo del documento:
Article