PathFams: statistical detection of pathogen-associated protein domains.
BMC Genomics
; 22(1): 663, 2021 Sep 14.
Article
em En
| MEDLINE
| ID: mdl-34521345
ABSTRACT
BACKGROUND:
A substantial fraction of genes identified within bacterial genomes encode proteins of unknown function. Identifying which of these proteins represent potential virulence factors, and mapping their key virulence determinants, is a challenging but important goal.RESULTS:
To facilitate virulence factor discovery, we performed a comprehensive analysis of 17,929 protein domain families within the Pfam database, and scored them based on their overrepresentation in pathogenic versus non-pathogenic species, taxonomic distribution, relative abundance in metagenomic datasets, and other factors.CONCLUSIONS:
We identify pathogen-associated domain families, candidate virulence factors in the human gut, and eukaryotic-like mimicry domains with likely roles in virulence. Furthermore, we provide an interactive database called PathFams to allow users to explore pathogen-associated domains as well as identify pathogen-associated domains and domain architectures in user-uploaded sequences of interest. PathFams is freely available at https//pathfams.uwaterloo.ca .Palavras-chave
Texto completo:
1
Base de dados:
MEDLINE
Assunto principal:
Fatores de Virulência
/
Metagenômica
Tipo de estudo:
Diagnostic_studies
/
Prognostic_studies
/
Risk_factors_studies
Limite:
Humans
Idioma:
En
Ano de publicação:
2021
Tipo de documento:
Article