Your browser doesn't support javascript.
loading
The M5nr: a novel non-redundant database containing protein sequences and annotations from multiple sources and associated tools.
Wilke, Andreas; Harrison, Travis; Wilkening, Jared; Field, Dawn; Glass, Elizabeth M; Kyrpides, Nikos; Mavrommatis, Konstantinos; Meyer, Folker.
Afiliação
  • Wilke A; Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA.
BMC Bioinformatics ; 13: 141, 2012 Jun 21.
Article em En | MEDLINE | ID: mdl-22720753
ABSTRACT

BACKGROUND:

Computing of sequence similarity results is becoming a limiting factor in metagenome analysis. Sequence similarity search results encoded in an open, exchangeable format have the potential to limit the needs for computational reanalysis of these data sets. A prerequisite for sharing of similarity results is a common reference. DESCRIPTION We introduce a mechanism for automatically maintaining a comprehensive, non-redundant protein database and for creating a quarterly release of this resource. In addition, we present tools for translating similarity searches into many annotation namespaces, e.g. KEGG or NCBI's GenBank.

CONCLUSIONS:

The data and tools we present allow the creation of multiple result sets using a single computation, permitting computational results to be shared between groups for large sequence data sets.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Bases de Dados de Proteínas Tipo de estudo: Risk_factors_studies Idioma: En Ano de publicação: 2012 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Bases de Dados de Proteínas Tipo de estudo: Risk_factors_studies Idioma: En Ano de publicação: 2012 Tipo de documento: Article