NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.

Pruitt, Kim D; Tatusova, Tatiana; Brown, Garth R; Maglott, Donna R

Pruitt, Kim D; Tatusova, Tatiana; Brown, Garth R; Maglott, Donna R.

Afiliação

Pruitt KD; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA. pruitt@ncbi.nlm.nih.gov

Nucleic Acids Res ; 40(Database issue): D130-5, 2012 Jan.

Article em En | MEDLINE | ID: mdl-22121212

ABSTRACT

ABSTRACT

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of genomic, transcript and protein sequence records. These records are selected and curated from public sequence archives and represent a significant reduction in redundancy compared to the volume of data archived by the International Nucleotide Sequence Database Collaboration. The database includes over 16,00 organisms, 2.4 × 0(6) genomic records, 13 × 10(6) proteins and 2 × 10(6) RNA records spanning prokaryotes, eukaryotes and viruses (RefSeq release 49, September 2011). The RefSeq database is maintained by a combined approach of automated analyses, collaboration and manual curation to generate an up-to-date representation of the sequence, its features, names and cross-links to related sources of information. We report here on recent growth, the status of curating the human RefSeq data set, more extensive feature annotation and current policy for eukaryotic genome annotation via the NCBI annotation pipeline. More information about the resource is available online (see http//www.ncbi.nlm.nih.gov/RefSeq/).

Assuntos

Bases de Dados Genéticas; Anotação de Sequência Molecular; Análise de Sequência/normas; Genômica/normas; Humanos; Padrões de Referência; Análise de Sequência de DNA/normas; Análise de Sequência de Proteína/normas; Análise de Sequência de RNA/normas

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise de Sequência / Bases de Dados Genéticas / Anotação de Sequência Molecular Limite: Humans Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2012 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google