Pesquisa | BVS Integralidade em Saúde

1.

GenBank 2024 Update.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Sherry, Stephen T; Yankie, Linda; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 52(D1): D134-D137, 2024 Jan 05.

Artigo em Inglês | MEDLINE | ID: mdl-37889039

RESUMO

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 25 trillion base pairs from over 3.7 billion nucleotide sequences for 557 000 formally described species. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. Recent updates include policies for including spatio-temporal metadata, clarified documentation for GenBank data processing, enhanced foreign contamination screening tools, new processes in the Submission Portal, migration of Entrez Genome and Assembly displays into NCBI Datasets, and the impending retirement of tbl2asn, replaced by table2asn.

Assuntos

Bases de Dados de Ácidos Nucleicos , Genômica , Sequência de Bases , Internet , Humanos

2.

GenBank 2023 update.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Sherry, Stephen T; Yankie, Linda; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 51(D1): D141-D144, 2023 01 06.

Artigo em Inglês | MEDLINE | ID: mdl-36350640

RESUMO

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 19.6 trillion base pairs from over 2.9 billion nucleotide sequences for 504 000 formally described species. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. Recent updates include resources for data from the SARS-CoV-2 virus, NCBI Datasets, BLAST ClusteredNR, the Submission Portal, table2asn, a Foreign Contamination Screening tool and BioSample.

Assuntos

Bases de Dados de Ácidos Nucleicos , Humanos , COVID-19/genética , Genômica , SARS-CoV-2/genética

3.

GenBank.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Schoch, Conrad L; Sherry, Stephen T; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 50(D1): D161-D164, 2022 01 07.

Artigo em Inglês | MEDLINE | ID: mdl-34850943

RESUMO

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 15.3 trillion base pairs from over 2.5 billion nucleotide sequences for 504 000 formally described species. Recent updates include resources for data from the SARS-CoV-2 virus, including a SARS-CoV-2 landing page, NCBI Datasets, NCBI Virus and the Submission Portal. We also discuss upcoming changes to GI identifiers, a new data management interface for BioProject, and advice for providing contextual metadata in submissions.

Assuntos

Bases de Dados de Ácidos Nucleicos , Vírus/genética , Genoma Viral , National Library of Medicine (U.S.) , SARS-CoV-2/genética , Estados Unidos , Interface Usuário-Computador

4.

The international nucleotide sequence database collaboration.

Arita, Masanori; Karsch-Mizrachi, Ilene; Cochrane, Guy.

Nucleic Acids Res ; 49(D1): D121-D124, 2021 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-33166387

RESUMO

The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org/) has been the core infrastructure for collecting and providing nucleotide sequence data and metadata for >30 years. Three partner organizations, the DNA Data Bank of Japan (DDBJ) at the National Institute of Genetics in Mishima, Japan; the European Nucleotide Archive (ENA) at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) in Hinxton, UK; and GenBank at National Center for Biotechnology Information (NCBI), National Library of Medicine, National Institutes of Health in Bethesda, Maryland, USA have been collaboratively maintaining the INSDC for the benefit of not only science but all types of community worldwide.

Assuntos

Bases de Dados de Ácidos Nucleicos , Metadados/estatística & dados numéricos , Nucleotídeos/genética , Análise de Sequência de DNA/estatística & dados numéricos , Análise de Sequência de RNA/estatística & dados numéricos , Academias e Institutos , Sequência de Bases , Europa (Continente) , Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Humanos , Cooperação Internacional , Japão , Nucleotídeos/metabolismo , Estados Unidos

5.

GenBank.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Schoch, Conrad L; Sherry, Stephen T; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 49(D1): D92-D96, 2021 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-33196830

RESUMO

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 9.9 trillion base pairs from over 2.1 billion nucleotide sequences for 478 000 formally described species. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. Recent updates include new resources for data from the SARS-CoV-2 virus, updates to the NCBI Submission Portal and associated submission wizards for dengue and SARS-CoV-2 viruses, new taxonomy queries for viruses and prokaryotes, and simplified submission processes for EST and GSS sequences.

Assuntos

Biologia Computacional/estatística & dados numéricos , Bases de Dados de Ácidos Nucleicos , Genômica/métodos , SARS-CoV-2/genética , Análise de Sequência de DNA/métodos , Animais , COVID-19/epidemiologia , COVID-19/virologia , Biologia Computacional/métodos , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Anotação de Sequência Molecular/métodos , Pandemias

6.

GenBank.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Ostell, James; Pruitt, Kim D; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 48(D1): D84-D86, 2020 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-31665464

RESUMO

GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains over 6.25 trillion base pairs from over 1.6 billion nucleotide sequences for 450 000 formally described species. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. Recent updates include a new version of Genome Workbench that supports GenBank submissions, new submission wizards for viral genomes, enhancements to BankIt and improved handling of taxonomy for sequences from pathogens.

Assuntos

Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , Genômica/métodos , Software , Anotação de Sequência Molecular , National Institutes of Health (U.S.) , Estados Unidos , Navegador

7.

Ribovore: ribosomal RNA sequence analysis for GenBank submissions and database curation.

Schäffer, Alejandro A; McVeigh, Richard; Robbertse, Barbara; Schoch, Conrad L; Johnston, Anjanette; Underwood, Beverly A; Karsch-Mizrachi, Ilene; Nawrocki, Eric P.

BMC Bioinformatics ; 22(1): 400, 2021 Aug 12.

Artigo em Inglês | MEDLINE | ID: mdl-34384346

RESUMO

BACKGROUND: The DNA sequences encoding ribosomal RNA genes (rRNAs) are commonly used as markers to identify species, including in metagenomics samples that may combine many organismal communities. The 16S small subunit ribosomal RNA (SSU rRNA) gene is typically used to identify bacterial and archaeal species. The nuclear 18S SSU rRNA gene, and 28S large subunit (LSU) rRNA gene have been used as DNA barcodes and for phylogenetic studies in different eukaryote taxonomic groups. Because of their popularity, the National Center for Biotechnology Information (NCBI) receives a disproportionate number of rRNA sequence submissions and BLAST queries. These sequences vary in quality, length, origin (nuclear, mitochondria, plastid), and organism source and can represent any region of the ribosomal cistron. RESULTS: To improve the timely verification of quality, origin and loci boundaries, we developed Ribovore, a software package for sequence analysis of rRNA sequences. The ribotyper and ribosensor programs are used to validate incoming sequences of bacterial and archaeal SSU rRNA. The ribodbmaker program is used to create high-quality datasets of rRNAs from different taxonomic groups. Key algorithmic steps include comparing candidate sequences against rRNA sequence profile hidden Markov models (HMMs) and covariance models of rRNA sequence and secondary-structure conservation, as well as other tests. Nine freely available blastn rRNA databases created and maintained with Ribovore are used for checking incoming GenBank submissions and used by the blastn browser interface at NCBI. Since 2018, Ribovore has been used to analyze more than 50 million prokaryotic SSU rRNA sequences submitted to GenBank, and to select at least 10,435 fungal rRNA RefSeq records from type material of 8350 taxa. CONCLUSION: Ribovore combines single-sequence and profile-based methods to improve GenBank processing and analysis of rRNA sequences. It is a standalone, portable, and extensible software package for the alignment, classification and validation of rRNA sequences. Researchers planning on submitting SSU rRNA sequences to GenBank are encouraged to download and use Ribovore to analyze their sequences prior to submission to determine which sequences are likely to be automatically accepted into GenBank.

Assuntos

Bases de Dados de Ácidos Nucleicos , RNA Ribossômico , DNA Ribossômico , Filogenia , RNA Ribossômico 16S/genética , RNA Ribossômico 18S/genética , Análise de Sequência de RNA

8.

GenBank.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Ostell, James; Pruitt, Kim D; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 47(D1): D94-D99, 2019 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-30365038

RESUMO

GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for 420 000 formally described species. Most GenBank submissions are made using BankIt, the NCBI Submission Portal, or the tool tbl2asn, and are obtained from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Nucleotide database, which links to related information such as taxonomy, genomes, protein sequences and structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. Recent updates include an expansion of sequence identifier formats to accommodate expected database growth, submission wizards for ribosomal RNA, and the transfer of Expressed Sequence Tag (EST) and Genome Survey Sequence (GSS) data into the Nucleotide database.

Assuntos

Bases de Dados de Ácidos Nucleicos , Navegador , Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos/tendências , Genômica/métodos , Humanos , Armazenamento e Recuperação da Informação , Design de Software

9.

VADR: validation and annotation of virus sequence submissions to GenBank.

Schäffer, Alejandro A; Hatcher, Eneida L; Yankie, Linda; Shonkwiler, Lara; Brister, J Rodney; Karsch-Mizrachi, Ilene; Nawrocki, Eric P.

BMC Bioinformatics ; 21(1): 211, 2020 May 24.

Artigo em Inglês | MEDLINE | ID: mdl-32448124

RESUMO

BACKGROUND: GenBank contains over 3 million viral sequences. The National Center for Biotechnology Information (NCBI) previously made available a tool for validating and annotating influenza virus sequences that is used to check submissions to GenBank. Before this project, there was no analogous tool in use for non-influenza viral sequence submissions. RESULTS: We developed a system called VADR (Viral Annotation DefineR) that validates and annotates viral sequences in GenBank submissions. The annotation system is based on the analysis of the input nucleotide sequence using models built from curated RefSeqs. Hidden Markov models are used to classify sequences by determining the RefSeq they are most similar to, and feature annotation from the RefSeq is mapped based on a nucleotide alignment of the full sequence to a covariance model. Predicted proteins encoded by the sequence are validated with nucleotide-to-protein alignments using BLAST. The system identifies 43 types of "alerts" that (unlike the previous BLAST-based system) provide deterministic and rigorous feedback to researchers who submit sequences with unexpected characteristics. VADR has been integrated into GenBank's submission processing pipeline allowing for viral submissions passing all tests to be accepted and annotated automatically, without the need for any human (GenBank indexer) intervention. Unlike the previous submission-checking system, VADR is freely available (https://github.com/nawrockie/vadr) for local installation and use. VADR has been used for Norovirus submissions since May 2018 and for Dengue virus submissions since January 2019. Since March 2020, VADR has also been used to check SARS-CoV-2 sequence submissions. Other viruses with high numbers of submissions will be added incrementally. CONCLUSION: VADR improves the speed with which non-flu virus submissions to GenBank can be checked and improves the content and quality of the GenBank annotations. The availability and portability of the software allow researchers to run the GenBank checks prior to submitting their viral sequences, and thereby gain confidence that their submissions will be accepted immediately without the need to correspond with GenBank staff. Reciprocally, the adoption of VADR frees GenBank staff to spend more time on services other than checking routine viral sequence submissions.

Assuntos

Betacoronavirus , Infecções por Coronavirus , Bases de Dados de Ácidos Nucleicos , Anotação de Sequência Molecular , Pandemias , Pneumonia Viral , Software , Betacoronavirus/genética , COVID-19 , Infecções por Coronavirus/genética , Vírus de DNA , Genômica , Humanos , Anotação de Sequência Molecular/normas , Pneumonia Viral/genética , SARS-CoV-2 , Vírus

10.

The international nucleotide sequence database collaboration.

Karsch-Mizrachi, Ilene; Takagi, Toshihisa; Cochrane, Guy.

Nucleic Acids Res ; 46(D1): D48-D51, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-29190397

RESUMO

For more than 30 years, the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org/) has been committed to capturing, preserving and providing access to comprehensive public domain nucleotide sequence and associated metadata which enables discovery in biomedicine, biodiversity and biological sciences. Since 1987, the DNA Data Bank of Japan (DDBJ) at the National Institute for Genetics in Mishima, Japan; the European Nucleotide Archive (ENA) at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) in Hinxton, UK; and GenBank at National Center for Biotechnology Information (NCBI), National Library of Medicine, National Institutes of Health in Bethesda, Maryland, USA have worked collaboratively to enable access to nucleotide sequence data in standardized formats for the worldwide scientific community. In this article, we reiterate the principles of the INSDC collaboration and briefly summarize the trends of the archival content.

Assuntos

Bases de Dados de Ácidos Nucleicos , Animais , Classificação , Biologia Computacional , Bases de Dados Factuais , Bases de Dados de Ácidos Nucleicos/tendências , Europa (Continente) , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Cooperação Internacional , Japão , National Library of Medicine (U.S.) , Estados Unidos

11.

GenBank.

Benson, Dennis A; Cavanaugh, Mark; Clark, Karen; Karsch-Mizrachi, Ilene; Ostell, James; Pruitt, Kim D; Sayers, Eric W.

Nucleic Acids Res ; 46(D1): D41-D47, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-29140468

RESUMO

GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for 400 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun and environmental sampling projects. Most submissions are made using BankIt, the National Center for Biotechnology Information (NCBI) Submission Portal, or the tool tbl2asn. GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the NCBI Nucleotide database, which links to related information such as taxonomy, genomes, protein sequences and structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. Recent updates include changes to sequence identifiers, submission wizards for 16S and Influenza sequences, and an Identical Protein Groups resource.

Assuntos

Bases de Dados de Ácidos Nucleicos , Animais , Biologia Computacional , Bases de Dados de Ácidos Nucleicos/estatística & dados numéricos , Bases de Dados de Ácidos Nucleicos/tendências , Europa (Continente) , Genômica , Humanos , Disseminação de Informação , Armazenamento e Recuperação da Informação , Internet , Japão , National Library of Medicine (U.S.) , Orthomyxoviridae/genética , Proteômica , RNA Ribossômico/genética , Alinhamento de Sequência , Estados Unidos

12.

VecScreen_plus_taxonomy: imposing a tax(onomy) increase on vector contamination screening.

Schäffer, Alejandro A; Nawrocki, Eric P; Choi, Yoon; Kitts, Paul A; Karsch-Mizrachi, Ilene; McVeigh, Richard.

Bioinformatics ; 34(5): 755-759, 2018 03 01.

Artigo em Inglês | MEDLINE | ID: mdl-29069347

RESUMO

Motivation: Nucleic acid sequences in public databases should not contain vector contamination, but many sequences in GenBank do (or did) contain vectors. The National Center for Biotechnology Information uses the program VecScreen to screen submitted sequences for contamination. Additional tools are needed to distinguish true-positive (contamination) from false-positive (not contamination) VecScreen matches. Results: A principal reason for false-positive VecScreen matches is that the sequence and the matching vector subsequence originate from closely related or identical organisms (for example, both originate in Escherichia coli). We collected information on the taxonomy of sources of vector segments in the UniVec database used by VecScreen. We used that information in two overlapping software pipelines for retrospective analysis of contamination in GenBank and for prospective analysis of contamination in new sequence submissions. Using the retrospective pipeline, we identified and corrected over 8000 contaminated sequences in the nonredundant nucleotide database. The prospective analysis pipeline has been in production use since April 2017 to evaluate some new GenBank submissions. Availability and implementation: Data on the sources of UniVec entries were included in release 10.0 (ftp://ftp.ncbi.nih.gov/pub/UniVec/). The main software is freely available at https://github.com/aaschaffer/vecscreen_plus_taxonomy. Contact: aschaffe@helix.nih.gov. Supplementary information: Supplementary data are available at Bioinformatics online.

Assuntos

Bases de Dados de Ácidos Nucleicos/normas , Análise de Sequência de DNA/métodos , Software , Bactérias , Eucariotos

13.

GenBank.

Benson, Dennis A; Cavanaugh, Mark; Clark, Karen; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W.

Nucleic Acids Res ; 45(D1): D37-D42, 2017 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-27899564

RESUMO

GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for 370 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or the NCBI Submission Portal. GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Nucleotide database, which links to related information such as taxonomy, genomes, protein sequences and structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. Recent updates include changes to policies regarding sequence identifiers, an improved 16S submission wizard, targeted loci studies, the ability to submit methylation and BioNano mapping files, and a database of anti-microbial resistance genes.

Assuntos

Bases de Dados de Ácidos Nucleicos , Análise de Sequência de DNA , Animais , Metilação de DNA , Genoma Bacteriano , Genômica , Humanos , RNA Ribossômico 16S/genética , beta-Lactamases/genética

14.

The International Nucleotide Sequence Database Collaboration.

Cochrane, Guy; Karsch-Mizrachi, Ilene; Takagi, Toshihisa.

Nucleic Acids Res ; 44(D1): D48-50, 2016 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-26657633

RESUMO

The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org) comprises three global partners committed to capturing, preserving and providing comprehensive public-domain nucleotide sequence information. The INSDC establishes standards, formats and protocols for data and metadata to make it easier for individuals and organisations to submit their nucleotide data reliably to public archives. This work enables the continuous, global exchange of information about living things. Here we present an update of the INSDC in 2015, including data growth and diversification, new standards and requirements by publishers for authors to submit their data to the public archives. The INSDC serves as a model for data sharing in the life sciences.

Assuntos

Bases de Dados de Ácidos Nucleicos , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Comportamento Cooperativo , Bases de Dados de Ácidos Nucleicos/normas , Políticas

15.

GenBank.

Clark, Karen; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W.

Nucleic Acids Res ; 44(D1): D67-72, 2016 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-26590407

RESUMO

GenBank(®) (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for over 340 000 formally described species. Recent developments include a new starting page for submitters, a shift toward using accession.version identifiers rather than GI numbers, a wizard for submitting 16S rRNA sequences, and an Identical Protein Report to address growing issues of data redundancy. GenBank organizes the sequence data received from individual laboratories and large-scale sequencing projects into 18 divisions, and GenBank staff assign unique accession.version identifiers upon data receipt. Most submitters use the web-based BankIt or standalone Sequin programs. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the nuccore, nucest, and nucgss databases of the Entrez retrieval system, which integrates these records with a variety of other data including taxonomy nodes, genomes, protein structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.

Assuntos

Bases de Dados de Ácidos Nucleicos , Análise de Sequência de DNA , Proteínas/genética , RNA Ribossômico 16S/genética

16.

GenBank.

Benson, Dennis A; Clark, Karen; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W.

Nucleic Acids Res ; 43(Database issue): D30-5, 2015 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-25414350

RESUMO

GenBank(®) (http://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for over 300 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.

Assuntos

Bases de Dados de Ácidos Nucleicos , Bactérias/classificação , Genômica , Internet , Análise de Sequência de DNA , Análise de Sequência de Proteína

17.

GenBank.

Benson, Dennis A; Clark, Karen; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W.

Nucleic Acids Res ; 42(Database issue): D32-7, 2014 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-24217914

RESUMO

GenBank is a comprehensive database that contains publicly available nucleotide sequences for over 280,000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the National Center for Biotechnology Information (NCBI) Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI home page: www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados de Ácidos Nucleicos , Análise de Sequência de DNA , Bactérias/classificação , Bactérias/genética , Sequenciamento de Nucleotídeos em Larga Escala , Internet , Anotação de Sequência Molecular

18.

The International Nucleotide Sequence Database Collaboration.

Nakamura, Yasukazu; Cochrane, Guy; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 41(Database issue): D21-4, 2013 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-23180798

RESUMO

The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org), one of the longest-standing global alliances of biological data archives, captures, preserves and provides comprehensive public domain nucleotide sequence information. Three partners of the INSDC work in cooperation to establish formats for data and metadata and protocols that facilitate reliable data submission to their databases and support continual data exchange around the world. In this article, the INSDC current status and update for the year of 2012 are presented. Among discussed items of international collaboration meeting in 2012, BioSample database and changes in submission are described as topics.

Assuntos

Sequência de Bases , Bases de Dados de Ácidos Nucleicos , Genômica , Internet , Anotação de Sequência Molecular

19.

GenBank.

Benson, Dennis A; Cavanaugh, Mark; Clark, Karen; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W.

Nucleic Acids Res ; 41(Database issue): D36-42, 2013 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-23193287

RESUMO

GenBank® (http://www.ncbi.nlm.nih.gov) is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI home page: www.ncbi.nlm.nih.gov.

Assuntos

Sequência de Bases , Bases de Dados de Ácidos Nucleicos , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Internet , Anotação de Sequência Molecular , Análise de Sequência de DNA

20.

The Genomic Standards Consortium.

Field, Dawn; Amaral-Zettler, Linda; Cochrane, Guy; Cole, James R; Dawyndt, Peter; Garrity, George M; Gilbert, Jack; Glöckner, Frank Oliver; Hirschman, Lynette; Karsch-Mizrachi, Ilene; Klenk, Hans-Peter; Knight, Rob; Kottmann, Renzo; Kyrpides, Nikos; Meyer, Folker; San Gil, Inigo; Sansone, Susanna-Assunta; Schriml, Lynn M; Sterk, Peter; Tatusova, Tatiana; Ussery, David W; White, Owen; Wooley, John.

PLoS Biol ; 9(6): e1001088, 2011 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-21713030

RESUMO

A vast and rich body of information has grown up as a result of the world's enthusiasm for 'omics technologies. Finding ways to describe and make available this information that maximise its usefulness has become a major effort across the 'omics world. At the heart of this effort is the Genomic Standards Consortium (GSC), an open-membership organization that drives community-based standardization activities, Here we provide a short history of the GSC, provide an overview of its range of current activities, and make a call for the scientific community to join forces to improve the quality and quantity of contextual information about our public collections of genomes, metagenomes, and marker gene sequences.

Assuntos

Bases de Dados Genéticas , Genômica/normas , Cooperação Internacional , Metagenoma

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

Detalhe da pesquisa