Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nucleic Acids Res ; 52(D1): D33-D43, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-37994677

RESUMEN

The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, SciENcv, the NIH Comparative Genomics Resource (CGR), NCBI Virus, SRA, RefSeq, foreign contamination screening tools, Taxonomy, iCn3D, ClinVar, GTR, MedGen, dbSNP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.


Asunto(s)
Bases de Datos Genéticas , National Library of Medicine (U.S.) , Biotecnología/instrumentación , Bases de Datos de Ácidos Nucleicos , Internet , Estados Unidos
2.
Nucleic Acids Res ; 51(D1): D29-D38, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36370100

RESUMEN

The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. New resources include the Comparative Genome Resource (CGR) and the BLAST ClusteredNR database. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, IgBLAST, GDV, RefSeq, NCBI Virus, GenBank type assemblies, iCn3D, ClinVar, GTR, dbGaP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.


Asunto(s)
Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , Estados Unidos , National Library of Medicine (U.S.) , Alineación de Secuencia , Biotecnología , Internet
3.
Nucleic Acids Res ; 49(D1): D10-D17, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33095870

RESUMEN

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. The Entrez system provides search and retrieval operations for most of these data from 34 distinct databases. The E-utilities serve as the programming interface for the Entrez system. Custom implementations of the BLAST program provide sequence-based searching of many specialized datasets. New resources released in the past year include a new PubMed interface and NCBI datasets. Additional resources that were updated in the past year include PMC, Bookshelf, Genome Data Viewer, SRA, ClinVar, dbSNP, dbVar, Pathogen Detection, BLAST, Primer-BLAST, IgBLAST, iCn3D and PubChem. All of these resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.


Asunto(s)
Bases de Datos Genéticas , National Library of Medicine (U.S.) , Biología Computacional/métodos , Bases de Datos de Compuestos Químicos , Bases de Datos de Ácidos Nucleicos , Bases de Datos de Proteínas , Genómica/métodos , Humanos , PubMed , Estados Unidos
4.
Antimicrob Agents Chemother ; 66(4): e0033322, 2022 04 19.
Artículo en Inglés | MEDLINE | ID: mdl-35380458

RESUMEN

Assigning names to ß-lactamase variants has been inconsistent and has led to confusion in the published literature. The common availability of whole genome sequencing has resulted in an exponential growth in the number of new ß-lactamase genes. In November 2021 an international group of ß-lactamase experts met virtually to develop a consensus for the way naturally-occurring ß-lactamase genes should be named. This document formalizes the process for naming novel ß-lactamases, followed by their subsequent publication.


Asunto(s)
Inhibidores de beta-Lactamasas , beta-Lactamasas , Consenso , beta-Lactamasas/genética
5.
Artículo en Inglés | MEDLINE | ID: mdl-31712217

RESUMEN

Unlike for classes A and B, a standardized amino acid numbering scheme has not been proposed for the class C (AmpC) ß-lactamases, which complicates communication in the field. Here, we propose a scheme developed through a collaborative approach that considers both sequence and structure, preserves traditional numbering of catalytically important residues (Ser64, Lys67, Tyr150, and Lys315), is adaptable to new variants or enzymes yet to be discovered and includes a variation for genetic and epidemiological applications.


Asunto(s)
Proteínas Bacterianas/clasificación , Bacterias Gramnegativas/genética , Bacterias Grampositivas/genética , Mutación , Terminología como Asunto , Resistencia betalactámica/genética , beta-Lactamasas/clasificación , Secuencia de Aminoácidos , Antibacterianos/química , Antibacterianos/farmacología , Proteínas Bacterianas/antagonistas & inhibidores , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Expresión Génica , Bacterias Gramnegativas/efectos de los fármacos , Bacterias Gramnegativas/enzimología , Bacterias Grampositivas/efectos de los fármacos , Bacterias Grampositivas/enzimología , Cooperación Internacional , Estructura Secundaria de Proteína , Alineación de Secuencia , Homología de Secuencia de Aminoácido , Inhibidores de beta-Lactamasas/química , Inhibidores de beta-Lactamasas/farmacología , beta-Lactamasas/genética , beta-Lactamasas/metabolismo , beta-Lactamas/química , beta-Lactamas/farmacología
6.
Artículo en Inglés | MEDLINE | ID: mdl-31427293

RESUMEN

Antimicrobial resistance (AMR) is a major public health problem that requires publicly available tools for rapid analysis. To identify AMR genes in whole-genome sequences, the National Center for Biotechnology Information (NCBI) has produced AMRFinder, a tool that identifies AMR genes using a high-quality curated AMR gene reference database. The Bacterial Antimicrobial Resistance Reference Gene Database consists of up-to-date gene nomenclature, a set of hidden Markov models (HMMs), and a curated protein family hierarchy. Currently, it contains 4,579 antimicrobial resistance proteins and more than 560 HMMs. Here, we describe AMRFinder and its associated database. To assess the predictive ability of AMRFinder, we measured the consistency between predicted AMR genotypes from AMRFinder and resistance phenotypes of 6,242 isolates from the National Antimicrobial Resistance Monitoring System (NARMS). This included 5,425 Salmonella enterica, 770 Campylobacter spp., and 47 Escherichia coli isolates phenotypically tested against various antimicrobial agents. Of 87,679 susceptibility tests performed, 98.4% were consistent with predictions. To assess the accuracy of AMRFinder, we compared its gene symbol output with that of a 2017 version of ResFinder, another publicly available resistance gene detection system. Most gene calls were identical, but there were 1,229 gene symbol differences (8.8%) between them, with differences due to both algorithmic differences and database composition. AMRFinder missed 16 loci that ResFinder found, while ResFinder missed 216 loci that AMRFinder identified. Based on these results, AMRFinder appears to be a highly accurate AMR gene detection system.

7.
J Antimicrob Chemother ; 73(10): 2625-2630, 2018 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-30053115

RESUMEN

The initial report of the mcr-1 (mobile colistin resistance) gene has led to many reports of mcr-1 variants and other mcr genes from different bacterial species originating from human, animal and environmental samples in different geographical locations. Resistance gene nomenclature is complex and unfortunately problems such as different names being used for the same gene/protein or the same name being used for different genes/proteins are not uncommon. Registries exist for some families, such as bla (ß-lactamase) genes, but there is as yet no agreed nomenclature scheme for mcr genes. The National Center for Biotechnology Information (NCBI) recently took over assigning bla allele numbers from the longstanding Lahey ß-lactamase website and has agreed to do the same for mcr genes. Here, we propose a nomenclature scheme that we hope will be acceptable to researchers in this area and that will reduce future confusion.


Asunto(s)
Alelos , Antibacterianos/farmacología , Bacterias/genética , Colistina/farmacología , Farmacorresistencia Bacteriana/genética , Genes MDR , Bacterias/efectos de los fármacos , Escherichia coli/efectos de los fármacos , Proteínas de Escherichia coli/genética , Pruebas de Sensibilidad Microbiana , Terminología como Asunto , Secuenciación Completa del Genoma , beta-Lactamasas/genética
8.
Clin Infect Dis ; 63(3): 380-6, 2016 08 01.
Artículo en Inglés | MEDLINE | ID: mdl-27090985

RESUMEN

Listeria monocytogenes (Lm) causes severe foodborne illness (listeriosis). Previous molecular subtyping methods, such as pulsed-field gel electrophoresis (PFGE), were critical in detecting outbreaks that led to food safety improvements and declining incidence, but PFGE provides limited genetic resolution. A multiagency collaboration began performing real-time, whole-genome sequencing (WGS) on all US Lm isolates from patients, food, and the environment in September 2013, posting sequencing data into a public repository. Compared with the year before the project began, WGS, combined with epidemiologic and product trace-back data, detected more listeriosis clusters and solved more outbreaks (2 outbreaks in pre-WGS year, 5 in WGS year 1, and 9 in year 2). Whole-genome multilocus sequence typing and single nucleotide polymorphism analyses provided equivalent phylogenetic relationships relevant to investigations; results were most useful when interpreted in context of epidemiological data. WGS has transformed listeriosis outbreak surveillance and is being implemented for other foodborne pathogens.


Asunto(s)
Brotes de Enfermedades , Enfermedades Transmitidas por los Alimentos/epidemiología , Genoma Bacteriano/genética , Listeria monocytogenes/clasificación , Listeriosis/epidemiología , Secuenciación Completa del Genoma/métodos , Inocuidad de los Alimentos , Enfermedades Transmitidas por los Alimentos/microbiología , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Listeria monocytogenes/genética , Listeria monocytogenes/aislamiento & purificación , Listeriosis/microbiología , Tipificación de Secuencias Multilocus , Filogenia , Análisis de Secuencia de ADN
10.
Microorganisms ; 11(10)2023 Oct 14.
Artículo en Inglés | MEDLINE | ID: mdl-37894219

RESUMEN

The sharing of genome sequences in online data repositories allows for large scale analyses of specific genes or gene families. This can result in the detection of novel gene subtypes as well as the development of improved detection methods. Here, we used publicly available WGS data to detect a novel Stx subtype, Stx2n in two clinical E. coli strains isolated in the USA. During this process, additional Stx2 subtypes were detected; six Stx2j, one Stx2m strain, and one Stx2o, were all analyzed for variability from the originally described subtypes. Complete genome sequences were assembled from short- or long-read sequencing and analyzed for serotype, and ST types. The WGS data from Stx2n- and Stx2o-producing STEC strains were further analyzed for virulence genes pro-phage analysis and phage insertion sites. Nucleotide and amino acid maximum parsimony trees showed expected clustering of the previously described subtypes and a clear separation of the novel Stx2n subtype. WGS data were used to design OMNI PCR primers for the detection of all known stx1 (283 bp amplicon), stx2 (400 bp amplicon), intimin encoded by eae (221 bp amplicon), and stx2f (438 bp amplicon) subtypes. These primers were tested in three different laboratories, using standard reference strains. An analysis of the complete genome sequence showed variability in serogroup, virulence genes, and ST type, and Stx2 pro-phages showed variability in size, gene composition, and phage insertion sites. The strains with Stx2j, Stx2m, Stx2n, and Stx2o showed toxicity to Vero cells. Stx2j carrying strain, 2012C-4221, was induced when grown with sub-inhibitory concentrations of ciprofloxacin, and toxicity was detected. Taken together, these data highlight the need to reinforce genomic surveillance to identify the emergence of potential new Stx2 or Stx1 variants. The importance of this surveillance has a paramount impact on public health. Per our description in this study, we suggest that 2017C-4317 be designated as the Stx2n type-strain.

11.
Microb Genom ; 9(12)2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38085797

RESUMEN

Fast, efficient public health actions require well-organized and coordinated systems that can supply timely and accurate knowledge. Public databases of pathogen genomic data, such as the International Nucleotide Sequence Database Collaboration (INSDC), have become essential tools for efficient public health decisions. However, these international resources began primarily for academic purposes, rather than for surveillance or interventions. Now, queries need to access not only the whole genomes of multiple pathogens but also make connections using robust contextual metadata to identify issues of public health relevance. Databases that over time developed a patchwork of submission formats and requirements need to be consistently organized and coordinated internationally to allow effective searches.To help resolve these issues, we propose a common pathogen data structure called the Pathogen Data Object Model (DOM) that will formalize the minimum pieces of sequence data and contextual data necessary for general public health uses, while recognizing that submitters will likely withhold a wide range of non-public contextual data. Further, we propose contributors use the Pathogen DOM for all pathogen submissions (bacterial, viral, fungal, and parasites), which will simplify data submissions and provide a consistent and transparent data structure for downstream data analyses. We also highlight how improved submission tools can support the Pathogen DOM, offering users additional easy-to-use methods to ensure this structure is followed.


Asunto(s)
Nucleótidos , Salud Pública , Secuencia de Bases , Genómica/métodos , Bases de Datos de Ácidos Nucleicos
12.
Microb Genom ; 8(6)2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35675101

RESUMEN

Antimicrobial resistance (AMR) is a significant public health threat. Low-cost whole-genome sequencing, which is often used in surveillance programmes, provides an opportunity to assess AMR gene content in these genomes using in silico approaches. A variety of bioinformatic tools have been developed to identify these genomic elements. Most of those tools rely on reference databases of nucleotide or protein sequences and collections of models and rules for analysis. While the tools are critical for the identification of AMR genes, the databases themselves also provide significant utility for researchers, for applications ranging from sequence analysis to information about AMR phenotypes. Additionally, these databases can be evaluated by domain experts and others to ensure their accuracy. Here we describe how we curate the genes, point mutations and blast rules, and hidden Markov models used in NCBI's AMRFinderPlus, along with the quality-control steps we take to ensure database quality. We also describe the web interfaces that display the full structure of the database and their newly developed cross-browser relationships. Then, using the Reference Gene Catalog as an example, we detail how the databases, rules and models are made publicly available, as well as how to access the software. In addition, as part of the Pathogen Detection system, we have analysed over 1 million publicly available genomes using AMRFinderPlus and its databases. We discuss how the computed analyses generated by those tools can be accessed through a web interface. Finally, we conclude with NCBI's plans to make these databases accessible over the long-term.


Asunto(s)
Biología Computacional , Programas Informáticos , Secuencia de Aminoácidos , Secuenciación Completa del Genoma
13.
J Food Prot ; 85(5): 755-772, 2022 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-35259246

RESUMEN

ABSTRACT: This multiagency report developed by the Interagency Collaboration for Genomics for Food and Feed Safety provides an overview of the use of and transition to whole genome sequencing (WGS) technology for detection and characterization of pathogens transmitted commonly by food and for identification of their sources. We describe foodborne pathogen analysis, investigation, and harmonization efforts among the following federal agencies: National Institutes of Health; Department of Health and Human Services, Centers for Disease Control and Prevention (CDC) and U.S. Food and Drug Administration (FDA); and the U.S. Department of Agriculture, Food Safety and Inspection Service, Agricultural Research Service, and Animal and Plant Health Inspection Service. We describe single nucleotide polymorphism, core-genome, and whole genome multilocus sequence typing data analysis methods as used in the PulseNet (CDC) and GenomeTrakr (FDA) networks, underscoring the complementary nature of the results for linking genetically related foodborne pathogens during outbreak investigations while allowing flexibility to meet the specific needs of Interagency Collaboration partners. We highlight how we apply WGS to pathogen characterization (virulence and antimicrobial resistance profiles) and source attribution efforts and increase transparency by making the sequences and other data publicly available through the National Center for Biotechnology Information. We also highlight the impact of current trends in the use of culture-independent diagnostic tests for human diagnostic testing on analytical approaches related to food safety and what is next for the use of WGS in the area of food safety.


Asunto(s)
Enfermedades Transmitidas por los Alimentos , Animales , Brotes de Enfermedades/prevención & control , Inocuidad de los Alimentos , Enfermedades Transmitidas por los Alimentos/epidemiología , Enfermedades Transmitidas por los Alimentos/prevención & control , Genómica , Estados Unidos , Secuenciación Completa del Genoma
16.
Nucleic Acids Res ; 37(Database issue): D32-6, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18927115

RESUMEN

NCBI's Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. RefSeq records integrate information from multiple sources and represent a current description of the sequence, the gene and sequence features. The database includes over 5300 organisms spanning prokaryotes, eukaryotes and viruses, with records for more than 5.5 x 10(6) proteins (RefSeq release 30). Feature annotation is applied by a combination of curation, collaboration, propagation from other sources and computation. We report here on the recent growth of the database, recent changes to feature annotations and record types for eukaryotic (primarily vertebrate) species and policies regarding species inclusion and genome annotation. In addition, we introduce RefSeqGene, a new initiative to support reporting variation data on a stable genomic coordinate system.


Asunto(s)
Bases de Datos Genéticas , Análisis de Secuencia/normas , Animales , Exones , Genómica/normas , Humanos , Ratones , Proteínas/química , Seudogenes , ARN no Traducido/química , Estándares de Referencia
17.
Nucleic Acids Res ; 37(Database issue): D216-23, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18940865

RESUMEN

Rapid increases in DNA sequencing capabilities have led to a vast increase in the data generated from prokaryotic genomic studies, which has been a boon to scientists studying micro-organism evolution and to those who wish to understand the biological underpinnings of microbial systems. The NCBI Protein Clusters Database (ProtClustDB) has been created to efficiently maintain and keep the deluge of data up to date. ProtClustDB contains both curated and uncurated clusters of proteins grouped by sequence similarity. The May 2008 release contains a total of 285 386 clusters derived from over 1.7 million proteins encoded by 3806 nt sequences from the RefSeq collection of complete chromosomes and plasmids from four major groups: prokaryotes, bacteriophages and the mitochondrial and chloroplast organelles. There are 7180 clusters containing 376 513 proteins with curated gene and protein functional annotation. PubMed identifiers and external cross references are collected for all clusters and provide additional information resources. A suite of web tools is available to explore more detailed information, such as multiple alignments, phylogenetic trees and genomic neighborhoods. ProtClustDB provides an efficient method to aggregate gene and protein annotation for researchers and is available at http://www.ncbi.nlm.nih.gov/sites/entrez?db=proteinclusters.


Asunto(s)
Bases de Datos de Proteínas , Proteínas/clasificación , Análisis por Conglomerados , Genómica , Proteínas/química , Proteínas/genética , Homología de Secuencia de Aminoácido
18.
Sci Rep ; 11(1): 12728, 2021 06 16.
Artículo en Inglés | MEDLINE | ID: mdl-34135355

RESUMEN

Antimicrobial resistance (AMR) is a significant public health threat. With the rise of affordable whole genome sequencing, in silico approaches to assessing AMR gene content can be used to detect known resistance mechanisms and potentially identify novel mechanisms. To enable accurate assessment of AMR gene content, as part of a multi-agency collaboration, NCBI developed a comprehensive AMR gene database, the Bacterial Antimicrobial Resistance Reference Gene Database and the AMR gene detection tool AMRFinder. Here, we describe the expansion of the Reference Gene Database, now called the Reference Gene Catalog, to include putative acid, biocide, metal, stress resistance genes, in addition to virulence genes and species-specific point mutations. Genes and point mutations are classified by broad functions, as well as more detailed functions. As we have expanded both the functional repertoire of identified genes and functionality, NCBI released a new version of AMRFinder, known as AMRFinderPlus. This new tool allows users the option to utilize only the core set of AMR elements, or include stress response and virulence genes, too. AMRFinderPlus can detect acquired genes and point mutations in both protein and nucleotide sequence. In addition, the evidence used to identify the gene has been expanded to include whether nucleotide or protein sequence was used, its location in the contig, and presence of an internal stop codon. These database improvements and functional expansions will enable increased precision in identifying AMR genes, linking AMR genotypes and phenotypes, and determining possible relationships between AMR, virulence, and stress response.


Asunto(s)
Antibacterianos/farmacología , Bacterias/efectos de los fármacos , Bases de Datos Genéticas , Farmacorresistencia Bacteriana/genética , Genes Bacterianos , Bacterias/genética , Bacterias/patogenicidad , Farmacorresistencia Bacteriana Múltiple/genética , Genoma Bacteriano , Mercurio/farmacología , Plásmidos , Salmonella/efectos de los fármacos , Salmonella/genética , Virulencia/genética
19.
Nat Commun ; 10(1): 3313, 2019 07 25.
Artículo en Inglés | MEDLINE | ID: mdl-31346170

RESUMEN

FDA proactively invests in tools to support innovation of emerging technologies, such as infectious disease next generation sequencing (ID-NGS). Here, we introduce FDA-ARGOS quality-controlled reference genomes as a public database for diagnostic purposes and demonstrate its utility on the example of two use cases. We provide quality control metrics for the FDA-ARGOS genomic database resource and outline the need for genome quality gap filling in the public domain. In the first use case, we show more accurate microbial identification of Enterococcus avium from metagenomic samples with FDA-ARGOS reference genomes compared to non-curated GenBank genomes. In the second use case, we demonstrate the utility of FDA-ARGOS reference genomes for Ebola virus target sequence comparison as part of a composite validation strategy for ID-NGS diagnostic tests. The use of FDA-ARGOS as an in silico target sequence comparator tool combined with representative clinical testing could reduce the burden for completing ID-NGS clinical trials.


Asunto(s)
Enfermedades Transmisibles/diagnóstico , Bases de Datos de Ácidos Nucleicos/normas , Genoma , Acceso a la Información , Enfermedades Transmisibles/microbiología , Bases de Datos de Ácidos Nucleicos/organización & administración , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Estados Unidos , United States Food and Drug Administration
20.
OMICS ; 12(2): 137-41, 2008 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-18416670

RESUMEN

The methodologies used to generate genome and metagenome annotations are diverse and vary between groups and laboratories. Descriptions of the annotation process are helpful in interpreting genome annotation data. Some groups have produced Standard Operating Procedures (SOPs) that describe the annotation process, but standards are lacking for structure and content of these descriptions. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse an online repository of SOPs.


Asunto(s)
Bases de Datos Genéticas/normas , Genómica , Sistemas en Línea/normas , Internet
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA