1.
Nucleic Acids Res ; 44(D1): D73-80, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26578580

ABSTRACT

The NCBI Assembly database (www.ncbi.nlm.nih.gov/assembly/) provides stable accessioning and data tracking for genome assembly data. The model underlying the database can accommodate a range of assembly structures, including sets of unordered contig or scaffold sequences, bacterial genomes consisting of a single complete chromosome, or complex structures such as a human genome with modeled allelic variation. The database provides an assembly accession and version to unambiguously identify the set of sequences that make up a particular version of an assembly, and tracks changes to updated genome assemblies. The Assembly database reports metadata such as assembly names, simple statistical reports of the assembly (number of contigs and scaffolds, contiguity metrics such as contig N50, total sequence length and total gap length) as well as the assembly update history. The Assembly database also tracks the relationship between an assembly submitted to the International Nucleotide Sequence Database Consortium (INSDC) and the assembly represented in the NCBI RefSeq project. Users can find assemblies of interest by querying the Assembly Resource directly or by browsing available assemblies for a particular organism. Links in the Assembly Resource allow users to easily download sequence and annotations for current versions of genome assemblies from the NCBI genomes FTP site.
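
The abstract lists contig N50 among the contiguity metrics reported for each assembly. As a minimal sketch of how that metric is conventionally defined (the function name `n50` and the sample lengths are illustrative assumptions, not taken from the Assembly database or its code), N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total length:

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths.

    Hypothetical helper for illustration; not NCBI code.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    # Walk from the longest sequence downward, accumulating length.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which we reach half the total is the N50.
        if running >= half_total:
            return length


# Made-up contig lengths: total is 290, so half is 145.
# 80 alone is below 145; 80 + 70 = 150 reaches it, so N50 is 70.
print(n50([80, 70, 50, 40, 30, 20]))
```

A higher N50 for the same total length indicates that more of the assembly sits in long sequences, which is why it is reported alongside total sequence length and total gap length.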


Subject(s)
Databases, Nucleic Acid; Genomics; Animals; Genome; Humans; Internet; Mice
2.
Genome Biol ; 25(1): 60, 2024 Feb 26.
Article in English | MEDLINE | ID: mdl-38409096

ABSTRACT

Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 min. Testing FCS-GX on artificially fragmented genomes demonstrates high sensitivity and specificity for diverse contaminant species. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination, comprising 0.16% of total bases, with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/ or https://doi.org/10.5281/zenodo.10651084 .
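
The two contamination figures in the abstract (36.8 Gbp and 0.16% of total bases) together imply the approximate size of the screened data set. The following is a rough back-of-the-envelope sketch, assuming both figures are rounded as printed; the function name is a hypothetical one chosen for illustration:

```python
def implied_total_gbp(contaminant_gbp, contaminant_percent):
    """Estimate total bases (in Gbp) from a contaminant amount and its share."""
    return contaminant_gbp / (contaminant_percent / 100)


# 36.8 Gbp at 0.16% of all bases implies roughly 23,000 Gbp (about 23 Tbp)
# screened in total; the inputs are rounded, so this is only approximate.
total = implied_total_gbp(36.8, 0.16)
print(f"{total:,.0f} Gbp")
```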


Subject(s)
Databases, Nucleic Acid; Genome; Software
3.
bioRxiv ; 2023 Jun 06.
Article in English | MEDLINE | ID: mdl-37292984

ABSTRACT

Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/.
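
The sensitivity and specificity figures quoted above follow the standard definitions for a binary classifier. A minimal sketch of those definitions is given below. The counts are invented purely for illustration and are not results from the FCS-GX evaluation, and the function is a hypothetical helper rather than part of the FCS tool suite:

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    tp: contaminant fragments correctly flagged
    fn: contaminant fragments missed
    tn: genuine fragments correctly left alone
    fp: genuine fragments wrongly flagged
    """
    sensitivity = tp / (tp + fn)  # share of true contaminants detected
    specificity = tn / (tn + fp)  # share of genuine sequence retained
    return sensitivity, specificity


# Invented counts for illustration only (not data from the paper).
sens, spec = sensitivity_specificity(tp=96, fn=4, tn=9995, fp=5)
print(f"sensitivity={sens:.2%}, specificity={spec:.2%}")
```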

4.
Science ; 324(5926): 522-8, 2009 Apr 24.
Article in English | MEDLINE | ID: mdl-19390049

ABSTRACT

To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.


Subject(s)
Biological Evolution; Genome; Alternative Splicing; Animals; Animals, Domestic; Cattle; Evolution, Molecular; Female; Genetic Variation; Humans; Male; MicroRNAs/genetics; Molecular Sequence Data; Proteins/genetics; Sequence Analysis, DNA; Species Specificity; Synteny
5.
Science ; 314(5801): 941-52, 2006 Nov 10.
Article in English | MEDLINE | ID: mdl-17095691

ABSTRACT

We report the sequence and analysis of the 814-megabase genome of the sea urchin Strongylocentrotus purpuratus, a model for developmental and systems biology. The sequencing strategy combined whole-genome shotgun and bacterial artificial chromosome (BAC) sequences. This use of BAC clones, aided by a pooling strategy, overcame difficulties associated with high heterozygosity of the genome. The genome encodes about 23,300 genes, including many previously thought to be vertebrate innovations or known only outside the deuterostomes. This echinoderm genome provides an evolutionary outgroup for the chordates and yields insights into the evolution of deuterostomes.


Subject(s)
Genome; Sequence Analysis, DNA; Strongylocentrotus purpuratus/genetics; Animals; Calcification, Physiologic; Cell Adhesion Molecules/genetics; Cell Adhesion Molecules/physiology; Complement Activation/genetics; Computational Biology; Embryonic Development/genetics; Evolution, Molecular; Gene Expression Regulation, Developmental; Genes; Immunity, Innate/genetics; Immunologic Factors/genetics; Immunologic Factors/physiology; Male; Nervous System Physiological Phenomena; Proteins/genetics; Proteins/physiology; Signal Transduction; Strongylocentrotus purpuratus/embryology; Strongylocentrotus purpuratus/immunology; Strongylocentrotus purpuratus/physiology; Transcription Factors/genetics