Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Nat Protoc ; 17(12): 2815-2839, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36171387

RESUMEN

Metagenomic experiments expose the wide range of microscopic organisms in any microbial environment through high-throughput DNA sequencing. The computational analysis of the sequencing data is critical for the accurate and complete characterization of the microbial community. To facilitate efficient and reproducible metagenomic analysis, we introduce a step-by-step protocol for the Kraken suite, an end-to-end pipeline for the classification, quantification and visualization of metagenomic datasets. Our protocol describes the execution of the Kraken programs, via a sequence of easy-to-use scripts, in two scenarios: (1) quantification of the species in a given metagenomics sample; and (2) detection of a pathogenic agent from a clinical sample taken from a human patient. The protocol, which is executed within 1-2 h, is targeted to biologists and clinicians working in microbiome or metagenomics analysis who are familiar with the Unix command-line environment.


Asunto(s)
Metagenoma , Microbiota , Humanos , Programas Informáticos , Metagenómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Microbiota/genética , Análisis de Secuencia de ADN/métodos
2.
Nat Commun ; 13(1): 2830, 2022 05 20.
Artículo en Inglés | MEDLINE | ID: mdl-35595835

RESUMEN

The lack of validated, distributed comprehensive genomic profiling assays for patients with cancer inhibits access to precision oncology treatment. To address this, we describe elio tissue complete, which has been FDA-cleared for examination of 505 cancer-related genes. Independent analyses of clinically and biologically relevant sequence changes across 170 clinical tumor samples using MSK-IMPACT, FoundationOne, and PCR-based methods reveals a positive percent agreement of >97%. We observe high concordance with whole-exome sequencing for evaluation of tumor mutational burden for 307 solid tumors (Pearson r = 0.95) and comparison of the elio tissue complete microsatellite instability detection approach with an independent PCR assay for 223 samples displays a positive percent agreement of 99%. Finally, evaluation of amplifications and translocations against DNA- and RNA-based approaches exhibits >98% negative percent agreement and positive percent agreement of 86% and 82%, respectively. These methods provide an approach for pan-solid tumor comprehensive genomic profiling with high analytical performance.


Asunto(s)
Neoplasias , Biomarcadores de Tumor/genética , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Mutación , Neoplasias/patología , Medicina de Precisión
3.
Front Bioinform ; 1: 808003, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-36303747

RESUMEN

Ten years ago, the dramatic rise in the number of microbial genomes led to an inflection point, when the approach of finding short, exact matches in a comprehensive database became just as accurate as older, slower approaches. The new idea led to a method that was hundreds of times times faster than those that came before. Today, exact k-mer matching is a standard technique at the heart of many microbiome analysis tools.

4.
Genome Biol ; 20(1): 257, 2019 11 28.
Artículo en Inglés | MEDLINE | ID: mdl-31779668

RESUMEN

Although Kraken's k-mer-based approach provides a fast taxonomic classification of metagenomic sequence data, its large memory requirements can be limiting for some applications. Kraken 2 improves upon Kraken 1 by reducing memory usage by 85%, allowing greater amounts of reference genomic data to be used, while maintaining high accuracy and increasing speed fivefold. Kraken 2 also introduces a translated search mode, providing increased sensitivity in viral metagenomics analysis.


Asunto(s)
Metagenómica/métodos , Programas Informáticos
5.
Sci Transl Med ; 10(457)2018 09 05.
Artículo en Inglés | MEDLINE | ID: mdl-30185652

RESUMEN

Variability in the accuracy of somatic mutation detection may affect the discovery of alterations and the therapeutic management of cancer patients. To address this issue, we developed a somatic mutation discovery approach based on machine learning that outperformed existing methods in identifying experimentally validated tumor alterations (sensitivity of 97% versus 90 to 99%; positive predictive value of 98% versus 34 to 92%). Analysis of paired tumor-normal exome data from 1368 TCGA (The Cancer Genome Atlas) samples using this method revealed concordance for 74% of mutation calls but also identified likely false-positive and false-negative changes in TCGA data, including in clinically actionable genes. Determination of high-quality somatic mutation calls improved tumor mutation load-based predictions of clinical outcome for melanoma and lung cancer patients previously treated with immune checkpoint inhibitors. Integration of high-quality machine learning mutation detection in clinical next-generation sequencing (NGS) analyses increased the accuracy of test results compared to other clinical sequencing analyses. These analyses provide an approach for improved identification of tumor-specific mutations and have important implications for research and clinical management of cancer patients.


Asunto(s)
Aprendizaje Automático , Mutación/genética , Exoma/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Inmunoterapia , Neoplasias/genética , Neoplasias/inmunología , Neoplasias/terapia , Programas Informáticos , Secuenciación del Exoma
6.
PeerJ ; 2: e675, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25426337

RESUMEN

The raw data from a genome sequencing project sometimes contains DNA from contaminating organisms, which may be introduced during sample collection or sequence preparation. In some instances, these contaminants remain in the sequence even after assembly and deposition of the genome into public databases. As a result, searches of these databases may yield erroneous and confusing results. We used efficient microbiome analysis software to scan the draft assembly of domestic cow, Bos taurus, and identify 173 small contigs that appeared to derive from microbial contaminants. In the course of verifying these findings, we discovered that one genome, Neisseria gonorrhoeae TCDC-NG08107, although putatively a complete genome, contained multiple sequences that actually derived from the cow and sheep genomes. Our findings illustrate the need to carefully validate findings of anomalous DNA that rely on comparisons to either draft or finished genomes.

7.
Genome Biol ; 15(3): R46, 2014 Mar 03.
Artículo en Inglés | MEDLINE | ID: mdl-24580807

RESUMEN

Kraken is an ultrafast and highly accurate program for assigning taxonomic labels to metagenomic DNA sequences. Previous programs designed for this task have been relatively slow and computationally expensive, forcing researchers to use faster abundance estimation programs, which only classify small subsets of metagenomic data. Using exact alignment of k-mers, Kraken achieves classification accuracy comparable to the fastest BLAST program. In its fastest mode, Kraken classifies 100 base pair reads at a rate of over 4.1 million reads per minute, 909 times faster than Megablast and 11 times faster than the abundance estimation program MetaPhlAn. Kraken is available at http://ccb.jhu.edu/software/kraken/.


Asunto(s)
Metagenómica/métodos , Alineación de Secuencia/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Archaea/clasificación , Archaea/genética , Bacterias/clasificación , Bacterias/genética , Clasificación , Humanos , Metagenoma , Sensibilidad y Especificidad
8.
Biol Direct ; 7: 37, 2012 Oct 30.
Artículo en Inglés | MEDLINE | ID: mdl-23111013

RESUMEN

BACKGROUND: The dramatic reduction in the cost of sequencing has allowed many researchers to join in the effort of sequencing and annotating prokaryotic genomes. Annotation methods vary considerably and may fail to identify some genes. Here we draw attention to a large number of likely genes missing from annotations using common tools such as Glimmer and BLAST. RESULTS: By analyzing 1,474 prokaryotic genome annotations in GenBank, we identify 13,602 likely missed genes that are homologs to non-hypothetical proteins, and 11,792 likely missed genes that are homologs only to hypothetical proteins, yet have supporting evidence of their protein-coding nature from COMBREX, a newly created gene function database. We also estimate the likelihood that each potential missing gene found is a genuine protein-coding gene using COMBREX. CONCLUSIONS: Our analysis of the causes of missed genes suggests that larger annotation centers tend to produce annotations with fewer missed genes than smaller centers, and many of the missed genes are short genes <300 bp. Over 1,000 of the likely missed genes could be associated with phenotype information available in COMBREX. 359 of these genes, found in pathogenic organisms, may be potential targets for pharmaceutical research. The newly identified genes are available on COMBREX's website. REVIEWERS: This article was reviewed by Daniel Haft, Arcady Mushegian, and M. Pilar Francino (nominated by David Ardell).


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genes Bacterianos , Anotación de Secuencia Molecular/métodos , Sistemas de Lectura Abierta , Bacterias/genética , Biología Computacional/métodos , Variación Genética , Genoma Bacteriano , Alineación de Secuencia , Análisis de Secuencia de ADN , Homología de Secuencia , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...