Dashing: fast and accurate genomic distances with HyperLogLog.
Baker, Daniel N; Langmead, Ben.
Affiliation
  • Baker DN; Department of Computer Science, Johns Hopkins University, 3400 N Charles St, Baltimore, 21218, USA. dnb@cs.jhu.edu.
  • Langmead B; Department of Computer Science, Johns Hopkins University, 3400 N Charles St, Baltimore, 21218, USA. langmea@cs.jhu.edu.
Genome Biol; 20(1): 265, 2019 12 04.
Article in English | MEDLINE | ID: mdl-31801633
Dashing is a fast and accurate software tool for estimating similarities of genomes or sequencing datasets. It uses the HyperLogLog sketch together with cardinality estimation methods that are specialized for set unions and intersections. Dashing summarizes genomes more rapidly than previous MinHash-based methods while providing greater accuracy across a wide range of input sizes and sketch sizes. It can sketch and calculate pairwise distances for over 87K genomes in 6 minutes. Dashing is open source and available at https://github.com/dnbaker/dashing.
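As a rough illustration of the idea in the abstract, the following minimal Python sketch builds HyperLogLog summaries of k-mer sets, merges them to represent a union, and derives a Jaccard similarity. It is not Dashing's implementation: it assumes the original HyperLogLog estimator with a linear-counting correction for small sets, and it obtains the intersection size by plain inclusion-exclusion, whereas the abstract describes estimators specialized for unions and intersections. The names `HLL`, `kmers`, and `jaccard`, the BLAKE2b hash, and the default of 2^12 registers are all choices made for this example.

```python
import hashlib
import math


def kmers(seq, k):
    """Yield every length-k substring of seq (no canonicalization)."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


class HLL:
    """Illustrative HyperLogLog sketch with 2**p registers (assumes p >= 7)."""

    def __init__(self, p=12):
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m

    def add(self, item):
        # 64-bit hash of the item; the hash function is an arbitrary choice here.
        digest = hashlib.blake2b(item.encode(), digest_size=8).digest()
        h = int.from_bytes(digest, "big")
        idx = h >> (64 - self.p)                  # top p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)     # remaining 64 - p bits
        # Rank = position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def union(self, other):
        # The sketch of a union is the register-wise maximum.
        assert self.p == other.p, "sketches must use the same register count"
        out = HLL(self.p)
        out.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]
        return out

    def cardinality(self):
        m = self.m
        alpha = 0.7213 / (1 + 1.079 / m)          # bias constant for m >= 128
        raw = alpha * m * m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * m and zeros:
            return m * math.log(m / zeros)        # linear counting for small sets
        return raw


def jaccard(a, b):
    """Estimate |A ∩ B| / |A ∪ B| by inclusion-exclusion on three estimates."""
    ca, cb = a.cardinality(), b.cardinality()
    cu = a.union(b).cardinality()
    inter = max(0.0, ca + cb - cu)
    return inter / cu if cu else 0.0
```

To compare two sequences under these assumptions, add each k-mer from `kmers(seq, k)` to its own `HLL` and pass the two sketches to `jaccard`. Because merging is a register-wise maximum, sketches can be computed once per genome and reused for every pairwise comparison.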

Full text: 1 | Collection: 01-international | Database: MEDLINE | Main subject: Software / Genomics | Study type: Evaluation_studies | Language: English | Journal: Genome Biol | Journal subject: MOLECULAR BIOLOGY / GENETICS | Year: 2019 | Document type: Article | Affiliation country: United States
