Mapping-free variant calling using haplotype reconstruction from k-mer frequencies.
Audano, Peter A; Ravishankar, Shashidhar; Vannberg, Fredrik O.
Affiliation
  • Audano PA; School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA.
  • Ravishankar S; School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA.
  • Vannberg FO; School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA.
Bioinformatics; 34(10): 1659-1665, 2018-05-15.
Article in English | MEDLINE | ID: mdl-29186321
Motivation: The standard protocol for detecting variation in DNA is to map millions of short sequence reads to a known reference and find loci that differ. While this approach works well, it cannot be applied where the sample contains dense variants or is too distant from known references. De novo assembly or hybrid methods can recover genomic variation, but the cost of computation is often much higher. We developed a novel k-mer algorithm and software implementation, Kestrel, capable of characterizing densely packed SNPs and large indels without mapping, assembly or de Bruijn graphs.

Results: When applied to mosaic penicillin binding protein (PBP) genes in Streptococcus pneumoniae, we found near perfect concordance with assembled contigs at a fraction of the CPU time. Multilocus sequence typing (MLST) with this approach was able to bypass de novo assemblies. Kestrel has a very low false-positive rate when applied to the whole genome, and while Kestrel identified many variants missed by other methods, limitations of a purely k-mer based approach affect overall sensitivity.

Availability and implementation: Source code and documentation for a Java implementation of Kestrel can be found at https://github.com/paudano/kestrel. All test code for this publication is located at https://github.com/paudano/kescases.

Contact: paudano@gatech.edu or fredrik.vannberg@biology.gatech.edu.

Supplementary information: Supplementary data are available at Bioinformatics online.
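
For readers unfamiliar with k-mer frequencies, the following is a minimal illustrative sketch, not Kestrel's actual algorithm or API. It assumes a toy reference and a single toy read, counts every k-mer in the sample, and flags reference k-mers that lack support in those counts, which is one simple way a variant region might show up without mapping reads. The function names `count_kmers` and `low_support_positions`, the `min_count` threshold and the example sequences are all hypothetical.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every length-k substring across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def low_support_positions(reference, counts, k, min_count=1):
    """Return start positions of reference k-mers seen fewer than min_count times."""
    return [
        i
        for i in range(len(reference) - k + 1)
        if counts[reference[i:i + k]] < min_count
    ]


# Toy data (assumed for illustration): the read differs from the
# reference by a single G->C substitution at index 7.
reference = "ACGTACGGTCA"
reads = ["ACGTACGCTCA"]

counts = count_kmers(reads, k=4)
flagged = low_support_positions(reference, counts, k=4)
print(flagged)
```

In this toy case the flagged positions are exactly the reference k-mers that overlap the substituted base. A real tool would additionally have to handle sequencing errors, coverage variation and the reconstruction of the alternative haplotype, none of which this sketch attempts.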

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Haplotypes / Software / Genome, Bacterial / Multilocus Sequence Typing | Language: En | Journal: Bioinformatics | Journal subject: MEDICAL INFORMATICS | Publication year: 2018 | Document type: Article | Country of affiliation: United States | Country of publication: United Kingdom
