Your browser doesn't support javascript.
loading
Vargas: heuristic-free alignment for assessing linear and graph read aligners.
Darby, Charlotte A; Gaddipati, Ravi; Schatz, Michael C; Langmead, Ben.
Afiliação
  • Darby CA; Department of Computer Science.
  • Gaddipati R; Department of Biomedical Engineering.
  • Schatz MC; Department of Computer Science.
  • Langmead B; Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA.
Bioinformatics ; 36(12): 3712-3718, 2020 06 01.
Article em En | MEDLINE | ID: mdl-32321164
ABSTRACT
MOTIVATION Read alignment is central to many aspects of modern genomics. Most aligners use heuristics to accelerate processing, but these heuristics can fail to find the optimal alignments of reads. Alignment accuracy is typically measured through simulated reads; however, the simulated location may not be the (only) location with the optimal alignment score.

RESULTS:

Vargas implements a heuristic-free algorithm guaranteed to find the highest-scoring alignment for real sequencing reads to a linear or graph genome. With semiglobal and local alignment modes and affine gap and quality-scaled mismatch penalties, it can implement the scoring functions of commonly used aligners to calculate optimal alignments. While this is computationally intensive, Vargas uses multi-core parallelization and vectorized (SIMD) instructions to make it practical to optimally align large numbers of reads, achieving a maximum speed of 456 billion cell updates per second. We demonstrate how these 'gold standard' Vargas alignments can be used to improve heuristic alignment accuracy by optimizing command-line parameters in Bowtie 2, BWA-maximal exact match and vg to align more reads correctly. AVAILABILITY AND IMPLEMENTATION Source code implemented in C++ and compiled binary releases are available at https//github.com/langmead-lab/vargas under the MIT license. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Sequenciamento de Nucleotídeos em Larga Escala / Heurística Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Sequenciamento de Nucleotídeos em Larga Escala / Heurística Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2020 Tipo de documento: Article