Your browser doesn't support javascript.
loading
Sigmoni: classification of nanopore signal with a compressed pangenome index.
Shivakumar, Vikram S; Ahmed, Omar Y; Kovaka, Sam; Zakeri, Mohsen; Langmead, Ben.
Afiliação
  • Shivakumar VS; Department of Computer Science, Johns Hopkins University, 3400 North Charles St., Baltimore, MD 21218, United States.
  • Ahmed OY; Department of Computer Science, Johns Hopkins University, 3400 North Charles St., Baltimore, MD 21218, United States.
  • Kovaka S; Department of Computer Science, Johns Hopkins University, 3400 North Charles St., Baltimore, MD 21218, United States.
  • Zakeri M; Department of Computer Science, Johns Hopkins University, 3400 North Charles St., Baltimore, MD 21218, United States.
  • Langmead B; Department of Computer Science, Johns Hopkins University, 3400 North Charles St., Baltimore, MD 21218, United States.
Bioinformatics ; 40(Suppl 1): i287-i296, 2024 06 28.
Article em En | MEDLINE | ID: mdl-38940135
ABSTRACT

SUMMARY:

Improvements in nanopore sequencing necessitate efficient classification methods, including pre-filtering and adaptive sampling algorithms that enrich for reads of interest. Signal-based approaches circumvent the computational bottleneck of basecalling. But past methods for signal-based classification do not scale efficiently to large, repetitive references like pangenomes, limiting their utility to partial references or individual genomes. We introduce Sigmoni a rapid, multiclass classification method based on the r-index that scales to references of hundreds of Gbps. Sigmoni quantizes nanopore signal into a discrete alphabet of picoamp ranges. It performs rapid, approximate matching using matching statistics, classifying reads based on distributions of picoamp matching statistics and co-linearity statistics, all in linear query time without the need for seed-chain-extend. Sigmoni is 10-100× faster than previous methods for adaptive sampling in host depletion experiments with improved accuracy, and can query reads against large microbial or human pangenomes. Sigmoni is the first signal-based tool to scale to a complete human genome and pangenome while remaining fast enough for adaptive sampling applications. AVAILABILITY AND IMPLEMENTATION Sigmoni is implemented in Python, and is available open-source at https//github.com/vshiv18/sigmoni.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article