Rapid Real-time Squiggle Classification for Read Until Using RawMap.
Sadasivan, Harisankar; Wadden, Jack; Goliya, Kush; Ranjan, Piyush; Dickson, Robert P; Blaauw, David; Das, Reetuparna; Narayanasamy, Satish.
Affiliation
  • Sadasivan H; Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA.
  • Wadden J; Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA.
  • Goliya K; Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA.
  • Ranjan P; Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, 48109, USA.
  • Dickson RP; Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, 48109, USA.
  • Blaauw D; Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA.
  • Das R; Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA.
  • Narayanasamy S; Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA.
Arch Clin Biomed Res; 7(1): 45-57, 2023.
Article in English | MEDLINE | ID: mdl-36938368
Read Until enables Oxford Nanopore Technologies' (ONT) sequencers to selectively sequence reads of target species in real time. This allows efficient microbial enrichment for applications such as microbial abundance estimation, and it is particularly beneficial for metagenomic samples with a very high fraction of non-target reads (> 99% can be human reads). However, Read Until requires a fast and accurate software filter that analyzes a short prefix of a read and determines whether it belongs to a microbe of interest (target) or not. The baseline Read Until pipeline uses Guppy, a deep neural network-based basecaller, and is slow and inaccurate for this task (~60% of bases sequenced are unclassified). We present RawMap, an efficient, CPU-only, microbial species-agnostic Read Until classifier for filtering non-target human reads in the squiggle space. RawMap uses a Support Vector Machine (SVM) trained to distinguish human from microbial reads using non-linear and non-stationary characteristics of ONT's squiggle output (continuous electrical signals). Compared to the baseline Read Until pipeline, RawMap classifies 1327X faster and significantly reduces sequencing time, sequencing cost, and compute time. We show that RawMap-augmented pipelines reduce sequencing time and cost by ~24% and computing cost by 22%. Additionally, since RawMap is agnostic to microbial species, it can also classify microbial species it was not trained on. We also discuss how RawMap may be used as an alternative to the RT-PCR test for viral load quantification of SARS-CoV-2.
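The filtering step described in the abstract (inspect a short squiggle prefix, classify it, then keep or eject the read) can be sketched roughly as follows. This is a minimal illustration and not RawMap's implementation: the two summary features, the linear decision function, the weights, and the names `prefix_features`, `svm_score`, and `read_until_action` are all assumptions made for the example. The abstract does not specify RawMap's features or SVM kernel, and the toy signals below are arbitrary numbers rather than real nanopore data.

```python
import statistics

# Hypothetical parameters standing in for an SVM trained offline.
# The values are invented for illustration and are not taken from RawMap.
WEIGHTS = [0.1, 0.05]
BIAS = -1.0


def prefix_features(prefix):
    """Return two illustrative summary features of a raw-signal prefix.

    Feature 0: mean absolute difference between consecutive samples.
    Feature 1: population standard deviation of the samples.
    """
    steps = [abs(b - a) for a, b in zip(prefix, prefix[1:])]
    return [statistics.fmean(steps), statistics.pstdev(prefix)]


def svm_score(features):
    """Linear SVM decision function: w . x + b."""
    return sum(w * x for w, x in zip(WEIGHTS, features)) + BIAS


def read_until_action(prefix):
    """Map the classifier output to a Read Until decision.

    A non-negative score is treated as 'target' (keep sequencing);
    a negative score is treated as 'non-target' (eject the read).
    """
    return "continue" if svm_score(prefix_features(prefix)) >= 0 else "eject"


if __name__ == "__main__":
    flat_prefix = [100.0, 100.0, 100.0, 100.0]   # arbitrary toy signal
    jagged_prefix = [90.0, 110.0, 90.0, 110.0]   # arbitrary toy signal
    print(read_until_action(flat_prefix))
    print(read_until_action(jagged_prefix))
```

Because the decision is a handful of arithmetic operations on the raw signal, a filter of this shape avoids basecalling entirely, which is the kind of saving the abstract attributes to classifying in the squiggle space.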
Full text: 1 Databases: MEDLINE Language: En Journal: Arch Clin Biomed Res Publication year: 2023 Document type: Article Country of affiliation: United States