Your browser doesn't support javascript.
loading
SquiggleNet: real-time, direct classification of nanopore signals.
Bao, Yuwei; Wadden, Jack; Erb-Downward, John R; Ranjan, Piyush; Zhou, Weichen; McDonald, Torrin L; Mills, Ryan E; Boyle, Alan P; Dickson, Robert P; Blaauw, David; Welch, Joshua D.
Afiliação
  • Bao Y; Department of Computer Science and Engineering, University of Michigan, Ann Arbor, 48109, MI, USA.
  • Wadden J; Department of Computer Science and Engineering, University of Michigan, Ann Arbor, 48109, MI, USA.
  • Erb-Downward JR; Department of Electrical and Computer Engineering, University of Michigan, Ann Arbor, 48109, MI, USA.
  • Ranjan P; Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, 48109, MI, USA.
  • Zhou W; Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, 48109, MI, USA.
  • McDonald TL; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA.
  • Mills RE; Department of Human Genetics, University of Michigan Medical, Ann Arbor, 48109, MI, USA.
  • Boyle AP; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA.
  • Dickson RP; Department of Human Genetics, University of Michigan Medical, Ann Arbor, 48109, MI, USA.
  • Blaauw D; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA.
  • Welch JD; Department of Human Genetics, University of Michigan Medical, Ann Arbor, 48109, MI, USA.
Genome Biol ; 22(1): 298, 2021 10 27.
Article em En | MEDLINE | ID: mdl-34706748
ABSTRACT
We present SquiggleNet, the first deep-learning model that can classify nanopore reads directly from their electrical signals. SquiggleNet operates faster than DNA passes through the pore, allowing real-time classification and read ejection. Using 1 s of sequencing data, the classifier achieves significantly higher accuracy than base calling followed by sequence alignment. Our approach is also faster and requires an order of magnitude less memory than alignment-based approaches. SquiggleNet distinguished human from bacterial DNA with over 90% accuracy, generalized to unseen bacterial species in a human respiratory meta genome sample, and accurately classified sequences containing human long interspersed repeat elements.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Aprendizado Profundo / Sequenciamento por Nanoporos Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Aprendizado Profundo / Sequenciamento por Nanoporos Idioma: En Ano de publicação: 2021 Tipo de documento: Article