Your browser doesn't support javascript.
loading
NPSV-deep: a deep learning method for genotyping structural variants in short read genome sequencing data.
Linderman, Michael D; Wallace, Jacob; van der Heyde, Alderik; Wieman, Eliza; Brey, Daniel; Shi, Yiran; Hansen, Peter; Shamsi, Zahra; Liu, Jeremiah; Gelb, Bruce D; Bashir, Ali.
Afiliação
  • Linderman MD; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • Wallace J; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • van der Heyde A; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • Wieman E; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • Brey D; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • Shi Y; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • Hansen P; Department of Computer Science, Middlebury College, Middlebury, VT 05753, United States.
  • Shamsi Z; Google, Mountain View, CA 94043, United States.
  • Liu J; Google, Mountain View, CA 94043, United States.
  • Gelb BD; Mindich Child Health and Development Institute and the Departments of Pediatrics and Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States.
  • Bashir A; Google, Mountain View, CA 94043, United States.
Bioinformatics ; 40(3)2024 03 04.
Article em En | MEDLINE | ID: mdl-38444093
ABSTRACT
MOTIVATION Structural variants (SVs) play a causal role in numerous diseases but can be difficult to detect and accurately genotype (determine zygosity) with short-read genome sequencing data (SRS). Improving SV genotyping accuracy in SRS data, particularly for the many SVs first detected with long-read sequencing, will improve our understanding of genetic variation.

RESULTS:

NPSV-deep is a deep learning-based approach for genotyping previously reported insertion and deletion SVs that recasts this task as an image similarity problem. NPSV-deep predicts the SV genotype based on the similarity between pileup images generated from the actual SRS data and matching SRS simulations. We show that NPSV-deep consistently matches or improves upon the state-of-the-art for SV genotyping accuracy across different SV call sets, samples and variant types, including a 25% reduction in genotyping errors for the Genome-in-a-Bottle (GIAB) high-confidence SVs. NPSV-deep is not limited to the SVs as described; it improves deletion genotyping concordance a further 1.5 percentage points for GIAB SVs (92%) by automatically correcting imprecise/incorrectly described SVs. AVAILABILITY AND IMPLEMENTATION Python/C++ source code and pre-trained models freely available at https//github.com/mlinderm/npsv2.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos