Your browser doesn't support javascript.
loading
Ultrafast prediction of somatic structural variations by filtering out reads matched to pan-genome k-mer sets.
Sohn, Jang-Il; Choi, Min-Hak; Yi, Dohun; Menon, Vipin A; Kim, Yeon Jeong; Lee, Junehawk; Park, Jung Woo; Kyung, Sungkyu; Shin, Seung-Ho; Na, Byunggook; Joung, Je-Gun; Ju, Young Seok; Yeom, Min Sun; Koh, Youngil; Yoon, Sung-Soo; Baek, Daehyun; Kim, Tae-Min; Nam, Jin-Wu.
Afiliação
  • Sohn JI; Department of Life Science, Hanyang University, Seoul, Republic of Korea.
  • Choi MH; Research Institute for Convergence of Basic Sciences, Hanyang University, Seoul, Republic of Korea.
  • Yi D; Department of Life Science, Hanyang University, Seoul, Republic of Korea.
  • Menon VA; Department of Life Science, Hanyang University, Seoul, Republic of Korea.
  • Kim YJ; Department of Life Science, Hanyang University, Seoul, Republic of Korea.
  • Lee J; Samsung Genome Institute, Samsung Medical Center, Seoul, Republic of Korea.
  • Park JW; Center for Supercomputing Applications, Division of National Supercomputing, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.
  • Kyung S; Center for Supercomputing Applications, Division of National Supercomputing, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.
  • Shin SH; GENINUS Inc., Seoul, Republic of Korea.
  • Na B; GENINUS Inc., Seoul, Republic of Korea.
  • Joung JG; Department of Electrical and Computer Engineering, Seoul National University, Seoul, Republic of Korea.
  • Ju YS; Department of Biomedical Science, College of Life Science, CHA University, Seongnam, Republic of Korea.
  • Yeom MS; Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea.
  • Koh Y; Biomedical Science and Engineering Interdisciplinary Program, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea.
  • Yoon SS; Center for Supercomputing Applications, Division of National Supercomputing, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea.
  • Baek D; College of Medicine, Seoul National University, Seoul, Republic of Korea.
  • Kim TM; College of Medicine, Seoul National University, Seoul, Republic of Korea.
  • Nam JW; School of Biological Sciences, Seoul National University, Seoul, Republic of Korea.
Nat Biomed Eng ; 7(7): 853-866, 2023 07.
Article em En | MEDLINE | ID: mdl-36536253
ABSTRACT
Variant callers typically produce massive numbers of false positives for structural variations, such as cancer-relevant copy-number alterations and fusion genes resulting from genome rearrangements. Here we describe an ultrafast and accurate detector of somatic structural variations that reduces read-mapping costs by filtering out reads matched to pan-genome k-mer sets. The detector, which we named ETCHING (for efficient detection of chromosomal rearrangements and fusion genes), reduces the number of false positives by leveraging machine-learning classifiers trained with six breakend-related features (clipped-read count, split-reads count, supporting paired-end read count, average mapping quality, depth difference and total length of clipped bases). When benchmarked against six callers on reference cell-free DNA, validated biomarkers of structural variants, matched tumour and normal whole genomes, and tumour-only targeted sequencing datasets, ETCHING was 11-fold faster than the second-fastest structural-variant caller at comparable performance and memory use. The speed and accuracy of ETCHING may aid large-scale genome projects and facilitate practical implementations in precision medicine.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Sequenciamento de Nucleotídeos em Larga Escala / Neoplasias Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Sequenciamento de Nucleotídeos em Larga Escala / Neoplasias Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article