Your browser doesn't support javascript.
loading
Comparison of structural variant callers for massive whole-genome sequence data.
Joe, Soobok; Park, Jong-Lyul; Kim, Jun; Kim, Sangok; Park, Ji-Hwan; Yeo, Min-Kyung; Lee, Dongyoon; Yang, Jin Ok; Kim, Seon-Young.
  • Joe S; Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
  • Park JL; Aging Convergence Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
  • Kim J; Department of Functional Genomics, University of Science and Technology (UST), 34113, Daejeon, Republic of Korea.
  • Kim S; Department of Convergent Bioscience and Informatics, College of Bioscience and Biotechnology, Chungnam National University, Daejeon, 34134, Republic of Korea.
  • Park JH; Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
  • Yeo MK; Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
  • Lee D; Department of Bioscience, University of Science and Technology (UST), Daejeon, 34113, Republic of Korea.
  • Yang JO; Department of Pathology, Chungnam National University School of Medicine, Daejeon, 35015, Republic of Korea.
  • Kim SY; Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
BMC Genomics ; 25(1): 318, 2024 Mar 28.
Article en En | MEDLINE | ID: mdl-38549092
ABSTRACT

BACKGROUND:

Detecting structural variations (SVs) at the population level using next-generation sequencing (NGS) requires substantial computational resources and processing time. Here, we compared the performances of 11 SV callers Delly, Manta, GridSS, Wham, Sniffles, Lumpy, SvABA, Canvas, CNVnator, MELT, and INSurVeyor. These SV callers have been recently published and have been widely employed for processing massive whole-genome sequencing datasets. We evaluated the accuracy, sequence depth, running time, and memory usage of the SV callers.

RESULTS:

Notably, several callers exhibited better calling performance for deletions than for duplications, inversions, and insertions. Among the SV callers, Manta identified deletion SVs with better performance and efficient computing resources, and both Manta and MELT demonstrated relatively good precision regarding calling insertions. We confirmed that the copy number variation callers, Canvas and CNVnator, exhibited better performance in identifying long duplications as they employ the read-depth approach. Finally, we also verified the genotypes inferred from each SV caller using a phased long-read assembly dataset, and Manta showed the highest concordance in terms of the deletions and insertions.

CONCLUSIONS:

Our findings provide a comprehensive understanding of the accuracy and computational efficiency of SV callers, thereby facilitating integrative analysis of SV profiles in diverse large-scale genomic datasets.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Genómica / Variaciones en el Número de Copia de ADN Límite: Humans Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Genómica / Variaciones en el Número de Copia de ADN Límite: Humans Idioma: En Año: 2024 Tipo del documento: Article