Your browser doesn't support javascript.
loading
VCFShark: how to squeeze a VCF file.
Deorowicz, Sebastian; Danek, Agnieszka; Kokot, Marek.
Afiliação
  • Deorowicz S; Faculty of Automatic Control, Electronics and Computer Science, Department of Algorithmics and Software, Silesian University of Technology, Gliwice PL-44100, Poland.
  • Danek A; Faculty of Automatic Control, Electronics and Computer Science, Department of Algorithmics and Software, Silesian University of Technology, Gliwice PL-44100, Poland.
  • Kokot M; Faculty of Automatic Control, Electronics and Computer Science, Department of Algorithmics and Software, Silesian University of Technology, Gliwice PL-44100, Poland.
Bioinformatics ; 37(19): 3358-3360, 2021 Oct 11.
Article em En | MEDLINE | ID: mdl-33787870
ABSTRACT

SUMMARY:

Variant Call Format (VCF) files with results of sequencing projects take a lot of space. We propose the VCFShark, which is able to compress VCF files up to an order of magnitude better than the de facto standards (gzipped VCF and BCF). The advantage over competitors is the greatest when compressing VCF files containing large amounts of genotype data. The processing speeds up to 100 MB/s and main memory requirements lower than 30 GB allow to use our tool at typical workstations even for large datasets. AVAILABILITY AND IMPLEMENTATION https//github.com/refresh-bio/vcfshark. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2021 Tipo de documento: Article