GTShark: genotype compression in large projects.
Bioinformatics
; 35(22): 4791-4793, 2019 11 01.
Article
en En
| MEDLINE
| ID: mdl-31225861
ABSTRACT
SUMMARY:
Nowadays large sequencing projects handle tens of thousands of individuals. The huge files summarizing the findings definitely require compression. We propose a tool able to compress large collections of genotypes almost 30% better than the best tool to date, i.e. squeezing human genotype to less than 62 KB. Moreover, it can also compress single samples in reference to the existing database achieving comparable results. AVAILABILITY AND IMPLEMENTATION https//github.com/refresh-bio/GTShark. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Texto completo:
1
Colección:
01-internacional
Banco de datos:
MEDLINE
Asunto principal:
Genómica
/
Compresión de Datos
Límite:
Humans
Idioma:
En
Revista:
Bioinformatics
Asunto de la revista:
INFORMATICA MEDICA
Año:
2019
Tipo del documento:
Article
País de afiliación:
Polonia