Your browser doesn't support javascript.
loading
FAIR privacy-preserving operation of large genomic variant calling format (VCF) data without download or installation.
Martins, Yasmmin C; Bhawsar, Praphulla Ms; Balasubramanian, Jeya B; Russ, Daniel; Wong, Wendy Sw; Maass, Wolfgang; Almeida, Jonas S.
Afiliação
  • Martins YC; National Laboratory of Scientific Computing, Petrópolis, Brazil.
  • Bhawsar PM; Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA.
  • Balasubramanian JB; Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA.
  • Russ D; Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA.
  • Wong WS; Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA.
  • Maass W; Saarland University, 66123 Saarbrücken, Germany.
  • Almeida JS; Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA.
Article em En | MEDLINE | ID: mdl-38827109
ABSTRACT
Motivation The proliferation of genetic testing and consumer genomics represents a logistic challenge to the personalized use of GWAS data in VCF format. Specifically, the challenge of retrieving target genetic variation from large compressed files filled with unrelated variation information. Compounding the data traversal challenge, privacy-sensitive VCF files are typically managed as large stand-alone single files (no companion index file) composed of variable-sized compressed chunks, hosted in consumer-facing environments with no native support for hosted execution.

Results:

A portable JavaScript module was developed to support in-browser fetching of partial content using byte-range requests. This includes on-the-fly decompressing irregularly positioned compressed chunks, coupled with a binary search algorithm iteratively identifying chromosome-position ranges. The in-browser zero-footprint solution (no downloads, no installations) enables the interoperability, reusability, and user-facing governance advanced by the FAIR principles for stewardship of scientific data. Availability - https//episphere.github.io/vcf, including supplementary material.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article