Your browser doesn't support javascript.
loading
Streamlining remote nanopore data access with slow5curl.
Wong, Bonson; Ferguson, James M; Do, Jessica Y; Gamaarachchi, Hasindu; Deveson, Ira W.
Afiliación
  • Wong B; Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia.
  • Ferguson JM; Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute,Sydney, NSW 2010, Australia.
  • Do JY; School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia.
  • Gamaarachchi H; Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia.
  • Deveson IW; Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute,Sydney, NSW 2010, Australia.
Gigascience ; 132024 01 02.
Article en En | MEDLINE | ID: mdl-38608279
ABSTRACT

BACKGROUND:

As adoption of nanopore sequencing technology continues to advance, the need to maintain large volumes of raw current signal data for reanalysis with updated algorithms is a growing challenge. Here we introduce slow5curl, a software package designed to streamline nanopore data sharing, accessibility, and reanalysis.

RESULTS:

Slow5curl allows a user to fetch a specified read or group of reads from a raw nanopore dataset stored on a remote server, such as a public data repository, without downloading the entire file. Slow5curl uses an index to quickly fetch specific reads from a large dataset in SLOW5/BLOW5 format and highly parallelized data access requests to maximize download speeds. Using all public nanopore data from the Human Pangenome Reference Consortium (>22 TB), we demonstrate how slow5curl can be used to quickly fetch and reanalyze raw signal reads corresponding to a set of target genes from each individual in large cohort dataset (n = 91), minimizing the time, egress costs, and local storage requirements for their reanalysis.

CONCLUSIONS:

We provide slow5curl as a free, open-source package that will reduce frictions in data sharing for the nanopore community https//github.com/BonsonW/slow5curl.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Nanoporos / Secuenciación de Nanoporos Límite: Humans Idioma: En Revista: Gigascience Año: 2024 Tipo del documento: Article País de afiliación: Australia

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Nanoporos / Secuenciación de Nanoporos Límite: Humans Idioma: En Revista: Gigascience Año: 2024 Tipo del documento: Article País de afiliación: Australia