Your browser doesn't support javascript.
loading
Experiences Building Globus Genomics: A Next-Generation Sequencing Analysis Service using Galaxy, Globus, and Amazon Web Services.
Madduri, Ravi K; Sulakhe, Dinanath; Lacinski, Lukasz; Liu, Bo; Rodriguez, Alex; Chard, Kyle; Dave, Utpal J; Foster, Ian T.
Afiliação
  • Madduri RK; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Sulakhe D; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Lacinski L; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Liu B; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Rodriguez A; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Chard K; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Dave UJ; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
  • Foster IT; Computation Institute University of Chicago and Argonne National Laboratory Chicago, IL.
Concurr Comput ; 26(13): 2266-2279, 2014 Sep 10.
Article em En | MEDLINE | ID: mdl-25342933
ABSTRACT
We describe Globus Genomics, a system that we have developed for rapid analysis of large quantities of next-generation sequencing (NGS) genomic data. This system achieves a high degree of end-to-end automation that encompasses every stage of data analysis including initial data retrieval from remote sequencing centers or storage (via the Globus file transfer system); specification, configuration, and reuse of multi-step processing pipelines (via the Galaxy workflow system); creation of custom Amazon Machine Images and on-demand resource acquisition via a specialized elastic provisioner (on Amazon EC2); and efficient scheduling of these pipelines over many processors (via the HTCondor scheduler). The system allows biomedical researchers to perform rapid analysis of large NGS datasets in a fully automated manner, without software installation or a need for any local computing infrastructure. We report performance and cost results for some representative workloads.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2014 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2014 Tipo de documento: Article