Your browser doesn't support javascript.
loading
Building Portable and Reproducible Cancer Informatics Workflows: An RNA Sequencing Case Study.
Kaushik, Gaurav; Davis-Dusenbery, Brandi.
Afiliação
  • Kaushik G; Foundation Medicine, Cambridge, MA, USA.
  • Davis-Dusenbery B; Seven Bridges Genomics, 1 Main Street, Cambridge, 02142, MA, USA. brandi@sbgenomics.com.
Methods Mol Biol ; 1878: 39-64, 2019.
Article em En | MEDLINE | ID: mdl-30378068
The Seven Bridges Cancer Genomics Cloud (CGC) is part of the National Cancer Institute Cloud Resource project, which was created to explore the paradigm of co-locating massive datasets with the computational resources to analyze them. The CGC was designed to allow researchers to easily find the data they need and analyze it with robust applications in a scalable and reproducible fashion. To enable this, individual tools are packaged within Docker containers and described by the Common Workflow Language (CWL), an emerging standard for enabling reproducible data analysis. On the CGC, researchers can deploy individual tools and customize massive workflows by chaining together tools. Here, we discuss a case study in which RNA sequencing data is analyzed with different methods and compared on the Seven Bridges CGC. We highlight best practices for designing command line tools, Docker containers, and CWL descriptions to enable massively parallelized and reproducible biomedical computation with cloud resources.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: RNA / Neoplasias Tipo de estudo: Guideline Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: RNA / Neoplasias Tipo de estudo: Guideline Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article