Your browser doesn't support javascript.
loading
Experiences with workflows for automating data-intensive bioinformatics.
Spjuth, Ola; Bongcam-Rudloff, Erik; Hernández, Guillermo Carrasco; Forer, Lukas; Giovacchini, Mario; Guimera, Roman Valls; Kallio, Aleksi; Korpelainen, Eija; Kandula, Maciej M; Krachunov, Milko; Kreil, David P; Kulev, Ognyan; Labaj, Pawel P; Lampa, Samuel; Pireddu, Luca; Schönherr, Sebastian; Siretskiy, Alexey; Vassilev, Dimitar.
Afiliação
  • Spjuth O; Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, SE-75124, Uppsala, P.O. Box 591, Sweden. ola.spjuth@farmbio.uu.se.
  • Bongcam-Rudloff E; SLU-Global Bioinformatics Centre, Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden. Erik.Bongcam@slu.se.
  • Hernández GC; Science for Life Laboratory, Karolinska Institutet, SE-17121, Stockholm, P.O. Box 1031, Sweden. guillermo.carrasco@scilifelab.se.
  • Forer L; Division of Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, 6020, Austria. lukas.forer@i-med.ac.at.
  • Giovacchini M; Science for Life Laboratory, Karolinska Institutet, SE-17121, Stockholm, P.O. Box 1031, Sweden. mario.giovacchini@scilifelab.se.
  • Guimera RV; Science for Life Laboratory, Karolinska Institutet, SE-17121, Stockholm, P.O. Box 1031, Sweden. brainstorm@nopcode.org.
  • Kallio A; CSC - IT Center for Science Ltd., FI-02101, Espoo, P.O. Box 405, Finland. aleksi.kallio@csc.fi.
  • Korpelainen E; CSC - IT Center for Science Ltd., FI-02101, Espoo, P.O. Box 405, Finland. eija.korpelainen@csc.fi.
  • Kandula MM; Chair of Bioinformatics Research Group, Boku University, Vienna, Austria. maciej.kandula@boku.ac.at.
  • Krachunov M; Faculty of Mathematics and Informatics, Sofia University, Sofia, Bulgaria. wfxp@milko.3mhz.net.
  • Kreil DP; Chair of Bioinformatics Research Group, Boku University, Vienna, Austria. david.kreil@boku.ac.at.
  • Kulev O; Faculty of Mathematics and Informatics, Sofia University, Sofia, Bulgaria. okulev@fmi.uni-sofia.bg.
  • Labaj PP; Chair of Bioinformatics Research Group, Boku University, Vienna, Austria. pawel.labaj@boku.ac.at.
  • Lampa S; Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, SE-75124, Uppsala, P.O. Box 591, Sweden. samuel.lampa@it.uu.se.
  • Pireddu L; CRS4 Polaris, Pula, Italy. luca.pireddu@crs4.it.
  • Schönherr S; Division of Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, 6020, Austria. sebastian.schoenherr@i-med.ac.at.
  • Siretskiy A; Department of Information Technology, Uppsala University, SE-75105, Uppsala, P.O. Box 337, Sweden. alexey.siretskiy@it.uu.se.
  • Vassilev D; AgroBioInstitute and Joint Genomic Centre, Sofia, Bulgaria. jim6329@gmail.com.
Biol Direct ; 10: 43, 2015 Aug 19.
Article em En | MEDLINE | ID: mdl-26282399
High-throughput technologies, such as next-generation sequencing, have turned molecular biology into a data-intensive discipline, requiring bioinformaticians to use high-performance computing resources and carry out data management and analysis tasks on large scale. Workflow systems can be useful to simplify construction of analysis pipelines that automate tasks, support reproducibility and provide measures for fault-tolerance. However, workflow systems can incur significant development and administration overhead so bioinformatics pipelines are often still built without them. We present the experiences with workflows and workflow systems within the bioinformatics community participating in a series of hackathons and workshops of the EU COST action SeqAhead. The organizations are working on similar problems, but we have addressed them with different strategies and solutions. This fragmentation of efforts is inefficient and leads to redundant and incompatible solutions. Based on our experiences we define a set of recommendations for future systems to enable efficient yet simple bioinformatics workflow construction and execution.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento Eletrônico de Dados / Biologia Computacional / Fluxo de Trabalho Tipo de estudo: Prognostic_studies Idioma: En Revista: Biol Direct Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Suécia

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento Eletrônico de Dados / Biologia Computacional / Fluxo de Trabalho Tipo de estudo: Prognostic_studies Idioma: En Revista: Biol Direct Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Suécia