Your browser doesn't support javascript.
loading
BigDataScript: a scripting language for data pipelines.
Cingolani, Pablo; Sladek, Rob; Blanchette, Mathieu.
  • Cingolani P; McGill University School of Computer Science, 3480 University Street, Montreal, Québec H3A 0E9 and McGill University and Génome Québec Innovation Centre, 740 Dr. Penfield Avenue, Montréal, Québec H3A 0G1, Canada McGill University School of Computer Science, 3480 University Street, Montreal, Québec H3A 0E9 and McGill University and Génome Québec Innovation Centre, 740 Dr. Penfield Avenue, Montréal, Québec H3A 0G1, Canada.
  • Sladek R; McGill University School of Computer Science, 3480 University Street, Montreal, Québec H3A 0E9 and McGill University and Génome Québec Innovation Centre, 740 Dr. Penfield Avenue, Montréal, Québec H3A 0G1, Canada.
  • Blanchette M; McGill University School of Computer Science, 3480 University Street, Montreal, Québec H3A 0E9 and McGill University and Génome Québec Innovation Centre, 740 Dr. Penfield Avenue, Montréal, Québec H3A 0G1, Canada.
Bioinformatics ; 31(1): 10-6, 2015 Jan 01.
Article en En | MEDLINE | ID: mdl-25189778
ABSTRACT
MOTIVATION The analysis of large biological datasets often requires complex processing pipelines that run for a long time on large computational infrastructures. We designed and implemented a simple script-like programming language with a clean and minimalist syntax to develop and manage pipeline execution and provide robustness to various types of software and hardware failures as well as portability.

RESULTS:

We introduce the BigDataScript (BDS) programming language for data processing pipelines, which improves abstraction from hardware resources and assists with robustness. Hardware abstraction allows BDS pipelines to run without modification on a wide range of computer architectures, from a small laptop to multi-core servers, server farms, clusters and clouds. BDS achieves robustness by incorporating the concepts of absolute serialization and lazy processing, thus allowing pipelines to recover from errors. By abstracting pipeline concepts at programming language level, BDS simplifies implementation, execution and management of complex bioinformatics pipelines, resulting in reduced development and debugging cycles as well as cleaner code. AVAILABILITY AND IMPLEMENTATION BigDataScript is available under open-source license at http//pcingola.github.io/BigDataScript.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Lenguajes de Programación / Programas Informáticos / Biología Computacional / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Humans Idioma: En Año: 2015 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Lenguajes de Programación / Programas Informáticos / Biología Computacional / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Humans Idioma: En Año: 2015 Tipo del documento: Article