Your browser doesn't support javascript.
loading
Efficient dynamic variation graphs.
Eizenga, Jordan M; Novak, Adam M; Kobayashi, Emily; Villani, Flavia; Cisar, Cecilia; Heumos, Simon; Hickey, Glenn; Colonna, Vincenza; Paten, Benedict; Garrison, Erik.
Afiliação
  • Eizenga JM; Genomics Institute, Santa Cruz, CA 95064, USA.
  • Novak AM; Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Kobayashi E; Genomics Institute, Santa Cruz, CA 95064, USA.
  • Villani F; Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
  • Cisar C; Genomics Institute, Santa Cruz, CA 95064, USA.
  • Heumos S; Bioinformatics and Systems Biology, University of California San Diego, La Jolla, CA 92093, USA.
  • Hickey G; Institute of Genetics and Biophysics, Consiglio Nazionale di Ricerche, Naples 80131, Italy.
  • Colonna V; Biotecnologie Mediche, Università degli Studi di Napoli Federico II, Naples 80138,Italy.
  • Paten B; Genomics Institute, Santa Cruz, CA 95064, USA.
  • Garrison E; Biomolecular Engineering and Bioinformatics, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
Bioinformatics ; 36(21): 5139-5144, 2021 01 29.
Article em En | MEDLINE | ID: mdl-33040146
ABSTRACT
MOTIVATION Pangenomics is a growing field within computational genomics. Many pangenomic analyses use bidirected sequence graphs as their core data model. However, implementing and correctly using this data model can be difficult, and the scale of pangenomic datasets can be challenging to work at. These challenges have impeded progress in this field.

RESULTS:

Here, we present a stack of two C++ libraries, libbdsg and libhandlegraph, which use a simple, field-proven interface, designed to expose elementary features of these graphs while preventing common graph manipulation mistakes. The libraries also provide a Python binding. Using a diverse collection of pangenome graphs, we demonstrate that these tools allow for efficient construction and manipulation of large genome graphs with dense variation. For instance, the speed and memory usage are up to an order of magnitude better than the prior graph implementation in the VG toolkit, which has now transitioned to using libbdsg's implementations. AVAILABILITY AND IMPLEMENTATION libhandlegraph and libbdsg are available under an MIT License from https//github.com/vgteam/libhandlegraph and https//github.com/vgteam/libbdsg.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Bibliotecas Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Bibliotecas Idioma: En Ano de publicação: 2021 Tipo de documento: Article