Your browser doesn't support javascript.
loading
Aligning optical maps to de Bruijn graphs.
Mukherjee, Kingshuk; Alipanahi, Bahar; Kahveci, Tamer; Salmela, Leena; Boucher, Christina.
Afiliación
  • Mukherjee K; Department of Computer and Information Science and Engineering, College of Engineering, University of Florida, Gainesville, USA.
  • Alipanahi B; Department of Computer and Information Science and Engineering, College of Engineering, University of Florida, Gainesville, USA.
  • Kahveci T; Department of Computer and Information Science and Engineering, College of Engineering, University of Florida, Gainesville, USA.
  • Salmela L; Department of Computer Science, Helsinki Institute for Information Technology HIIT, University of Helsinki, Helsinki, Finland.
  • Boucher C; Department of Computer and Information Science and Engineering, College of Engineering, University of Florida, Gainesville, USA.
Bioinformatics ; 35(18): 3250-3256, 2019 09 15.
Article en En | MEDLINE | ID: mdl-30698651
ABSTRACT
MOTIVATION Optical maps are high-resolution restriction maps (Rmaps) that give a unique numeric representation to a genome. Used in concert with sequence reads, they provide a useful tool for genome assembly and for discovering structural variations and rearrangements. Although they have been a regular feature of modern genome assembly projects, optical maps have been mainly used in post-processing step and not in the genome assembly process itself. Several methods have been proposed for pairwise alignment of single molecule optical maps-called Rmaps, or for aligning optical maps to assembled reads. However, the problem of aligning an Rmap to a graph representing the sequence data of the same genome has not been studied before. Such an alignment provides a mapping between two sets of data optical maps and sequence data which will facilitate the usage of optical maps in the sequence assembly step itself.

RESULTS:

We define the problem of aligning an Rmap to a de Bruijn graph and present the first algorithm for solving this problem which is based on a seed-and-extend approach. We demonstrate that our method is capable of aligning 73% of Rmaps generated from the Escherichia coli genome to the de Bruijn graph constructed from short reads generated from the same genome. We validate the alignments and show that our method achieves an accuracy of 99.6%. We also show that our method scales to larger genomes. In particular, we show that 76% of Rmaps can be aligned to the de Bruijn graph in the case of human data. AVAILABILITY AND IMPLEMENTATION The software for aligning optical maps to de Bruijn graph, omGraph is written in C++ and is publicly available under GNU General Public License at https//github.com/kingufl/omGraph. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Programas Informáticos Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Programas Informáticos Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: Estados Unidos