Your browser doesn't support javascript.
loading
Fast and accurate mapping of Complete Genomics reads.
Lee, Donghyuk; Hormozdiari, Farhad; Xin, Hongyi; Hach, Faraz; Mutlu, Onur; Alkan, Can.
Afiliação
  • Lee D; Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA.
  • Hormozdiari F; Department of Computer Science, University of California Los Angeles, Los Angeles, CA, USA.
  • Xin H; Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA.
  • Hach F; School of Computing Science, Simon Fraser University, Burnaby, BC, Canada.
  • Mutlu O; Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA. Electronic address: onur@cmu.edu.
  • Alkan C; Department of Computer Engineering, Bilkent University, Ankara, Turkey. Electronic address: calkan@cs.bilkent.edu.tr.
Methods ; 79-80: 3-10, 2015 Jun.
Article em En | MEDLINE | ID: mdl-25461772
ABSTRACT
Many recent advances in genomics and the expectations of personalized medicine are made possible thanks to power of high throughput sequencing (HTS) in sequencing large collections of human genomes. There are tens of different sequencing technologies currently available, and each HTS platform have different strengths and biases. This diversity both makes it possible to use different technologies to correct for shortcomings; but also requires to develop different algorithms for each platform due to the differences in data types and error models. The first problem to tackle in analyzing HTS data for resequencing applications is the read mapping stage, where many tools have been developed for the most popular HTS methods, but publicly available and open source aligners are still lacking for the Complete Genomics (CG) platform. Unfortunately, Burrows-Wheeler based methods are not practical for CG data due to the gapped nature of the reads generated by this method. Here we provide a sensitive read mapper (sirFAST) for the CG technology based on the seed-and-extend paradigm that can quickly map CG reads to a reference genome. We evaluate the performance and accuracy of sirFAST using both simulated and publicly available real data sets, showing high precision and recall rates.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genômica / Sequenciamento de Nucleotídeos em Larga Escala Limite: Humans Idioma: En Revista: Methods Assunto da revista: BIOQUIMICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genômica / Sequenciamento de Nucleotídeos em Larga Escala Limite: Humans Idioma: En Revista: Methods Assunto da revista: BIOQUIMICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Estados Unidos