Systematic exploration of error sources in pyrosequencing flowgram data.
Balzer, Susanne; Malde, Ketil; Jonassen, Inge.
Affiliation
  • Balzer S; Institute of Marine Research, P.O. Box 1870, N-5817 Bergen, Norway. susanne.balzer@imr.no
Bioinformatics; 27(13): i304-9, 2011 Jul 01.
Article in English | MEDLINE | ID: mdl-21685085
MOTIVATION: 454 pyrosequencing, by Roche Diagnostics, has emerged as an alternative to Sanger sequencing when it comes to read lengths, performance and cost, but shows higher per-base error rates. Although there are several tools available for noise removal, targeting different application fields, data interpretation would benefit from a better understanding of the different error types.

RESULTS: By exploring 454 raw data, we quantify to what extent different factors account for sequencing errors. In addition to the well-known homopolymer length inaccuracies, we have identified errors likely to originate from other stages of the sequencing process. We use our findings to extend the flowsim pipeline with functionalities to simulate these errors, and thus enable a more realistic simulation of 454 pyrosequencing data with flowsim.

AVAILABILITY: The flowsim pipeline is freely available under the General Public License from http://biohaskell.org/Applications/FlowSim.

CONTACT: susanne.balzer@imr.no.

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Sequence Analysis, DNA / High-Throughput Nucleotide Sequencing | Study type: Prognostic_studies | Limits: Animals | Language: English | Publication year: 2011 | Document type: Article
