Your browser doesn't support javascript.
loading
PaSS: a sequencing simulator for PacBio sequencing.
Zhang, Wenmin; Jia, Ben; Wei, Chaochun.
Afiliação
  • Zhang W; Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
  • Jia B; Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
  • Wei C; Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China. ccwei@sjtu.edu.cn.
BMC Bioinformatics ; 20(1): 352, 2019 Jun 21.
Article em En | MEDLINE | ID: mdl-31226925
ABSTRACT

BACKGROUND:

Third-generation sequencing platforms, such as PacBio sequencing, have been developed rapidly in recent years. PacBio sequencing generates much longer reads than the second-generation sequencing (or the next generation sequencing, NGS) technologies and it has unique sequencing error patterns. An effective read simulator is essential to evaluate and promote the development of new bioinformatics tools for PacBio sequencing data analysis.

RESULTS:

We developed a new PacBio Sequencing Simulator (PaSS). It can learn sequence patterns from PacBio sequencing data currently available. In addition to the distribution of read lengths and error rates, we included a context-specific sequencing error model. Compared to existing PacBio sequencing simulators such as PBSIM, LongISLND and NPBSS, PaSS performed better in many aspects. Assembly tests also suggest that reads simulated by PaSS are the most similar to experimental sequencing data.

CONCLUSION:

PaSS is an effective sequence simulator for PacBio sequencing. It will facilitate the evaluation and development of new analysis tools for the third-generation sequencing data.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Análise de Sequência de DNA / Sequenciamento de Nucleotídeos em Larga Escala Limite: Animals Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Análise de Sequência de DNA / Sequenciamento de Nucleotídeos em Larga Escala Limite: Animals Idioma: En Ano de publicação: 2019 Tipo de documento: Article