Your browser doesn't support javascript.
loading
De novo meta-assembly of ultra-deep sequencing data.
Mirebrahim, Hamid; Close, Timothy J; Lonardi, Stefano.
Afiliación
  • Mirebrahim H; Department of Computer Science and Engineering and Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA.
  • Close TJ; Department of Computer Science and Engineering and Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA.
  • Lonardi S; Department of Computer Science and Engineering and Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA.
Bioinformatics ; 31(12): i9-16, 2015 Jun 15.
Article en En | MEDLINE | ID: mdl-26072514
ABSTRACT
UNLABELLED We introduce a new divide and conquer approach to deal with the problem of de novo genome assembly in the presence of ultra-deep sequencing data (i.e. coverage of 1000x or higher). Our proposed meta-assembler Slicembler partitions the input data into optimal-sized 'slices' and uses a standard assembly tool (e.g. Velvet, SPAdes, IDBA_UD and Ray) to assemble each slice individually. Slicembler uses majority voting among the individual assemblies to identify long contigs that can be merged to the consensus assembly. To improve its efficiency, Slicembler uses a generalized suffix tree to identify these frequent contigs (or fraction thereof). Extensive experimental results on real ultra-deep sequencing data (8000x coverage) and simulated data show that Slicembler significantly improves the quality of the assembly compared with the performance of the base assembler. In fact, most of the times, Slicembler generates error-free assemblies. We also show that Slicembler is much more resistant against high sequencing error rate than the base assembler. AVAILABILITY AND IMPLEMENTATION Slicembler can be accessed at http//slicembler.cs.ucr.edu/.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Hordeum / Algoritmos / Análisis de Secuencia de ADN / Genes de Plantas / Secuenciación de Nucleótidos de Alto Rendimiento Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2015 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Hordeum / Algoritmos / Análisis de Secuencia de ADN / Genes de Plantas / Secuenciación de Nucleótidos de Alto Rendimiento Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2015 Tipo del documento: Article País de afiliación: Estados Unidos