Your browser doesn't support javascript.
loading
NxTrim: optimized trimming of Illumina mate pair reads.
O'Connell, Jared; Schulz-Trieglaff, Ole; Carlson, Emma; Hims, Matthew M; Gormley, Niall A; Cox, Anthony J.
Afiliación
  • O'Connell J; Computational Biology Group and Technology Development, Illumina Cambridge Ltd., Chesterford Research Park, Little Chesterford, Essex CB10 1XL, UK.
  • Schulz-Trieglaff O; Computational Biology Group and Technology Development, Illumina Cambridge Ltd., Chesterford Research Park, Little Chesterford, Essex CB10 1XL, UK.
  • Carlson E; Computational Biology Group and Technology Development, Illumina Cambridge Ltd., Chesterford Research Park, Little Chesterford, Essex CB10 1XL, UK.
  • Hims MM; Computational Biology Group and Technology Development, Illumina Cambridge Ltd., Chesterford Research Park, Little Chesterford, Essex CB10 1XL, UK.
  • Gormley NA; Computational Biology Group and Technology Development, Illumina Cambridge Ltd., Chesterford Research Park, Little Chesterford, Essex CB10 1XL, UK.
  • Cox AJ; Computational Biology Group and Technology Development, Illumina Cambridge Ltd., Chesterford Research Park, Little Chesterford, Essex CB10 1XL, UK.
Bioinformatics ; 31(12): 2035-7, 2015 Jun 15.
Article en En | MEDLINE | ID: mdl-25661542
MOTIVATION: Mate pair protocols add to the utility of paired-end sequencing by boosting the genomic distance spanned by each pair of reads, potentially allowing larger repeats to be bridged and resolved. The Illumina Nextera Mate Pair (NMP) protocol uses a circularization-based strategy that leaves behind 38-bp adapter sequences, which must be computationally removed from the data. While 'adapter trimming' is a well-studied area of bioinformatics, existing tools do not fully exploit the particular properties of NMP data and discard more data than is necessary. RESULTS: We present NxTrim, a tool that strives to discard as little sequence as possible from NMP reads. NxTrim makes full use of the sequence on both sides of the adapter site to build 'virtual libraries' of mate pairs, paired-end reads and single-ended reads. For bacterial data, we show that aggregating these datasets allows a single NMP library to yield an assembly whose quality compares favourably to that obtained from regular paired-end reads. AVAILABILITY AND IMPLEMENTATION: The source code is available at https://github.com/sequencing/NxTrim
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Bacterias / Programas Informáticos / Genoma Bacteriano / Análisis de Secuencia de ADN / Biología Computacional / Secuenciación de Nucleótidos de Alto Rendimiento Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2015 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Bacterias / Programas Informáticos / Genoma Bacteriano / Análisis de Secuencia de ADN / Biología Computacional / Secuenciación de Nucleótidos de Alto Rendimiento Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2015 Tipo del documento: Article