Your browser doesn't support javascript.
loading
Cnidaria: fast, reference-free clustering of raw and assembled genome and transcriptome NGS data.
Aflitos, Saulo Alves; Severing, Edouard; Sanchez-Perez, Gabino; Peters, Sander; de Jong, Hans; de Ridder, Dick.
Afiliação
  • Aflitos SA; Applied Bioinformatics, Plant Research International, Wageningen, The Netherlands. sauloalves.aflitos@wur.nl.
  • Severing E; Bioinformatics Group, Department of Plant Sciences, Wageningen University, Wageningen, The Netherlands. sauloalves.aflitos@wur.nl.
  • Sanchez-Perez G; Laboratory of Genetics, Wageningen University, Wageningen, The Netherlands. severing@mpipz.mpg.de.
  • Peters S; Applied Bioinformatics, Plant Research International, Wageningen, The Netherlands. gabino.sanchezperez@wur.nl.
  • de Jong H; Bioinformatics Group, Department of Plant Sciences, Wageningen University, Wageningen, The Netherlands. gabino.sanchezperez@wur.nl.
  • de Ridder D; Applied Bioinformatics, Plant Research International, Wageningen, The Netherlands. sander.peters@wur.nl.
BMC Bioinformatics ; 16: 352, 2015 Nov 02.
Article em En | MEDLINE | ID: mdl-26525298
ABSTRACT

BACKGROUND:

Identification of biological specimens is a requirement for a range of applications. Reference-free methods analyse unprocessed sequencing data without relying on prior knowledge, but generally do not scale to arbitrarily large genomes and arbitrarily large phylogenetic distances.

RESULTS:

We present Cnidaria, a practical tool for clustering genomic and transcriptomic data with no limitation on genome size or phylogenetic distances. We successfully simultaneously clustered 169 genomic and transcriptomic datasets from 4 kingdoms, achieving 100% identification accuracy at supra-species level and 78% accuracy at the species level.

CONCLUSION:

CNIDARIA allows for fast, resource-efficient comparison and identification of both raw and assembled genome and transcriptome data. This can help answer both fundamental (e.g. in phylogeny, ecological diversity analysis) and practical questions (e.g. sequencing quality control, primer design).
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Interface Usuário-Computador Limite: Animals Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Holanda

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Interface Usuário-Computador Limite: Animals Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Holanda