Your browser doesn't support javascript.
loading
metaBEETL: high-throughput analysis of heterogeneous microbial populations from shotgun DNA sequences.
Ander, Christina; Schulz-Trieglaff, Ole B; Stoye, Jens; Cox, Anthony J.
Affiliation
  • Ander C; Computational Biology Group, Illumina Cambridge Ltd,, Chesterford Research Park, Little Chesterford, Essex, United Kingdom.
BMC Bioinformatics ; 14 Suppl 5: S2, 2013.
Article in En | MEDLINE | ID: mdl-23734710
ABSTRACT
Environmental shotgun sequencing (ESS) has potential to give greater insight into microbial communities than targeted sequencing of 16S regions, but requires much higher sequence coverage. The advent of next-generation sequencing has made it feasible for the Human Microbiome Project and other initiatives to generate ESS data on a large scale, but computationally efficient methods for analysing such data sets are needed.Here we present metaBEETL, a fast taxonomic classifier for environmental shotgun sequences. It uses a Burrows-Wheeler Transform (BWT) index of the sequencing reads and an indexed database of microbial reference sequences. Unlike other BWT-based tools, our method has no upper limit on the number or the total size of the reference sequences in its database. By capturing sequence relationships between strains, our reference index also allows us to classify reads which are not unique to an individual strain but are nevertheless specific to some higher phylogenetic order.Tested on datasets with known taxonomic composition, metaBEETL gave results that are competitive with existing similarity-based tools due to normalization steps which other classifiers lack, the taxonomic profile computed by metaBEETL closely matched the true environmental profile. At the same time, its moderate running time and low memory footprint allow metaBEETL to scale well to large data sets.Code to construct the BWT indexed database and for the taxonomic classification is part of the BEETL library, available as a github repository at git@github.comBEETL/BEETL.git.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Sequence Analysis, DNA / Metagenomics / High-Throughput Nucleotide Sequencing / Microbiota Limits: Humans Language: En Journal: BMC Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2013 Document type: Article Affiliation country: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Sequence Analysis, DNA / Metagenomics / High-Throughput Nucleotide Sequencing / Microbiota Limits: Humans Language: En Journal: BMC Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2013 Document type: Article Affiliation country: United kingdom