Tackling soil diversity with the assembly of large, complex metagenomes.

Howe, Adina Chuang; Jansson, Janet K; Malfatti, Stephanie A; Tringe, Susannah G; Tiedje, James M; Brown, C Titus

Howe, Adina Chuang; Jansson, Janet K; Malfatti, Stephanie A; Tringe, Susannah G; Tiedje, James M; Brown, C Titus.

Affiliation

Howe AC; Departments of Microbiology and Molecular Genetics and Computer Science and Engineering, Michigan State University, East Lansing, MI 48824.

Proc Natl Acad Sci U S A ; 111(13): 4904-9, 2014 Apr 01.

Article in En | MEDLINE | ID: mdl-24632729

ABSTRACT

The large volumes of sequencing data required to sample deeply the microbial communities of complex environments pose new challenges to sequence analysis. De novo metagenomic assembly effectively reduces the total amount of data to be analyzed but requires substantial computational resources. We combine two preassembly filtering approaches--digital normalization and partitioning--to generate previously intractable large metagenome assemblies. Using a human-gut mock community dataset, we demonstrate that these methods result in assemblies nearly identical to assemblies from unprocessed data. We then assemble two large soil metagenomes totaling 398 billion bp (equivalent to 88,000 Escherichia coli genomes) from matched Iowa corn and native prairie soils. The resulting assembled contigs could be used to identify molecular interactions and reaction networks of known metabolic pathways using the Kyoto Encyclopedia of Genes and Genomes Orthology database. Nonetheless, more than 60% of predicted proteins in assemblies could not be annotated against known databases. Many of these unknown proteins were abundant in both corn and prairie soils, highlighting the benefits of assembly for the discovery and characterization of novelty in soil biodiversity. Moreover, 80% of the sequencing data could not be assembled because of low coverage, suggesting that considerably more sequencing data are needed to characterize the functional content of soil.

Subject(s)

Biodiversity; Metagenome/genetics; Soil Microbiology; Soil; Gastrointestinal Tract/microbiology; Humans; Iowa; Species Specificity; Zea mays/genetics

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Database: MEDLINE Main subject: Soil / Soil Microbiology / Biodiversity / Metagenome Limits: Humans Country/Region as subject: America do norte Language: En Journal: Proc Natl Acad Sci U S A Year: 2014 Type: Article

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google