Your browser doesn't support javascript.
loading
Long-read metabarcoding of the eukaryotic rDNA operon to phylogenetically and taxonomically resolve environmental diversity.
Jamy, Mahwash; Foster, Rachel; Barbera, Pierre; Czech, Lucas; Kozlov, Alexey; Stamatakis, Alexandros; Bending, Gary; Hilton, Sally; Bass, David; Burki, Fabien.
Affiliation
  • Jamy M; Science for Life Laboratory, Program in Systematic Biology, Uppsala University, Uppsala, Sweden.
  • Foster R; Department of Life Sciences, Natural History Museum, London, UK.
  • Barbera P; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Czech L; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Kozlov A; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Stamatakis A; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Bending G; Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany.
  • Hilton S; School of Life Sciences, The University of Warwick, Coventry, UK.
  • Bass D; School of Life Sciences, The University of Warwick, Coventry, UK.
  • Burki F; Department of Life Sciences, Natural History Museum, London, UK.
Mol Ecol Resour ; 20(2): 429-443, 2020 Mar.
Article in En | MEDLINE | ID: mdl-31705734
High-throughput DNA metabarcoding of amplicon sizes below 500 bp has revolutionized the analysis of environmental microbial diversity. However, these short regions contain limited phylogenetic signal, which makes it impractical to use environmental DNA in full phylogenetic inferences. This lesser phylogenetic resolution of short amplicons may be overcome by new long-read sequencing technologies. To test this idea, we amplified soil DNA and used PacBio Circular Consensus Sequencing (CCS) to obtain an ~4500-bp region spanning most of the eukaryotic small subunit (18S) and large subunit (28S) ribosomal DNA genes. We first treated the CCS reads with a novel curation workflow, generating 650 high-quality operational taxonomic units (OTUs) containing the physically linked 18S and 28S regions. To assign taxonomy to these OTUs, we developed a phylogeny-aware approach based on the 18S region that showed greater accuracy and sensitivity than similarity-based methods. The taxonomically annotated OTUs were then combined with available 18S and 28S reference sequences to infer a well-resolved phylogeny spanning all major groups of eukaryotes, allowing us to accurately derive the evolutionary origin of environmental diversity. A total of 1,019 sequences were included, of which a majority (58%) corresponded to the new long environmental OTUs. The long reads also allowed us to directly investigate the relationships among environmental sequences themselves, which represents a key advantage over the placement of short reads on a reference phylogeny. Together, our results show that long amplicons can be treated in a full phylogenetic framework to provide greater taxonomic resolution and a robust evolutionary perspective to environmental DNA.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Phylogeny / Eukaryota Language: En Journal: Mol Ecol Resour Year: 2020 Document type: Article Affiliation country: Sweden Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Phylogeny / Eukaryota Language: En Journal: Mol Ecol Resour Year: 2020 Document type: Article Affiliation country: Sweden Country of publication: United kingdom