Qmatey: an automated pipeline for fast exact matching-based alignment and strain-level taxonomic binning and profiling of metagenomes.
Adams, Alison K; Kristy, Brandon D; Gorman, Myranda; Balint-Kurti, Peter; Yencho, G Craig; Olukolu, Bode A.
Affiliation
  • Adams AK; Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA.
  • Kristy BD; UT-ORNL Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN 37996, USA.
  • Gorman M; Department of Integrative Biology, Michigan State University, East Lansing, MI, USA.
  • Balint-Kurti P; W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USA.
  • Yencho GC; Department of Animal Science, University of Tennessee, Knoxville, TN 37996, USA.
  • Olukolu BA; College of Veterinary Medicine, University of Tennessee, Knoxville, TN 37996, USA.
Brief Bioinform; 24(6), 2023 Sep 22.
Article in English | MEDLINE | ID: mdl-37824740
ABSTRACT
Metagenomics is a powerful tool for understanding organismal interactions; however, classification, profiling and detection of interactions at the strain level remain challenging. We present an automated pipeline, quantitative metagenomic alignment and taxonomic exact matching (Qmatey), that performs fast exact matching-based alignment and integrates taxonomic binning and profiling. It interrogates large databases without using metagenome-assembled genomes, curated pan-genes or k-mer spectra, which limit resolution. Qmatey minimizes misclassification and maintains strain-level resolution by using only diagnostic reads, as shown in the analysis of amplicon, quantitative reduced representation and shotgun sequencing datasets. Using Qmatey to analyze shotgun data from a synthetic community with 35% of the 26 strains at low abundance (0.01-0.06%), we observed a remarkable 85-96% strain recall and 92-100% species recall while maintaining 100% precision. Benchmarking revealed that the highly ranked Kraken2 and KrakenUniq tools identified 2-4 more taxa (92-100% recall) than Qmatey but produced 315-1752 false positive taxa and incurred a high penalty on precision (1-8%). The speed, accuracy and precision of the Qmatey pipeline position it as a valuable tool for broad-spectrum profiling and for uncovering biologically relevant interactions.
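The recall and precision figures in the abstract follow from the standard definitions, recall = TP / (TP + FN) and precision = TP / (TP + FP). The sketch below is only an illustration of that arithmetic. The per-tool counts are hypothetical values chosen to fall inside the ranges the abstract reports; they are not taken from the paper, and the tool labels are placeholders.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of reported taxa that are truly present in the sample."""
    total = tp + fp
    return tp / total if total else 0.0


def recall(tp: int, fn: int) -> float:
    """Fraction of truly present taxa that were reported."""
    total = tp + fn
    return tp / total if total else 0.0


# The abstract states the synthetic community has 26 strains.
EXPECTED_STRAINS = 26

# Hypothetical counts (assumptions for illustration, not reported results).
profiles = {
    "exact-matching profiler": {"tp": 24, "fp": 0},
    "k-mer profiler": {"tp": 26, "fp": 315},
}

for name, counts in profiles.items():
    fn = EXPECTED_STRAINS - counts["tp"]  # strains present but not detected
    p = precision(counts["tp"], counts["fp"])
    r = recall(counts["tp"], fn)
    print(f"{name}: recall={r:.1%}, precision={p:.1%}")
```

With these assumed counts, detecting two extra strains raises recall only slightly, while a few hundred false positive taxa push precision below 10%, which is the trade-off the benchmarking sentence describes.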

Full text: 1 Database: MEDLINE Main subject: Metagenome / Metagenomics Language: En Journal: Brief Bioinform Journal subject: BIOLOGY / MEDICAL INFORMATICS Publication year: 2023 Document type: Article Country of affiliation: United States