Your browser doesn't support javascript.
loading
MTSv: rapid alignment-based taxonomic classification and high-confidence metagenomic analysis.
Furstenau, Tara N; Schneider, Tsosie; Shaffer, Isaac; Vazquez, Adam J; Sahl, Jason; Fofanov, Viacheslav.
Affiliation
  • Furstenau TN; School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, Arizona, United States.
  • Schneider T; School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, Arizona, United States.
  • Shaffer I; School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, Arizona, United States.
  • Vazquez AJ; Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States.
  • Sahl J; Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States.
  • Fofanov V; School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, Arizona, United States.
PeerJ ; 10: e14292, 2022.
Article in En | MEDLINE | ID: mdl-36389404
ABSTRACT
As the size of reference sequence databases and high-throughput sequencing datasets continue to grow, it is becoming computationally infeasible to use traditional alignment to large genome databases for taxonomic classification of metagenomic reads. Exact matching approaches can rapidly assign taxonomy and summarize the composition of microbial communities, but they sacrifice accuracy and can lead to false positives. Full alignment tools provide higher confidence assignments and can assign sequences from genomes that diverge from reference sequences; however, full alignment tools are computationally intensive. To address this, we designed MTSv specifically for alignment-based taxonomic assignment in metagenomic analysis. This tool implements an FM-index assisted q-gram filter and SIMD accelerated Smith-Waterman algorithm to find alignments. However, unlike traditional aligners, MTSv will not attempt to make additional alignments to a TaxID once an alignment of sufficient quality has been found. This improves efficiency when many reference sequences are available per taxon. MTSv was designed to be flexible and can be modified to run on either memory or processor constrained systems. Although MTSv cannot compete with the speeds of exact k-mer matching approaches, it is reasonably fast and has higher precision than popular exact matching approaches. Because MTSv performs a full alignment it can classify reads even when the genomes share low similarity with reference sequences and provides a tool for high confidence pathogen detection with low off-target assignments to near neighbor species.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Metagenome Language: En Journal: PeerJ Year: 2022 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Metagenome Language: En Journal: PeerJ Year: 2022 Document type: Article Affiliation country:
...