Your browser doesn't support javascript.
loading
DL-TODA: A Deep Learning Tool for Omics Data Analysis.
Cres, Cecile M; Tritt, Andrew; Bouchard, Kristofer E; Zhang, Ying.
Afiliação
  • Cres CM; Department of Cell and Molecular Biology, College of the Environment and Life Sciences, University of Rhode Island, Kingston, RI 02881, USA.
  • Tritt A; Lawrence Berkeley National Laboratory, Scientific Data Division, Berkeley, CA 94720, USA.
  • Bouchard KE; Lawrence Berkeley National Laboratory, Applied Mathematics & Computational Research Division, Berkeley, CA 94720, USA.
  • Zhang Y; Lawrence Berkeley National Laboratory, Scientific Data Division, Berkeley, CA 94720, USA.
Biomolecules ; 13(4)2023 03 24.
Article em En | MEDLINE | ID: mdl-37189333
ABSTRACT
Metagenomics is a technique for genome-wide profiling of microbiomes; this technique generates billions of DNA sequences called reads. Given the multiplication of metagenomic projects, computational tools are necessary to enable the efficient and accurate classification of metagenomic reads without needing to construct a reference database. The program DL-TODA presented here aims to classify metagenomic reads using a deep learning model trained on over 3000 bacterial species. A convolutional neural network architecture originally designed for computer vision was applied for the modeling of species-specific features. Using synthetic testing data simulated with 2454 genomes from 639 species, DL-TODA was shown to classify nearly 75% of the reads with high confidence. The classification accuracy of DL-TODA was over 0.98 at taxonomic ranks above the genus level, making it comparable with Kraken2 and Centrifuge, two state-of-the-art taxonomic classification tools. DL-TODA also achieved an accuracy of 0.97 at the species level, which is higher than 0.93 by Kraken2 and 0.85 by Centrifuge on the same test set. Application of DL-TODA to the human oral and cropland soil metagenomes further demonstrated its use in analyzing microbiomes from diverse environments. Compared to Centrifuge and Kraken2, DL-TODA predicted distinct relative abundance rankings and is less biased toward a single taxon.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Microbiota / Aprendizado Profundo Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Microbiota / Aprendizado Profundo Idioma: En Ano de publicação: 2023 Tipo de documento: Article