Your browser doesn't support javascript.
loading
The use of taxon-specific reference databases compromises metagenomic classification.
R Marcelino, Vanessa; Holmes, Edward C; Sorrell, Tania C.
Afiliación
  • R Marcelino V; Marie Bashir Institute for Infectious Diseases and Biosecurity and Faculty of Medicine and Health, Sydney Medical School, Westmead Clinical School, The University of Sydney, Sydney, NSW, 2006, Australia. vrmarcelino@gmail.com.
  • Holmes EC; Centre for Infectious Diseases and Microbiology, Westmead Institute for Medical Research, Westmead, NSW, 2145, Australia. vrmarcelino@gmail.com.
  • Sorrell TC; School of Life & Environmental Sciences, Charles Perkins Centre, The University of Sydney, Sydney, NSW, 2006, Australia. vrmarcelino@gmail.com.
BMC Genomics ; 21(1): 184, 2020 Feb 27.
Article en En | MEDLINE | ID: mdl-32106809
A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan's fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Biología Computacional / Heces / Metagenómica / Código de Barras del ADN Taxonómico Límite: Animals / Humans País/Región como asunto: Africa / Europa Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2020 Tipo del documento: Article País de afiliación: Australia

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Biología Computacional / Heces / Metagenómica / Código de Barras del ADN Taxonómico Límite: Animals / Humans País/Región como asunto: Africa / Europa Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2020 Tipo del documento: Article País de afiliación: Australia