Your browser doesn't support javascript.
loading
Differential richness inference for 16S rRNA marker gene surveys.
Kumar, M Senthil; Slud, Eric V; Hehnly, Christine; Zhang, Lijun; Broach, James; Irizarry, Rafael A; Schiff, Steven J; Paulson, Joseph N.
Afiliación
  • Kumar MS; Department of Data Science, The Dana-Farber Cancer Institute, Boston, MA, USA. senthil@ds.dfci.harvard.edu.
  • Slud EV; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA. senthil@ds.dfci.harvard.edu.
  • Hehnly C; Department of Mathematics, University of Maryland, College Park, MD, USA.
  • Zhang L; Center for Statistical Research and Methodology U.S. Census Bureau, Suitland, MD, USA.
  • Broach J; Penn State Institute for Personalized Medicine, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
  • Irizarry RA; Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
  • Schiff SJ; Penn State Institute for Personalized Medicine, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
  • Paulson JN; Penn State Institute for Personalized Medicine, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
Genome Biol ; 23(1): 166, 2022 08 01.
Article en En | MEDLINE | ID: mdl-35915508
BACKGROUND: Individual and environmental health outcomes are frequently linked to changes in the diversity of associated microbial communities. Thus, deriving health indicators based on microbiome diversity measures is essential. While microbiome data generated using high-throughput 16S rRNA marker gene surveys are appealing for this purpose, 16S surveys also generate a plethora of spurious microbial taxa. RESULTS: When this artificial inflation in the observed number of taxa is ignored, we find that changes in the abundance of detected taxa confound current methods for inferring differences in richness. Experimental evidence, theory-guided exploratory data analyses, and existing literature support the conclusion that most sub-genus discoveries are spurious artifacts of clustering 16S sequencing reads. We proceed to model a 16S survey's systematic patterns of sub-genus taxa generation as a function of genus abundance to derive a robust control for false taxa accumulation. These controls unlock classical regression approaches for highly flexible differential richness inference at various levels of the surveyed microbial assemblage: from sample groups to specific taxa collections. The proposed methodology for differential richness inference is available through an R package, Prokounter. CONCLUSIONS: False species discoveries bias richness estimation and confound differential richness inference. In the case of 16S microbiome surveys, supporting evidence indicate that most sub-genus taxa are spurious. Based on this finding, a flexible method is proposed and is shown to overcome the confounding problem noted with current approaches for differential richness inference. Package availability: https://github.com/mskb01/prokounter.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Bacterias / Microbiota Tipo de estudio: Prognostic_studies / Qualitative_research Idioma: En Revista: Genome Biol Asunto de la revista: BIOLOGIA MOLECULAR / GENETICA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Bacterias / Microbiota Tipo de estudio: Prognostic_studies / Qualitative_research Idioma: En Revista: Genome Biol Asunto de la revista: BIOLOGIA MOLECULAR / GENETICA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Reino Unido