Your browser doesn't support javascript.
loading
A statistical model for describing and simulating microbial community profiles.
Ma, Siyuan; Ren, Boyu; Mallick, Himel; Moon, Yo Sup; Schwager, Emma; Maharjan, Sagun; Tickle, Timothy L; Lu, Yiren; Carmody, Rachel N; Franzosa, Eric A; Janson, Lucas; Huttenhower, Curtis.
Afiliación
  • Ma S; Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Ren B; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Mallick H; Broad Institute, Cambridge, Massachusetts, United States of America.
  • Moon YS; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Schwager E; Broad Institute, Cambridge, Massachusetts, United States of America.
  • Maharjan S; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Tickle TL; Broad Institute, Cambridge, Massachusetts, United States of America.
  • Lu Y; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Carmody RN; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Franzosa EA; Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Janson L; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Huttenhower C; Broad Institute, Cambridge, Massachusetts, United States of America.
PLoS Comput Biol ; 17(9): e1008913, 2021 09.
Article en En | MEDLINE | ID: mdl-34516542
ABSTRACT
Many methods have been developed for statistical analysis of microbial community profiles, but due to the complex nature of typical microbiome measurements (e.g. sparsity, zero-inflation, non-independence, and compositionality) and of the associated underlying biology, it is difficult to compare or evaluate such methods within a single systematic framework. To address this challenge, we developed SparseDOSSA (Sparse Data Observations for the Simulation of Synthetic Abundances) a statistical model of microbial ecological population structure, which can be used to parameterize real-world microbial community profiles and to simulate new, realistic profiles of known structure for methods evaluation. Specifically, SparseDOSSA's model captures marginal microbial feature abundances as a zero-inflated log-normal distribution, with additional model components for absolute cell counts and the sequence read generation process, microbe-microbe, and microbe-environment interactions. Together, these allow fully known covariance structure between synthetic features (i.e. "taxa") or between features and "phenotypes" to be simulated for method benchmarking. Here, we demonstrate SparseDOSSA's performance for 1) accurately modeling human-associated microbial population profiles; 2) generating synthetic communities with controlled population and ecological structures; 3) spiking-in true positive synthetic associations to benchmark analysis methods; and 4) recapitulating an end-to-end mouse microbiome feeding experiment. Together, these represent the most common analysis types in assessment of real microbial community environmental and epidemiological statistics, thus demonstrating SparseDOSSA's utility as a general-purpose aid for modeling communities and evaluating quantitative methods. An open-source implementation is available at http//huttenhower.sph.harvard.edu/sparsedossa2.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Modelos Estadísticos / Microbiota Tipo de estudio: Risk_factors_studies Idioma: En Revista: PLoS Comput Biol Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Modelos Estadísticos / Microbiota Tipo de estudio: Risk_factors_studies Idioma: En Revista: PLoS Comput Biol Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos