Your browser doesn't support javascript.
loading
Multivariable association discovery in population-scale meta-omics studies.
Mallick, Himel; Rahnavard, Ali; McIver, Lauren J; Ma, Siyuan; Zhang, Yancong; Nguyen, Long H; Tickle, Timothy L; Weingart, George; Ren, Boyu; Schwager, Emma H; Chatterjee, Suvo; Thompson, Kelsey N; Wilkinson, Jeremy E; Subramanian, Ayshwarya; Lu, Yiren; Waldron, Levi; Paulson, Joseph N; Franzosa, Eric A; Bravo, Hector Corrada; Huttenhower, Curtis.
Affiliation
  • Mallick H; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Rahnavard A; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • McIver LJ; Computational Biology Institute, Department of Biostatistics and Bioinformatics, Milken Institute School of Public Health, George Washington University, Washington DC, United States of America.
  • Ma S; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Zhang Y; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Nguyen LH; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Tickle TL; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Weingart G; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Ren B; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Schwager EH; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Chatterjee S; Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.
  • Thompson KN; Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.
  • Wilkinson JE; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Subramanian A; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Lu Y; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Waldron L; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Paulson JN; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Franzosa EA; Biostatistics Department, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America.
  • Bravo HC; The Broad Institute, Cambridge, Massachusetts, United States of America.
  • Huttenhower C; Epidemiology Branch, Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, United States of America.
PLoS Comput Biol ; 17(11): e1009442, 2021 11.
Article in En | MEDLINE | ID: mdl-34784344
ABSTRACT
It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2's linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Multivariate Analysis / Computational Biology / Gastrointestinal Microbiome Type of study: Observational_studies / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2021 Type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Multivariate Analysis / Computational Biology / Gastrointestinal Microbiome Type of study: Observational_studies / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2021 Type: Article Affiliation country: United States