Your browser doesn't support javascript.
loading
A Simultaneous Feature Selection and Compositional Association Test for Detecting Sparse Associations in High-Dimensional Metagenomic Data.
Hinton, Andrew L; Mucha, Peter J.
Afiliación
  • Hinton AL; Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC, United States.
  • Mucha PJ; School of Medicine, University of North Carolina at Chapel Hill Food Allergy Initiative, Chapel Hill, NC, United States.
Front Microbiol ; 13: 837396, 2022.
Article en En | MEDLINE | ID: mdl-35387076
Numerous metagenomic studies aim to discover associations between the microbial composition of an environment (e.g., gut, skin, oral) and a phenotype of interest. Multivariate analysis is often performed in these studies without critical a priori knowledge of which taxa are associated with the phenotype being studied. This approach typically reduces statistical power in settings where the true associations among only a few taxa are obscured by high dimensionality (i.e., sparse association signals). At the same time, low sample size and compositional sample space constraints may reduce beyond-study generalizability if not properly accounted for. To address these difficulties, we developed the Selection-Energy-Permutation (SelEnergyPerm) method, a nonparametric group association test with embedded feature selection that directly accounts for compositional constraints using parsimonious logratio signatures between taxonomic features, for characterizing and understanding alterations in microbial community structure. Simulation results show SelEnergyPerm selects small independent sets of logratios that capture strong associations in a range of scenarios. Additionally, our simulation results demonstrate SelEnergyPerm consistently detects/rejects associations in synthetic data with sparse, dense, or no association signals. We demonstrate the novel benefits of our method in four case studies utilizing publicly available 16S amplicon and whole-genome sequencing datasets. Our R implementation of Selection-Energy-Permutation, including an example demonstration and the code to generate all of the scenarios used here, is available at https://www.github.com/andrew84830813/selEnergyPermR.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Risk_factors_studies Idioma: En Revista: Front Microbiol Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Suiza

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Risk_factors_studies Idioma: En Revista: Front Microbiol Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Suiza