Your browser doesn't support javascript.
loading
A robust and transformation-free joint model with matching and regularization for metagenomic trajectory and disease onset.
Li, Qian; Vehik, Kendra; Li, Cai; Triplett, Eric; Roesch, Luiz; Hu, Yi-Juan; Krischer, Jeffrey.
Afiliación
  • Li Q; Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, 38105, TN, USA. qian.li@stjude.org.
  • Vehik K; Health Informatics Institute, University of South Florida, Tampa, 33620, FL, USA.
  • Li C; Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, 38105, TN, USA.
  • Triplett E; Department of Microbiology and Cell Science, University of Florida, Gainesville, 32611, FL, USA.
  • Roesch L; Department of Microbiology and Cell Science, University of Florida, Gainesville, 32611, FL, USA.
  • Hu YJ; Department of Biostatistics and Bioinformatics, Emory University, Atlanta, 30322, GA, USA.
  • Krischer J; Health Informatics Institute, University of South Florida, Tampa, 33620, FL, USA.
BMC Genomics ; 23(1): 661, 2022 Sep 19.
Article en En | MEDLINE | ID: mdl-36123651
ABSTRACT

BACKGROUND:

To identify operational taxonomy units (OTUs) signaling disease onset in an observational study, a powerful strategy was selecting participants by matched sets and profiling temporal metagenomes, followed by trajectory analysis. Existing trajectory analyses modeled individual OTU or microbial community without adjusting for the within-community correlation and matched-set-specific latent factors.

RESULTS:

We proposed a joint model with matching and regularization (JMR) to detect OTU-specific trajectory predictive of host disease status. The between- and within-matched-sets heterogeneity in OTU relative abundance and disease risk were modeled by nested random effects. The inherent negative correlation in microbiota composition was adjusted by incorporating and regularizing the top-correlated taxa as longitudinal covariate, pre-selected by Bray-Curtis distance and elastic net regression. We designed a simulation pipeline to generate true biomarkers for disease onset and the pseudo biomarkers caused by compositionality. We demonstrated that JMR effectively controlled the false discovery and pseudo biomarkers in a simulation study generating temporal high-dimensional metagenomic counts with random intercept or slope. Application of the competing methods in the simulated data and the TEDDY cohort showed that JMR outperformed the other methods and identified important taxa in infants' fecal samples with dynamics preceding host disease status.

CONCLUSION:

Our method JMR is a robust framework that models taxon-specific trajectory and host disease status for matched participants without transformation of relative abundance, improving the power of detecting disease-associated microbial features in certain scenarios. JMR is available in R package mtradeR at https//github.com/qianli10000/mtradeR.
Asunto(s)
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Metagenoma / Microbiota Tipo de estudio: Etiology_studies / Incidence_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Metagenoma / Microbiota Tipo de estudio: Etiology_studies / Incidence_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos