Your browser doesn't support javascript.
loading
Negative binomial mixed models for analyzing microbiome count data.
Zhang, Xinyan; Mallick, Himel; Tang, Zaixiang; Zhang, Lei; Cui, Xiangqin; Benson, Andrew K; Yi, Nengjun.
Afiliação
  • Zhang X; Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL, 35294-0022, USA.
  • Mallick H; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA.
  • Tang Z; Program in Medical and Population Genetics, the Broad Institute, Cambridge, MA, 02142, USA.
  • Zhang L; Department of Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, 215123, China.
  • Cui X; Department of Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, 215123, China.
  • Benson AK; Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL, 35294-0022, USA.
  • Yi N; Department of Food Science and Technology and Core for Applied Genomics and Ecology, University of Nebraska, Lincoln, NE, 68583, USA.
BMC Bioinformatics ; 18(1): 4, 2017 Jan 03.
Article em En | MEDLINE | ID: mdl-28049409
ABSTRACT

BACKGROUND:

Recent advances in next-generation sequencing (NGS) technology enable researchers to collect a large volume of metagenomic sequencing data. These data provide valuable resources for investigating interactions between the microbiome and host environmental/clinical factors. In addition to the well-known properties of microbiome count measurements, for example, varied total sequence reads across samples, over-dispersion and zero-inflation, microbiome studies usually collect samples with hierarchical structures, which introduce correlation among the samples and thus further complicate the analysis and interpretation of microbiome count data.

RESULTS:

In this article, we propose negative binomial mixed models (NBMMs) for detecting the association between the microbiome and host environmental/clinical factors for correlated microbiome count data. Although having not dealt with zero-inflation, the proposed mixed-effects models account for correlation among the samples by incorporating random effects into the commonly used fixed-effects negative binomial model, and can efficiently handle over-dispersion and varying total reads. We have developed a flexible and efficient IWLS (Iterative Weighted Least Squares) algorithm to fit the proposed NBMMs by taking advantage of the standard procedure for fitting the linear mixed models.

CONCLUSIONS:

We evaluate and demonstrate the proposed method via extensive simulation studies and the application to mouse gut microbiome data. The results show that the proposed method has desirable properties and outperform the previously used methods in terms of both empirical power and Type I error. The method has been incorporated into the freely available R package BhGLM ( http//www.ssg.uab.edu/bhglm/ and http//github.com/abbyyan3/BhGLM ), providing a useful tool for analyzing microbiome data.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Modelos Estatísticos / Microbiota Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Animals / Humans / Male Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Modelos Estatísticos / Microbiota Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Animals / Humans / Male Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2017 Tipo de documento: Article País de afiliação: Estados Unidos