Your browser doesn't support javascript.
loading
Comprehensive evaluation of methods for differential expression analysis of metatranscriptomics data.
Cho, Hunyong; Qu, Yixiang; Liu, Chuwen; Tang, Boyang; Lyu, Ruiqi; Lin, Bridget M; Roach, Jeffrey; Azcarate-Peril, M Andrea; Aguiar Ribeiro, Apoena; Love, Michael I; Divaris, Kimon; Wu, Di.
Afiliação
  • Cho H; Department of Biostatistics, University of North Carolina, Chapel Hill, NC, United States.
  • Qu Y; Department of Biostatistics, University of North Carolina, Chapel Hill, NC, United States.
  • Liu C; Department of Biostatistics, University of North Carolina, Chapel Hill, NC, United States.
  • Tang B; Department of Statistics, University of Connecticut, Storrs, CT, United States.
  • Lyu R; School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States.
  • Lin BM; Department of Biostatistics, University of North Carolina, Chapel Hill, NC, United States.
  • Roach J; Research Computing, University of North Carolina, Chapel Hill, NC, United States.
  • Azcarate-Peril MA; Department of Medicine and Nutrition, University of North Carolina, Chapel Hill, NC, United States.
  • Aguiar Ribeiro A; Division of Diagnostic Sciences, University of North Carolina, Chapel Hill, NC, United States.
  • Love MI; Department of Biostatistics, University of North Carolina, Chapel Hill, NC, United States.
  • Divaris K; Department of Genetics, University of North Carolina, Chapel Hill, NC, United States.
  • Wu D; Division of Pediatric and Public Health, University of North Carolina, Chapel Hill, NC, United States.
Brief Bioinform ; 24(5)2023 09 20.
Article em En | MEDLINE | ID: mdl-37738402
ABSTRACT
Understanding the function of the human microbiome is important but the development of statistical methods specifically for the microbial gene expression (i.e. metatranscriptomics) is in its infancy. Many currently employed differential expression analysis methods have been designed for different data types and have not been evaluated in metatranscriptomics settings. To address this gap, we undertook a comprehensive evaluation and benchmarking of 10 differential analysis methods for metatranscriptomics data. We used a combination of real and simulated data to evaluate performance (i.e. type I error, false discovery rate and sensitivity) of the following

methods:

log-normal (LN), logistic-beta (LB), MAST, DESeq2, metagenomeSeq, ANCOM-BC, LEfSe, ALDEx2, Kruskal-Wallis and two-part Kruskal-Wallis. The simulation was informed by supragingival biofilm microbiome data from 300 preschool-age children enrolled in a study of childhood dental disease (early childhood caries, ECC), whereas validations were sought in two additional datasets from the ECC study and an inflammatory bowel disease study. The LB test showed the highest sensitivity in both small and large samples and reasonably controlled type I error. Contrarily, MAST was hampered by inflated type I error. Upon application of the LN and LB tests in the ECC study, we found that genes C8PHV7 and C8PEV7, harbored by the lactate-producing Campylobacter gracilis, had the strongest association with childhood dental disease. This comprehensive model evaluation offers practical guidance for selection of appropriate methods for rigorous analyses of differential expression in metatranscriptomics. Selection of an optimal method increases the possibility of detecting true signals while minimizing the chance of claiming false ones.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doenças Estomatognáticas / Benchmarking Tipo de estudo: Guideline Limite: Child / Child, preschool / Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doenças Estomatognáticas / Benchmarking Tipo de estudo: Guideline Limite: Child / Child, preschool / Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos