Your browser doesn't support javascript.
loading
Multi-part strategy for testing differential taxa abundance in sequencing data: A simulation study with an application to a microbiome study.
Cianci, Daniela; Tims, Sebastian; Roeselers, Guus; El Galta, Rachid; Swinkels, Sophie.
Afiliação
  • Cianci D; Julius Center for Health Sciences and Primary Care, Department of Data Science & Biostatistics, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands. Electronic address: d.cianci@umcutrecht.nl.
  • Tims S; Danone Nutricia Research, Utrecht, The Netherlands.
  • Roeselers G; Danone Nutricia Research, Utrecht, The Netherlands.
  • El Galta R; Former employee of Danone Nutricia Research, Utrecht, The Netherlands.
  • Swinkels S; Danone Nutricia Research, Utrecht, The Netherlands.
J Microbiol Methods ; 212: 106810, 2023 09.
Article em En | MEDLINE | ID: mdl-37607610
ABSTRACT
Comparing the microbiome across study arms is a recurrent goal in many studies. Standard statistical methods are often used for this purpose, however, they do not always represent the best choice in this context given the characteristics of microbiota sequencing data, e.g., non-negative, highly skewed counts with a large number of zeros. A multi-part strategy, that combines a two-part test (as described by Wagner et al., 2011), a Wilcoxon sum-rank test, a Chi-square and a Barnard's test was explored to compare the taxa abundance between study arms. The choice of the test is based on the data structure. The type I error of the multi-part strategy was evaluated by using a simulation study and the method was applied to real data. The script to perform the analysis with the multi-part approach is provided in the statistical software SAS. Several scenarios were simulated and in all of them the type I error was not inflated. Based on the statistical differences resulting from the two-part test (as described by Wagner et al., 2011) and the multi-part strategy (as proposed in this article), different biological implications can be extracted from the same comparison in the same data set. In the comparison of taxa abundance between study arms, we showed that careful attention needs to be paid on the data structure, in order to be able to choose an appropriate analysis method. Our approach selects the most suitable test according to the type of data observed, maintains a good type I error and is easily applicable by using the SAS macro provided.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Microbiota Tipo de estudo: Incidence_studies / Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Microbiota Tipo de estudo: Incidence_studies / Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article