Equivalent input produces different output in the UniFrac significance test.
BMC Bioinformatics
; 15: 278, 2014 Aug 13.
Article
em En
| MEDLINE
| ID: mdl-25124232
BACKGROUND: UniFrac is a well-known tool for comparing microbial communities and assessing statistically significant differences between communities. In this paper we identify a discrepancy in the UniFrac methodology that causes semantically equivalent inputs to produce different outputs in tests of statistical significance. RESULTS: The phylogenetic trees that are input into UniFrac may or may not contain abundance counts. An isomorphic transform can be defined that will convert trees between these two formats without altering the semantic meaning of the trees. UniFrac produces different outputs for these equivalent forms of the same input tree. This is illustrated using metagenomics data from a lake sediment study. CONCLUSIONS: Results from the UniFrac tool can vary greatly for the same input depending on the arbitrary choice of input format. Practitioners should be aware of this issue and use the tool with caution to ensure consistency and validity in their analyses. We provide a script to transform inputs between equivalent formats to help researchers achieve this consistency.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Filogenia
/
Biologia Computacional
/
Microbiologia
Idioma:
En
Revista:
BMC Bioinformatics
Ano de publicação:
2014
Tipo de documento:
Article