Your browser doesn't support javascript.
loading
A comparative analysis of mutual information methods for pairwise relationship detection in metagenomic data.
Francis, Dallace; Sun, Fengzhu.
Affiliation
  • Francis D; Quantitative and Computational Biology Department, University of Southern California, Los Angeles, CA, 90089, USA. dallacef@usc.edu.
  • Sun F; Quantitative and Computational Biology Department, University of Southern California, Los Angeles, CA, 90089, USA.
BMC Bioinformatics ; 25(1): 266, 2024 Aug 14.
Article in En | MEDLINE | ID: mdl-39143554
ABSTRACT

BACKGROUND:

Construction of co-occurrence networks in metagenomic data often employs correlation to infer pairwise relationships between microbes. However, biological systems are complex and often display qualities non-linear in nature. Therefore, the reliance on correlation alone may overlook important relationships and fail to capture the full breadth of intricacies presented in underlying interaction networks. It is of interest to incorporate metrics that are not only robust in detecting linear relationships, but non-linear ones as well.

RESULTS:

In this paper, we explore the use of various mutual information (MI) estimation approaches for quantifying pairwise relationships in biological data and compare their performances against two traditional measures-Pearson's correlation coefficient, r, and Spearman's rank correlation coefficient, ρ. Metrics are tested on both simulated data designed to mimic pairwise relationships that may be found in ecological systems and real data from a previous study on C. diff infection. The results demonstrate that, in the case of asymmetric relationships, mutual information estimators can provide better detection ability than Pearson's or Spearman's correlation coefficients. Specifically, we find that these estimators have elevated performances in the detection of exploitative relationships, demonstrating the potential benefit of including them in future metagenomic studies.

CONCLUSIONS:

Mutual information (MI) can uncover complex pairwise relationships in biological data that may be missed by traditional measures of association. The inclusion of such relationships when constructing co-occurrence networks can result in a more comprehensive analysis than the use of correlation alone.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Metagenomics Language: En Journal: BMC Bioinformatics / BMC bioinformatics (Online) Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: Country of publication:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Metagenomics Language: En Journal: BMC Bioinformatics / BMC bioinformatics (Online) Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: Country of publication: