Adjusting for false discoveries in constraint-based differential metabolic flux analysis.

Galuzzi, Bruno G; Milazzo, Luca; Damiani, Chiara

Galuzzi, Bruno G; Milazzo, Luca; Damiani, Chiara.

Afiliação

Galuzzi BG; Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza dell'Ateneo Nuovo, 1, Milan, 20126, Italy; Institute of Molecular Bioimaging and Physiology (IBFM), Segrate, 20054, Italy; SYSBIO Centre of Systems Biology/ ISBE.IT, Milan, Italy. Electronic address: brunogiovanni.galuzzi@cnr.it.
Milazzo L; Department of Informatics, Systems, and Communications, University of Milano-Bicocca, Piazza dell'Ateneo Nuovo, 1, Milan, 20126, Italy.
Damiani C; Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza dell'Ateneo Nuovo, 1, Milan, 20126, Italy; SYSBIO Centre of Systems Biology/ ISBE.IT, Milan, Italy. Electronic address: chiara.damiani@unimib.it.

J Biomed Inform ; 150: 104597, 2024 02.

Article em En | MEDLINE | ID: mdl-38272432

ABSTRACT

ABSTRACT

One of the critical steps to characterize metabolic alterations in multifactorial diseases, as well as their heterogeneity across different patients, is the identification of reactions that exhibit significantly different usage (or flux) between cohorts. However, since metabolic fluxes cannot be determined directly, researchers typically use constraint-based metabolic network models, customized on post-genomics datasets. The use of random sampling within the feasible region of metabolic networks is becoming more prevalent for comparing these networks. While many algorithms have been proposed and compared for efficiently and uniformly sampling the feasible region of metabolic networks, their impact on the risk of making false discoveries when comparing different samples has not been investigated yet, and no sampling strategy has been so far specifically designed to mitigate the problem. To be able to precisely assess the False Discovery Rate (FDR), in this work we compared different samples obtained from the very same metabolic model. We compared the FDR obtained for different model scales, sample sizes, parameters of the sampling algorithm, and strategies to filter out non-significant variations. To be able to compare the largely used hit-and-run strategy with the much less investigated corner-based strategy, we first assessed the intrinsic capability of current corner-based algorithms and of a newly proposed one to visit all vertices of a constraint-based region. We show that false discoveries can occur at high rates even for large samples of small-scale networks. However, we demonstrate that a statistical test based on the empirical null distribution of Kullback-Leibler divergence can effectively correct for false discoveries. We also show that our proposed corner-based algorithm is more efficient than state-of-the-art alternatives and much less prone to false discoveries than hit-and-run strategies. We report that the differences in the marginal distributions obtained with the two strategies are related to but not fully explained by differences in sample standard deviation, as previously thought. Overall, our study provides insights into the impact of sampling strategies on FDR in metabolic network analysis and offers new guidelines for more robust and reproducible analyses.

Assuntos

Análise do Fluxo Metabólico; Modelos Biológicos; Humanos; Algoritmos; Redes e Vias Metabólicas; Genômica

Palavras-chave

COnstraint-based modeling; Corner-based; False discovery rate; Flux sampling; Hit and run

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article