Your browser doesn't support javascript.
loading
Exploring machine learning for untargeted metabolomics using molecular fingerprints.
Sirocchi, Christel; Biancucci, Federica; Donati, Matteo; Bogliolo, Alessandro; Magnani, Mauro; Menotta, Michele; Montagna, Sara.
Affiliation
  • Sirocchi C; Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy. Electronic address: c.sirocchi2@campus.uniurb.it.
  • Biancucci F; Department of Biomolecular Sciences, University of Urbino, Via Saffi 2, Urbino, 61029, Italy.
  • Donati M; Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy.
  • Bogliolo A; Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy.
  • Magnani M; Department of Biomolecular Sciences, University of Urbino, Via Saffi 2, Urbino, 61029, Italy.
  • Menotta M; Department of Biomolecular Sciences, University of Urbino, Via Saffi 2, Urbino, 61029, Italy.
  • Montagna S; Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy.
Comput Methods Programs Biomed ; 250: 108163, 2024 Jun.
Article in En | MEDLINE | ID: mdl-38626559
ABSTRACT

BACKGROUND:

Metabolomics, the study of substrates and products of cellular metabolism, offers valuable insights into an organism's state under specific conditions and has the potential to revolutionise preventive healthcare and pharmaceutical research. However, analysing large metabolomics datasets remains challenging, with available methods relying on limited and incompletely annotated metabolic pathways.

METHODS:

This study, inspired by well-established methods in drug discovery, employs machine learning on metabolite fingerprints to explore the relationship of their structure with responses in experimental conditions beyond known pathways, shedding light on metabolic processes. It evaluates fingerprinting effectiveness in representing metabolites, addressing challenges like class imbalance, data sparsity, high dimensionality, duplicate structural encoding, and interpretable features. Feature importance analysis is then applied to reveal key chemical configurations affecting classification, identifying related metabolite groups.

RESULTS:

The approach is tested on two datasets one on Ataxia Telangiectasia and another on endothelial cells under low oxygen. Machine learning on molecular fingerprints predicts metabolite responses effectively, and feature importance analysis aligns with known metabolic pathways, unveiling new affected metabolite groups for further study.

CONCLUSION:

In conclusion, the presented approach leverages the strengths of drug discovery to address critical issues in metabolomics research and aims to bridge the gap between these two disciplines. This work lays the foundation for future research in this direction, possibly exploring alternative structural encodings and machine learning models.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Metabolomics / Machine Learning Limits: Humans Language: En Journal: Comput Methods Programs Biomed Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Country of publication: Ireland

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Metabolomics / Machine Learning Limits: Humans Language: En Journal: Comput Methods Programs Biomed Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Country of publication: Ireland