Your browser doesn't support javascript.
loading
Multi-Task Neural Networks and Molecular Fingerprints to Enhance Compound Identification from LC-MS/MS Data.
Consonni, Viviana; Gosetti, Fabio; Termopoli, Veronica; Todeschini, Roberto; Valsecchi, Cecile; Ballabio, Davide.
Affiliation
  • Consonni V; Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
  • Gosetti F; Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
  • Termopoli V; Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
  • Todeschini R; Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
  • Valsecchi C; Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
  • Ballabio D; Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy.
Molecules ; 27(18)2022 Sep 08.
Article in En | MEDLINE | ID: mdl-36144564
ABSTRACT
Mass spectrometry (MS) is widely used for the identification of chemical compounds by matching the experimentally acquired mass spectrum against a database of reference spectra. However, this approach suffers from a limited coverage of the existing databases causing a failure in the identification of a compound not present in the database. Among the computational approaches for mining metabolite structures based on MS data, one option is to predict molecular fingerprints from the mass spectra by means of chemometric strategies and then use them to screen compound libraries. This can be carried out by calibrating multi-task artificial neural networks from large datasets of mass spectra, used as inputs, and molecular fingerprints as outputs. In this study, we prepared a large LC-MS/MS dataset from an on-line open repository. These data were used to train and evaluate deep-learning-based approaches to predict molecular fingerprints and retrieve the structure of unknown compounds from their LC-MS/MS spectra. Effects of data sparseness and the impact of different strategies of data curing and dimensionality reduction on the output accuracy have been evaluated. Moreover, extensive diagnostics have been carried out to evaluate modelling advantages and drawbacks as a function of the explored chemical space.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Neural Networks, Computer / Tandem Mass Spectrometry Type of study: Diagnostic_studies / Prognostic_studies Language: En Journal: Molecules Journal subject: BIOLOGIA Year: 2022 Document type: Article Affiliation country: Italia

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Neural Networks, Computer / Tandem Mass Spectrometry Type of study: Diagnostic_studies / Prognostic_studies Language: En Journal: Molecules Journal subject: BIOLOGIA Year: 2022 Document type: Article Affiliation country: Italia