Your browser doesn't support javascript.
loading
Improved Small Molecule Identification through Learning Combinations of Kernel Regression Models.
Brouard, Céline; Bassé, Antoine; d'Alché-Buc, Florence; Rousu, Juho.
Affiliation
  • Brouard C; Unité de Mathématiques et Informatique Appliquées de Toulouse, UR 875, INRA, 31326 Castanet Tolosan, France. celine.brouard@inra.fr.
  • Bassé A; LTCI, Télécom Paris, Institut Polytechnique de Paris, 75634 Paris, France.
  • d'Alché-Buc F; LTCI, Télécom Paris, Institut Polytechnique de Paris, 75634 Paris, France.
  • Rousu J; Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University, 00076 Espoo, Finland.
Metabolites ; 9(8)2019 Aug 01.
Article in En | MEDLINE | ID: mdl-31374904
ABSTRACT
In small molecule identification from tandem mass (MS/MS) spectra, input-output kernel regression (IOKR) currently provides the state-of-the-art combination of fast training and prediction and high identification rates. The IOKR approach can be simply understood as predicting a fingerprint vector from the MS/MS spectrum of the unknown molecule, and solving a pre-image problem to find the molecule with the most similar fingerprint. In this paper, we bring forward the following improvements to the IOKR framework firstly, we formulate the IOKRreverse model that can be understood as mapping molecular structures into the MS/MS feature space and solving a pre-image problem to find the molecule whose predicted spectrum is the closest to the input MS/MS spectrum. Secondly, we introduce an approach to combine several IOKR and IOKRreverse models computed from different input and output kernels, called IOKRfusion. The method is based on minimizing structured Hinge loss of the combined model using a mini-batch stochastic subgradient optimization. Our experiments show a consistent improvement of top-k accuracy both in positive and negative ionization mode data.
Key words

Full text: 1 Database: MEDLINE Type of study: Diagnostic_studies / Prognostic_studies Language: En Year: 2019 Type: Article

Full text: 1 Database: MEDLINE Type of study: Diagnostic_studies / Prognostic_studies Language: En Year: 2019 Type: Article