Your browser doesn't support javascript.
loading
Bayesian networks for mass spectrometric metabolite identification via molecular fingerprints.
Ludwig, Marcus; Dührkop, Kai; Böcker, Sebastian.
Afiliação
  • Ludwig M; Chair for Bioinformatics, Friedrich-Schiller-University, Jena, Germany.
  • Dührkop K; Chair for Bioinformatics, Friedrich-Schiller-University, Jena, Germany.
  • Böcker S; Chair for Bioinformatics, Friedrich-Schiller-University, Jena, Germany.
Bioinformatics ; 34(13): i333-i340, 2018 07 01.
Article em En | MEDLINE | ID: mdl-29949965
ABSTRACT
Motivation Metabolites, small molecules that are involved in cellular reactions, provide a direct functional signature of cellular state. Untargeted metabolomics experiments usually rely on tandem mass spectrometry to identify the thousands of compounds in a biological sample. Recently, we presented CSIFingerID for searching in molecular structure databases using tandem mass spectrometry data. CSIFingerID predicts a molecular fingerprint that encodes the structure of the query compound, then uses this to search a molecular structure database such as PubChem. Scoring of the predicted query fingerprint and deterministic target fingerprints is carried out assuming independence between the molecular properties constituting the fingerprint.

Results:

We present a scoring that takes into account dependencies between molecular properties. As before, we predict posterior probabilities of molecular properties using machine learning. Dependencies between molecular properties are modeled as a Bayesian tree network; the tree structure is estimated on the fly from the instance data. For each edge, we also estimate the expected covariance between the two random variables. For fixed marginal probabilities, we then estimate conditional probabilities using the known covariance. Now, the corrected posterior probability of each candidate can be computed, and candidates are ranked by this score. Modeling dependencies improves identification rates of CSIFingerID by 2.85 percentage points. Availability and implementation The new scoring Bayesian (fixed tree) is integrated into SIRIUS 4.0 (https//bio.informatik.uni-jena.de/software/sirius/).
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Espectrometria de Massas em Tandem / Metabolômica / Bases de Dados de Compostos Químicos Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2018 Tipo de documento: Article País de afiliação: Alemanha

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Espectrometria de Massas em Tandem / Metabolômica / Bases de Dados de Compostos Químicos Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2018 Tipo de documento: Article País de afiliação: Alemanha