Your browser doesn't support javascript.
loading
TransExION: a transformer based explainable similarity metric for comparing IONS in tandem mass spectrometry.
Bui-Thi, Danh; Liu, Youzhong; Lippens, Jennifer L; Laukens, Kris; De Vijlder, Thomas.
Afiliação
  • Bui-Thi D; Computer Science Department, University of Antwerp, Middelheimlaan 1, 2020, Antwerp, Belgium.
  • Liu Y; Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium.
  • Lippens JL; Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium.
  • Laukens K; Computer Science Department, University of Antwerp, Middelheimlaan 1, 2020, Antwerp, Belgium.
  • De Vijlder T; Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium. tdevijld@its.jnj.com.
J Cheminform ; 16(1): 61, 2024 May 28.
Article em En | MEDLINE | ID: mdl-38807166
ABSTRACT
Small molecule identification is a crucial task in analytical chemistry and life sciences. One of the most commonly used technologies to elucidate small molecule structures is mass spectrometry. Spectral library search of product ion spectra (MS/MS) is a popular strategy to identify or find structural analogues. This approach relies on the assumption that spectral similarity and structural similarity are correlated. However, popular spectral similarity measures, usually calculated based on identical fragment matches between the MS/MS spectra, do not always accurately reflect the structural similarity. In this study, we propose TransExION, a Transformer based Explainable similarity metric for IONS. TransExION detects related fragments between MS/MS spectra through their mass difference and uses these to estimate spectral similarity. These related fragments can be nearly identical, but can also share a substructure. TransExION also provides a post-hoc explanation of its estimation, which can be used to support scientists in evaluating the spectral library search results and thus in structure elucidation of unknown molecules. Our model has a Transformer based architecture and it is trained on the data derived from GNPS MS/MS libraries. The experimental results show that it improves existing spectral similarity measures in searching and interpreting structural analogues as well as in molecular networking. SCIENTIFIC CONTRIBUTION We propose a transformer-based spectral similarity metrics that improves the comparison of small molecule tandem mass spectra. We provide a post hoc explanation that can serve as a good starting point for unknown spectra annotation based on database spectra.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: J Cheminform Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: J Cheminform Ano de publicação: 2024 Tipo de documento: Article