Electron ionization mass spectrometry feature peak relationships combined with deep classification model to assist similarity algorithm for fast and accurate identification of compounds.
Rapid Commun Mass Spectrom
; 38(13): e9752, 2024 Jul 15.
Article
en En
| MEDLINE
| ID: mdl-38700125
ABSTRACT
RATIONALE Gas chromatography-mass spectrometry (GC-MS) combines chromatography and MS, providing full play to the advantages of high separation efficiency of GC, strong qualitative ability of MS, and high sensitivity of detector. In GC-MS data processing, determining the experimental compounds is one of the most important analytical steps, which is usually realized by one-to-one similarity calculations between the experimental mass spectrum and the standard mass spectrum library. Although the accuracy of the algorithm has been improved in recent years, it is still difficult to distinguish structurally similar mass spectra, especially isomers. At the same time, the library capacity is very large and increasing every year, and the algorithm needs to perform large numbers of calculations with irrelevant compounds in the library to recognize unknown compounds, which leads to a significant reduction in efficiency. METHODS:
This work proposed to exclude a large number of irrelevant mass spectra by presearching, perform preliminary similarity calculations using similarity algorithms, and finally improve the accuracy of similarity calculations using deep classification models. The replica library of NIST17 is used as the query data, and the master library is used as the reference database.RESULTS:
Compared with the traditional recognition algorithm, the preprocessing algorithm has reduced the time by 4.2 h, and by adding the deep learning models 1 and 2 as the final determination, the recognition accuracy has been improved by 1.9% and 6.5%, respectively, based on the original algorithm.CONCLUSIONS:
This method improves the recognition efficiency compared to conventional algorithms and at the same time has better recognition accuracy for structurally similar mass spectra and isomers.
Texto completo:
1
Banco de datos:
MEDLINE
Idioma:
En
Año:
2024
Tipo del documento:
Article