Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Molecules ; 28(8)2023 Apr 12.
Article in English | MEDLINE | ID: mdl-37110641

ABSTRACT

Unsymmetrical dimethylhydrazine (UDMH) is a widely used rocket propellant. Entering the environment or being stored in uncontrolled conditions, UDMH easily forms an enormous variety (at least many dozens) of transformation products. Environmental pollution by UDMH and its transformation products is a major problem in many countries and across the Arctic region. Unfortunately, previous works often use only electron ionization mass spectrometry with a library search, or they consider only the molecular formula to propose the structures of new products. This is quite an unreliable approach. It was demonstrated that a newly proposed artificial intelligence-based workflow allows for the proposal of structures of UDMH transformation products with a greater degree of certainty. The presented free and open-source software with a convenient graphical user interface facilitates the non-target analysis of industrial samples. It has bundled machine learning models for the prediction of retention indices and mass spectra. A critical analysis of whether a combination of several methods of chromatography and mass spectrometry allows us to elucidate the structure of an unknown UDMH transformation product was provided. It was demonstrated that the use of gas chromatographic retention indices for two stationary phases (polar and non-polar) allows for the rejection of false candidates in many cases when only one retention index is not enough. The structures of five previously unknown UDMH transformation products were proposed, and four previously proposed structures were refined.

2.
Chemosphere ; 307(Pt 1): 135764, 2022 Nov.
Article in English | MEDLINE | ID: mdl-35863423

ABSTRACT

Unsymmetrical dimethylhydrazine (UDMH) is a toxic and environmentally hostile compound that was massively introduced to the environment during previous decades due to its use in the space and rocket industry. The compound forms multiple transformation products, and many of them are as dangerous as UDMH or even more dangerous. The danger includes, but is not limited to, acute toxicity, chronic health hazards, carcinogenicity, and environmental damage. UDMH transformation products are poorly investigated. In this work, the mixture formed by long storage of the waste that contained UDMH was studied. Even a preliminary screening of such a mixture is a complex task. It consists of dozens of compounds, and most of them are missing in chemical and spectral databases. The complete preparative separation of such a mixture is very laborious. We applied several methods of gas chromatography-mass spectrometry and liquid chromatography-mass spectrometry, and several machine learning and chemoinformatics methods to make a preliminary but informative screening of the mixture. Machine learning allowed predicting retention indices and mass spectra of candidate structures. The combination of various ion sources and a comparison of the observed with the predicted spectra and retention was used to propose confident structures for 24 compounds. It was demonstrated that neither high-resolution mass spectrometry nor mass spectral library matching is enough to elucidate the structures of unknown UDMH transformation products. At the same time, the use of machine learning and a combination of methods significantly improves the identification power. Finally, machine learning was applied to estimate the acute toxicity of the discovered compounds. It was shown that many of them are comparable to or even more toxic than UDMH itself. Such an extremely wide and still underestimated variety of easily formed derivatives of UDMH can lead to a significant underestimation of the potential hazard of this compound.


Subject(s)
Complex Mixtures , Machine Learning , Dimethylhydrazines , Gas Chromatography-Mass Spectrometry , Mass Spectrometry/methods
3.
Int J Mol Sci ; 22(17)2021 Aug 25.
Article in English | MEDLINE | ID: mdl-34502099

ABSTRACT

Prediction of gas chromatographic retention indices based on compound structure is an important task for analytical chemistry. The predicted retention indices can be used as a reference in a mass spectrometry library search despite the fact that their accuracy is worse in comparison with the experimental reference ones. In the last few years, deep learning was applied for this task. The use of deep learning drastically improved the accuracy of retention index prediction for non-polar stationary phases. In this work, we demonstrate for the first time the use of deep learning for retention index prediction on polar (e.g., polyethylene glycol, DB-WAX) and mid-polar (e.g., DB-624, DB-210, DB-1701, OV-17) stationary phases. The achieved accuracy lies in the range of 16-50 in terms of the mean absolute error for several stationary phases and test data sets. We also demonstrate that our approach can be directly applied to the prediction of the second dimension retention times (GC × GC) if a large enough data set is available. The achieved accuracy is considerably better compared with the previous results obtained using linear quantitative structure-retention relationships and ACD ChromGenius software. The source code and pre-trained models are available online.


Subject(s)
Chromatography, Gas/methods , Deep Learning , Chromatography, Gas/standards
4.
Anal Chem ; 92(17): 11818-11825, 2020 09 01.
Article in English | MEDLINE | ID: mdl-32867500

ABSTRACT

Preliminary compound identification and peak annotation in gas chromatography-mass spectrometry is usually made using mass spectral databases. There are a few algorithms that enable performing a search of a spectrum in a large mass spectral library. In many cases, a library search procedure returns a wrong answer even if a correct compound is contained in a library. In this work, we present a deep learning driven approach to a library search in order to reduce the probability of such cases. Machine learning ranking (learning to rank) is a class of machine learning and deep learning algorithms that perform a comparison (ranking) of objects. This work introduces the usage of deep learning ranking for small molecules identification using low-resolution electron ionization mass spectrometry. Instead of simple similarity measures for two spectra, such as the dot product or the Euclidean distance between vectors that represent spectra, a deep convolutional neural network is used. The deep learning ranking model outperforms other approaches and enables reducing a fraction of wrong answers (at rank-1) by 9-23% depending on the used data set. Spectra from the Golm Metabolome Database, Human Metabolome Database, and FiehnLib were used for testing the model.


Subject(s)
Deep Learning/standards , Gas Chromatography-Mass Spectrometry/methods , Machine Learning/standards , Metabolomics/methods , Humans
5.
Talanta ; 209: 120448, 2020 Mar 01.
Article in English | MEDLINE | ID: mdl-31892031

ABSTRACT

The most successful method for pyrolysis liquids analysis is comprehensive two-dimensional gas chromatography. Columns with a stationary liquid phase are used for this purpose. However, when is necessary to analyze a gas phase containing C3-C5 hydrocarbons over a liquid pyrolysis product, the use of columns with a liquid phase in CG*CG will not result to separation of light hydrocarbons. In this case, it is necessary to use PLOT columns with a porous layer of sorbents of various nature. Today this approach with two PLOT columns in GC*GC is not described, as well as its use for the analysis of light hydrocarbons resulting from pyrolysis. This paper describes an application of two PLOT columns in GC*GC mode. This paper describes an application of two PLOT columns in GC*GC mode. The next columns of different nature that have different selectivity were used: Rt-Q-BOND, Rt-S-BOND, Rt-U-BOND (columns based on divinylbenzene styrene copolymer), column with sorbent poly- (1-trimethylsilyl-1-propyne) (PTMSP) and an Agilent GASPRO silica column. The most suitable pair of the columns was determined by finding of their orthogonality. The numerical orthogonality data was found by studying of the correlation coefficients between compounds retention time on the first and second columns. It is shown that the best combination of columns are PTMSP - GASPRO and Rt-Q-BOND - GASPRO, however, the first combination of columns allows separation at the same temperature conditions about twice as fast as the second. Examples of the separation of С3-С8 hydrocarbons in the gas phase over pyrolysis mixtures of different origin are given.

6.
J Chromatogr A ; 1607: 460395, 2019 Dec 06.
Article in English | MEDLINE | ID: mdl-31405570

ABSTRACT

A deep convolutional neural network was used for the estimation of gas chromatographic retention indices on non-polar (polydimethylsiloxane and polydimethyl(5%-phenyl) siloxane) stationary phases. The neural network can be used for candidate ranking while searching a mass spectral database. A linear representation (SMILES notation) of the molecule structure was used as an input for the model. The input line was converted to a one-hot matrix and then directly processed by the neural network. The calculation of any common molecular descriptors is avoided, following the modern tendency in machine learning: to allow the neural network to find the most preferable features by itself instead of using hard-coded features. The model has two 1D-convolutional layers with 120 neurons each followed by a pooling layer and a fully-connected layer with 200 hidden neurons. The model was compared with state-of-the-art models for prediction of gas chromatographic indices based on molecular descriptors and on functional groups contributions. On different data sets better accuracy is shown together with greater versatility. The applicability to diverse sets of flavors and fragrances, essential oils, metabolites is shown. The possibility of using the model for improvement of mass spectral identification (without reference retention index) is demonstrated. The median absolute error and the median percentage error are in the range of 17.3 (0.93%) to 38.1 (2.15%) depending on used test data set. Ready-to-use neural network parameters are provided.


Subject(s)
Chromatography, Gas/methods , Neural Networks, Computer , Databases, Factual , Gas Chromatography-Mass Spectrometry , Regression Analysis
SELECTION OF CITATIONS
SEARCH DETAIL
...