Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Más filtros




Base de datos
Asunto de la revista
Intervalo de año de publicación
1.
J Chromatogr A ; 1730: 465144, 2024 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-38996513

RESUMEN

Ionic liquids, i.e., organic salts with a low melting point, can be used as gas chromatographic liquid stationary phases. These stationary phases have some advantages such as peculiar selectivity, high polarity, and thermostability. Many previous works are devoted to such stationary phases. However, there are still no large enough retention data sets of structurally diverse compounds for them. Consequently, there are very few works devoted to quantitative structure-retention relationships (QSRR) for ionic liquid-based stationary phases. This work is aimed at closing this gap. Three ionic liquids with substituted pyridinium cations are considered. We provide large enough data sets (123-158 compounds) that can be used in further works devoted to QSRR and related methods. We provide a QSRR study using this data set and demonstrate the following. The retention index for a polyethylene glycol stationary phase (denoted as RI_PEG), predicted using another model, can be used as a molecular descriptor. This descriptor significantly improves the accuracy of the QSRR model. Both deep learning-based and linear models were considered for RI_PEG prediction. The ability to predict the retention indices for ionic liquid-based stationary phases with high accuracy is demonstrated. Particular attention is paid to the reproducibility and reliability of the QSRR study. It was demonstrated that adding/removing several compounds, small perturbations of the data set can considerably affect the results such as descriptor importance and model accuracy. These facts have to be considered in order to avoid misleading conclusions. For the QSRR research, we developed a software tool with a graphical user interface, which we called CHERESHNYA. It is intended to select molecular descriptors and construct linear equations connecting molecular descriptors with gas chromatographic retention indices for any stationary phase. The software allows the user to generate several hundred molecular descriptors (one-dimensional and two-dimensional). Among them, predicted retention indices for popular stationary phases such as polydimethylsiloxane and polyethylene glycol are used as molecular descriptors. Various methods for selecting (and assessing the importance of) molecular descriptors have been implemented, in particular the Boruta algorithm, partial least squares, genetic algorithms, L1-regularized regression (LASSO) and others. The software is free, open-source and available online: https://github.com/mtshn/chereshnya.

2.
Chemosphere ; 362: 142679, 2024 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-38909863

RESUMEN

Thirty two commercially available standards were used to determine chromatographic retention indices for three different stationary phases (non-polar, polar and mid-polar) commonly used in gas chromatography. The selected compounds were nitrogen-containing heterocycles and amides, which are referred to in the literature as unsymmetrical dimethylhydrazine (UDMH) transformation products or its assumed transformation products. UDMH is a highly toxic compound widely used in the space industry. It is a reactive substance that forms a large number of different compounds in the environment. Well-known transformation products may exceed UDMH itself in their toxicity, but most of the products are poorly investigated, while posing a huge environmental threat. Experimental retention indices for the three stationary phases, retention indices from the NIST database, and predicted retention indices are presented in this paper. It is shown that there are virtually no retention indices for UDMH transformation products in the NIST database. In addition, even among those compounds for which retention indices were known, inconsistencies were identified. Adding retention indices to the database and eliminating erroneous data would allow for more reliable identification when standards are not available. The discrepancies identified between experimental retention index values and predicted values will allow for adjustments to the machine learning models that are used for prediction. Previously proposed compounds as possible transformation products without the use of standards and NMR method were confirmed.

3.
Molecules ; 28(8)2023 Apr 12.
Artículo en Inglés | MEDLINE | ID: mdl-37110641

RESUMEN

Unsymmetrical dimethylhydrazine (UDMH) is a widely used rocket propellant. Entering the environment or being stored in uncontrolled conditions, UDMH easily forms an enormous variety (at least many dozens) of transformation products. Environmental pollution by UDMH and its transformation products is a major problem in many countries and across the Arctic region. Unfortunately, previous works often use only electron ionization mass spectrometry with a library search, or they consider only the molecular formula to propose the structures of new products. This is quite an unreliable approach. It was demonstrated that a newly proposed artificial intelligence-based workflow allows for the proposal of structures of UDMH transformation products with a greater degree of certainty. The presented free and open-source software with a convenient graphical user interface facilitates the non-target analysis of industrial samples. It has bundled machine learning models for the prediction of retention indices and mass spectra. A critical analysis of whether a combination of several methods of chromatography and mass spectrometry allows us to elucidate the structure of an unknown UDMH transformation product was provided. It was demonstrated that the use of gas chromatographic retention indices for two stationary phases (polar and non-polar) allows for the rejection of false candidates in many cases when only one retention index is not enough. The structures of five previously unknown UDMH transformation products were proposed, and four previously proposed structures were refined.

4.
Chemosphere ; 307(Pt 1): 135764, 2022 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-35863423

RESUMEN

Unsymmetrical dimethylhydrazine (UDMH) is a toxic and environmentally hostile compound that was massively introduced to the environment during previous decades due to its use in the space and rocket industry. The compound forms multiple transformation products, and many of them are as dangerous as UDMH or even more dangerous. The danger includes, but is not limited to, acute toxicity, chronic health hazards, carcinogenicity, and environmental damage. UDMH transformation products are poorly investigated. In this work, the mixture formed by long storage of the waste that contained UDMH was studied. Even a preliminary screening of such a mixture is a complex task. It consists of dozens of compounds, and most of them are missing in chemical and spectral databases. The complete preparative separation of such a mixture is very laborious. We applied several methods of gas chromatography-mass spectrometry and liquid chromatography-mass spectrometry, and several machine learning and chemoinformatics methods to make a preliminary but informative screening of the mixture. Machine learning allowed predicting retention indices and mass spectra of candidate structures. The combination of various ion sources and a comparison of the observed with the predicted spectra and retention was used to propose confident structures for 24 compounds. It was demonstrated that neither high-resolution mass spectrometry nor mass spectral library matching is enough to elucidate the structures of unknown UDMH transformation products. At the same time, the use of machine learning and a combination of methods significantly improves the identification power. Finally, machine learning was applied to estimate the acute toxicity of the discovered compounds. It was shown that many of them are comparable to or even more toxic than UDMH itself. Such an extremely wide and still underestimated variety of easily formed derivatives of UDMH can lead to a significant underestimation of the potential hazard of this compound.


Asunto(s)
Mezclas Complejas , Aprendizaje Automático , Dimetilhidrazinas , Cromatografía de Gases y Espectrometría de Masas , Espectrometría de Masas/métodos
5.
Int J Mol Sci ; 22(17)2021 Aug 25.
Artículo en Inglés | MEDLINE | ID: mdl-34502099

RESUMEN

Prediction of gas chromatographic retention indices based on compound structure is an important task for analytical chemistry. The predicted retention indices can be used as a reference in a mass spectrometry library search despite the fact that their accuracy is worse in comparison with the experimental reference ones. In the last few years, deep learning was applied for this task. The use of deep learning drastically improved the accuracy of retention index prediction for non-polar stationary phases. In this work, we demonstrate for the first time the use of deep learning for retention index prediction on polar (e.g., polyethylene glycol, DB-WAX) and mid-polar (e.g., DB-624, DB-210, DB-1701, OV-17) stationary phases. The achieved accuracy lies in the range of 16-50 in terms of the mean absolute error for several stationary phases and test data sets. We also demonstrate that our approach can be directly applied to the prediction of the second dimension retention times (GC × GC) if a large enough data set is available. The achieved accuracy is considerably better compared with the previous results obtained using linear quantitative structure-retention relationships and ACD ChromGenius software. The source code and pre-trained models are available online.


Asunto(s)
Cromatografía de Gases/métodos , Aprendizaje Profundo , Cromatografía de Gases/normas
6.
Anal Chem ; 92(17): 11818-11825, 2020 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-32867500

RESUMEN

Preliminary compound identification and peak annotation in gas chromatography-mass spectrometry is usually made using mass spectral databases. There are a few algorithms that enable performing a search of a spectrum in a large mass spectral library. In many cases, a library search procedure returns a wrong answer even if a correct compound is contained in a library. In this work, we present a deep learning driven approach to a library search in order to reduce the probability of such cases. Machine learning ranking (learning to rank) is a class of machine learning and deep learning algorithms that perform a comparison (ranking) of objects. This work introduces the usage of deep learning ranking for small molecules identification using low-resolution electron ionization mass spectrometry. Instead of simple similarity measures for two spectra, such as the dot product or the Euclidean distance between vectors that represent spectra, a deep convolutional neural network is used. The deep learning ranking model outperforms other approaches and enables reducing a fraction of wrong answers (at rank-1) by 9-23% depending on the used data set. Spectra from the Golm Metabolome Database, Human Metabolome Database, and FiehnLib were used for testing the model.


Asunto(s)
Aprendizaje Profundo/normas , Cromatografía de Gases y Espectrometría de Masas/métodos , Aprendizaje Automático/normas , Metabolómica/métodos , Humanos
7.
Talanta ; 209: 120448, 2020 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-31892031

RESUMEN

The most successful method for pyrolysis liquids analysis is comprehensive two-dimensional gas chromatography. Columns with a stationary liquid phase are used for this purpose. However, when is necessary to analyze a gas phase containing C3-C5 hydrocarbons over a liquid pyrolysis product, the use of columns with a liquid phase in CG*CG will not result to separation of light hydrocarbons. In this case, it is necessary to use PLOT columns with a porous layer of sorbents of various nature. Today this approach with two PLOT columns in GC*GC is not described, as well as its use for the analysis of light hydrocarbons resulting from pyrolysis. This paper describes an application of two PLOT columns in GC*GC mode. This paper describes an application of two PLOT columns in GC*GC mode. The next columns of different nature that have different selectivity were used: Rt-Q-BOND, Rt-S-BOND, Rt-U-BOND (columns based on divinylbenzene styrene copolymer), column with sorbent poly- (1-trimethylsilyl-1-propyne) (PTMSP) and an Agilent GASPRO silica column. The most suitable pair of the columns was determined by finding of their orthogonality. The numerical orthogonality data was found by studying of the correlation coefficients between compounds retention time on the first and second columns. It is shown that the best combination of columns are PTMSP - GASPRO and Rt-Q-BOND - GASPRO, however, the first combination of columns allows separation at the same temperature conditions about twice as fast as the second. Examples of the separation of С3-С8 hydrocarbons in the gas phase over pyrolysis mixtures of different origin are given.

8.
J Chromatogr A ; 1607: 460395, 2019 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-31405570

RESUMEN

A deep convolutional neural network was used for the estimation of gas chromatographic retention indices on non-polar (polydimethylsiloxane and polydimethyl(5%-phenyl) siloxane) stationary phases. The neural network can be used for candidate ranking while searching a mass spectral database. A linear representation (SMILES notation) of the molecule structure was used as an input for the model. The input line was converted to a one-hot matrix and then directly processed by the neural network. The calculation of any common molecular descriptors is avoided, following the modern tendency in machine learning: to allow the neural network to find the most preferable features by itself instead of using hard-coded features. The model has two 1D-convolutional layers with 120 neurons each followed by a pooling layer and a fully-connected layer with 200 hidden neurons. The model was compared with state-of-the-art models for prediction of gas chromatographic indices based on molecular descriptors and on functional groups contributions. On different data sets better accuracy is shown together with greater versatility. The applicability to diverse sets of flavors and fragrances, essential oils, metabolites is shown. The possibility of using the model for improvement of mass spectral identification (without reference retention index) is demonstrated. The median absolute error and the median percentage error are in the range of 17.3 (0.93%) to 38.1 (2.15%) depending on used test data set. Ready-to-use neural network parameters are provided.


Asunto(s)
Cromatografía de Gases/métodos , Redes Neurales de la Computación , Bases de Datos Factuales , Cromatografía de Gases y Espectrometría de Masas , Análisis de Regresión
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA