Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 150
Filtrar
1.
J Chromatogr A ; 1708: 464341, 2023 Oct 11.
Artigo em Inglês | MEDLINE | ID: mdl-37660566

RESUMO

Comprehensive three-dimensional (3D) gas chromatography with time-of-flight mass spectrometry (GC3-TOFMS) is a promising instrumental platform for the separation of volatiles and semi-volatiles due to its increased peak capacity and selectivity relative to comprehensive two-dimensional gas chromatography with TOFMS (GC×GC-TOFMS). Given the recent advances in GC3-TOFMS instrumentation, new data analysis methods are now required to analyze its complex data structure efficiently and effectively. This report highlights the development of a cuboid-based Fisher ratio (F-ratio) analysis for supervised, non-targeted studies. This approach builds upon the previously reported tile-based F-ratio software for GC×GC-TOFMS data. Cuboid-based F-ratio analysis is enabled by constructing 3D cuboids within the GC3-TOFMS chromatogram and calculating F-ratios for every cuboid on a per-mass channel basis. This methodology is evaluated using a GC3-TOFMS data set of jet fuel spiked with both non-native and native components. The neat and spiked jet fuels were collected on a total-transfer (100 % duty cycle) GC3-TOFMS instrument, employing thermal modulation between the first (1D) and second dimension (2D) columns and dynamic pressure gradient modulation between the 2D and third dimension (3D) columns. In total, cuboid-based F-ratio analysis discovered 32 spiked analytes in the top 50 hits at concentration ratios as low as 1.1. In contrast, tile-based F-ratio analysis of the corresponding GC×GC-TOFMS data only discovered 28 of the spiked analytes total, with only 25 of them in the top 50 hits. Along with discovering more analytes, cuboid-based F-ratio analysis of GC3-TOFMS data resulted in fewer false positives. The increased discoverability is due to the added peak capacity and selectivity provided by the 3D column with GC3-TOFMS resulting in improved chromatographic resolution.


Assuntos
Projetos de Pesquisa , Software , Cromatografia Gasosa-Espectrometria de Massas
2.
Anal Chem ; 95(36): 13519-13527, 2023 Sep 12.
Artigo em Inglês | MEDLINE | ID: mdl-37647642

RESUMO

In this study, we introduce a new nontargeted tile-based supervised analysis method that combines the four-grid tiling scheme previously established for the Fisher ratio (F-ratio) analysis (FRA) with the estimation of tile hit importance using the machine learning (ML) algorithm Random Forest (RF). This approach is termed tile-based RF analysis. As opposed to the standard tile-based F-ratio analysis, the RF approach can be extended to the analysis of unbalanced data sets, i.e., different numbers of samples per class. Tile-based RF computes out-of-bag (oob) tile hit importance estimates for every summed chromatographic signal within each tile on a per-mass channel basis (m/z). These estimates are then used to rank tile hits in a descending order of importance. In the present investigation, the RF approach was applied for a two-class comparison of stool samples collected from omnivore (O) subjects and stored using two different storage conditions: liquid (Liq) and lyophilized (Lyo). Two final hit lists were generated using balanced (8 vs Eight comparison) and unbalanced (8 vs Nine comparison) data sets and compared to the hit list generated by the standard F-ratio analysis. Similar class-distinguishing analytes (p < 0.01) were discovered by both methods. However, while the FRA discovered a more comprehensive hit list (65 hits), the RF approach strictly discovered hits (31 hits for the balanced data set comparison and 29 hits for the unbalanced data set comparison) with concentration ratios, [OLiq]/[OLyo], greater than 2 (or less than 0.5). This difference is attributed to the more stringent feature selection process used by the RF algorithm. Moreover, our findings suggest that the RF approach is a promising method for identifying class-distinguishing analytes in settings characterized by both high between-class variance and high within-class variance, making it an advantageous method in the study of complex biological matrices.

3.
Talanta ; 259: 124525, 2023 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-37031541

RESUMO

Solid phase extraction (SPE) sample preparation for the analysis of complex organic mixtures is often applied assuming all analytes of interest will preconcentrate on the stationary phase. This assumption ignores the reality that extraction is a dynamic interactive process and a diverse range of affinities for the stationary phase will result in equally diverse breakthrough volumes due to competitive sorption processes. To study this dynamic interactive process, and further to take advantage of it, we extracted a JP-8 jet fuel spiked with 40 ppm of a polar compound mix with silica and alumina SPE cartridges and analyzed sequential extracted fractions of the fuel to both assess the shifting chemical landscape present in the extraction and the impact of both SPE stationary phases on this process. Tile-based 1v1 comparative analysis (a recently reported extension of tile-based Fisher ratio analysis) was used to discover the (polar) compounds whose concentrations change between extracted fractions, discovering 21 compounds extracted with silica and 27 compounds extracted with alumina with at least a 2-fold change in concentration from the neat sample relative to the first 1 mL pass fraction sample. These compounds were quantified in each fraction to construct concentration ratio profiles, defined as the concentration ratio for a given SPE fraction per analyte compound relative to the analyte concentration in the neat fuel, for which the extraction behavior for each analyte could be assessed. These analyte compounds were found to breakthrough at different rates, with some analytes remaining on the column indefinitely (until extracted with a subsequent polar solvent) and other analytes eluting before the extraction is complete. Furthermore, in a comparison of the effect of selected stationary phase, alumina was found to retain oxygen-containing phenolic compounds to a greater extent than silica. Principal component analysis (PCA) was used to analyze the concentration ratio profiles of the various trace analytes in the JP8 fuel (phenols, indoles, etc.) in the context of their stationary phase affinity (silica or alumina) and competitive sorption behavior.

4.
J Chromatogr A ; 1694: 463920, 2023 Apr 12.
Artigo em Inglês | MEDLINE | ID: mdl-36933463

RESUMO

Chemometric methods like partial least squares (PLS) regression are valuable for correlating sample-based differences hidden in comprehensive two-dimensional gas chromatography (GC × GC) data to independently measured physicochemical properties. Herein, this work establishes the first implementation of tile-based variance ranking as a selective data reduction methodology to improve PLS modeling performance of 58 diverse aerospace fuels. Tile-based variance ranking discovered a total of 521 analytes with a square of the relative standard deviation (RSD2) in signal between 0.07 to 22.84. The goodness-of-fit for the models were determined by their normalized root-mean-square error of cross-validation (NRMSECV) and normalized root-mean-square error of prediction (NRMSEP). PLS models developed for viscosity, hydrogen content, and heat of combustion using all 521 features discovered by tile-based variance ranking had a respective NRMSECV (NRMSEP) equal to 10.5 % (10.2 %), 8.3 % (7.6 %), and 13.1 % (13.5 %). In contrast, use of a single-grid binning scheme, a common data reduction strategy for PLS analysis, resulted in less accurate models for viscosity (NRMSECV = 14.2 %; NRMSEP = 14.3 %), hydrogen content (NRMSECV = 12.1 %; NRMSEP = 11.0 %), and heat of combustion (NRMSECV = 14.4 %; NRMSEP = 13.6 %). Further, the features discovered by tile-based variance ranking can be optimized for each PLS model with RReliefF analysis, a machine learning algorithm. RReliefF feature optimization selected 48, 125, and 172 analytes out of the original 521 discovered by tile-based variance ranking to model viscosity, hydrogen content, and heat of combustion, respectively. The RReliefF optimized features developed highly accurate property-composition models for viscosity (NRMSECV = 7.9 %; NRMSEP = 5.8 %), hydrogen content (NRMSECV = 7.0 %; NRMSEP = 4.9 %), heat of combustion (NRMSECV = 7.9 %; NRMSEP = 8.4 %). This work also demonstrates that processing the chromatograms with a tile-based approach allows the analyst to directly identify the analytes of importance in a PLS model. Coupling tile-based feature selection with PLS analysis allows for deeper understanding in any property-composition study.


Assuntos
Algoritmos , Análise dos Mínimos Quadrados , Cromatografia Gasosa-Espectrometria de Massas/métodos
6.
Talanta ; 254: 124173, 2023 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-36512972

RESUMO

We examine and then optimize alignment of chromatograms collected on nominally identical columns using retention time locking (RTL), an instrumental alignment tool, and software-based alignment using correlation optimized warping (COW). For this purpose, three samples are constructed by spiking two sets of analytes into a base test mixture. The three samples are analyzed by high-speed gas chromatography with four nominally identical columns and identical separation conditions. The data is first analyzed without alignment, then using COW alone, then RTL alone, and finally with RTL followed by COW to correct the severe column-to-column misalignment. Principal component analysis (PCA) is used to investigate how well each alignment method clustered the chromatograms into the three sample classes via a scores plot without being compromised by the specific column(s) used. The degree-of-class separation (DCS) is used as a classification metric, measured as the Euclidian distance between the centroids of two clusters in PC space in the scores plot, normalized by their pooled variance. With no alignment, the average DCS between sample classes (DCSsam) was 3.0, while the average DCS between the four nominally identical columns, i.e., column classes (DCScol) was 76.1 (ideally the DCScol should be 0), indicating the chromatograms were initially classified by the columns used. Using either COW or RTL alone also produced unsatisfactory results, with COW alone incorrectly aligning many peaks, leading to a DCSsam of only 1.9 and DCScol of 1.7, while RTL alone provided a DCSsam of 4.7 and DCScol of 4.2. Finally, using RTL followed by COW alignment, DCSsam increased to 32.5, indicating successful classification by chemical differences between sample classes, while the DCScol decreased to 0.4, indicating virtually no classification due to column-to-column differences, as desired. Thus, RTL provided a "first-order" correction of the initial retention mismatch observed for the nominally identical columns, while additional alignment via COW was required to optimize sample classification by PCA.


Assuntos
Algoritmos , Cromatografia Gasosa/métodos , Análise de Componente Principal
7.
Anal Chem ; 95(2): 1513-1521, 2023 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-36563309

RESUMO

Nontargeted analyses of low-concentration analytes in the information-rich data collected by liquid chromatography with high-resolution mass spectrometry detection can be challenging to accomplish in an efficient and comprehensive manner. The aim of this study is to demonstrate a workflow involving targeted parameter optimization for entire chromatograms using region of interest (ROI) data compression uncoupled from a subsequent tile-based Fisher ratio (F-ratio) analysis, a supervised discovery-based method, for the discovery of low-concentration analytes. Soil samples spiked with 18 pesticides at nominal concentrations ranging from 0.1 to 50 ppb for a total of six sample classes served as challenging samples to demonstrate the overall workflow. Optimization of two parameters proved to be the most critical for ROI data compression: the signal threshold parameter and the admissible mass deviation parameter. The parameter optimization method workflow we introduce is based upon spiking known analytes into a representative sample and determining the number of detectable spikes and the Δppm for various combinations of the signal threshold and admissible mass deviation, where Δppm is the absolute value of the difference between the theoretical m/z and the ROI m/z. Once optimal parameters are determined providing the lowest average Δppm and the greatest number of detectable analytes, the optimized parameters can be utilized for the intended analysis. Herein, tile-based F-ratio analysis was performed on the ROI compressed data of all spiked soil samples first by applying ROI parameters recommended in the literature, referred to herein as the initial ROI parameters, and finally by the combination of the two optimized parameters. Using the initial ROI parameters, three pesticides were discovered, whereas all 18 spiked pesticides were discovered by optimizing both ROI parameters.


Assuntos
Compressão de Dados , Praguicidas , Espectrometria de Massas , Cromatografia Líquida/métodos , Solo
8.
Anal Bioanal Chem ; 415(13): 2411-2423, 2023 May.
Artigo em Inglês | MEDLINE | ID: mdl-36181512

RESUMO

Tile-based Fisher ratio (F-ratio) analysis of comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GC × GC-TOFMS) data is a powerful, supervised discovery methodology for pinpointing sample class-distinguishing analytes between two or more sample classes. Herein, we extend this analytical methodology to focus upon specific chemical groups in kerosene-based aerospace fuel using solid-phase extraction (SPE). Treating samples with SPE removes specific compounds depending on the SPE stationary phase (i.e., silica), creating an altered "pass" sample, identical to the original "neat" sample except for the extracted compounds. Application of F-ratio analysis to the neat samples against the pass samples provides global discovery with a numerically sorted hit list of all analytes affected by the SPE procedure. Sections of GC × GC-TOFMS data from the top analyte hits are reconstructed to form a "stitch" chromatogram to visualize the sample class-distinguishing compounds, revealing excellent agreement with the extract chromatogram. Additionally, utilizing the four-grid tiling scheme developed for tile-based F-ratio analysis, we demonstrate a tile-based pairwise analysis method, referred to as 1v1 analysis, to discover analytes that differ in concentration between two fuel chromatograms. Application of 1v1 analysis is highly efficient since replicates do not necessarily need to be run on the GC × GC-TOFMS instrument, which is beneficial for sample-limited applications. The 1v1 analyses discovered most of the same features as F-ratio analysis, ranging from 69 to 81% of the features discovered by F-ratio analysis while requiring one-sixth the data. Lastly, the overall methodology is applied to three candidate rocket fuels to better understand the compound class-distinguishing differences. The separate hit lists produced for high-concentration bulk hydrocarbon differences and low-concentration level polar compound differences provided valuable insight into these candidate rocket fuels.

9.
J Chromatogr A ; 1677: 463321, 2022 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-35853427

RESUMO

Untargeted analysis of comprehensive two-dimensional (2D) gas chromatography time-of-flight mass spectrometry (GC×GC-TOFMS) data has the potential to be hindered by run-to-run retention time shifting. To address this challenge, tile-based Fisher ratio (F-ratio) analysis (FRA) has been developed, which utilizes a supervised, untargeted approach involving a chromatographic segmentation routine termed "tiling" combined with the ANOVA F-ratio statistic to discover class-distinguishing analytes while minimizing false positives arising from shifting. The tiling algorithm is designed to account for retention shifting in both separation dimensions. Although applications of FRA have been reported, there remains a need to thoroughly evaluate the robustness of FRA for different levels of run-to-run retention shifting in order to broaden the scope of its application. To this end, a novel method of simulating GC×GC-TOFMS chromatograms with realistic run-to-run shifting is presented by random generation of low-frequency "shift functions". The dimensionless retention-time precision, <δr>, which is four times the standard deviation in retention time normalized to the peak width-at-base is used as a key modeling variable along with the 2D chromatographic saturation, αe,2D, and within-class relative standard deviation in peak area, RSDwc. We demonstrate that all three of these variables operate together to impact true positive discovery. To quantify the "success" of true positive discovery, GC×GC-TOFMS datasets for various combinations of <δr>, αe,2D, and RSDwc were simulated and then analyzed by FRA using a wide range of relative tile areas (RTA), which is a dimensionless measure of tile size. Since each hit in the FRA hit list was known a priori as either a true or false positive based on the simulation inputs, receiver operating characteristic (ROC) curves were readily constructed. Then, the area under the ROC curve (AUROC) was used as a metric for discovery "success" for various combinations of the modeling variables. Based on the results of this study, recommendations for tile size selection and experimental design are provided, and further supported by comparison to previous tile-based FRA applications. For instance, values for <δr>, αe,2D, and RSDwc obtained from a GC×GC-TOFMS dataset of yeast metabolites suggested an optimum RTA of 6.25, corresponding closely to the RTA of 4.00 employed in the study, implying the simulation results obtained here can be generalized to real datasets.


Assuntos
Algoritmos , Saccharomyces cerevisiae , Cromatografia Gasosa-Espectrometria de Massas/métodos , Curva ROC
10.
Anal Chem ; 94(26): 9407-9414, 2022 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-35728566

RESUMO

An analytical workflow for the analysis of olefins in gasoline that combines selective bromination and comprehensive two-dimensional (2D) gas chromatography time-of-flight mass spectrometry (GC×GC-TOFMS) with discovery-based analysis is reported. First, a standard mix containing n-alkanes, 1-alkenes, and aromatic species was brominated and quantified using % reacted as a metric for each compound class, defined as the difference in the total peak area between the brominated and original samples normalized to the original sample. The average % reacted (1 s.d.) values were -1.45% (2.8%) for the alkanes, 99.5% (0.4%) for the alkenes, and 6.7% (11.6%) for the aromatics, demonstrating excellent selectivity toward the alkenes with only minor aromatic bromination. The bromination chemistry was then applied to gasoline, followed by GC×GC-TOFMS analysis of the original and brominated gasoline. This GC×GC-TOFMS data set was then submitted to the supervised discovery tool tile-based F-ratio analysis (FRA), which reduced the large data set to only the chromatographic regions which distinguish between the original and brominated gasoline samples. FRA discovered 314 hits, 56 of which were determined statistically significant using combinatorial null distribution analysis (CNDA), a permutation-based significance test. Since the brominated olefins elute in an uncrowded region of the 2D chromatogram and have no signal in the original sample, their discoverability was greatly increased relative to the original olefins. By combining the information gained from brominated olefin standards and the structured patterns of the GC×GC separations, the top hits were identified as the dibromoalkane products of mono-olefins, with five C5 mono-olefins identified on a species level. The analytical workflow has broad implications for using selective reaction chemistries to facilitate supervised discovery by targeting desired compound classes.


Assuntos
Alcenos , Gasolina , Alcanos/análise , Alcenos/análise , Cromatografia Gasosa-Espectrometria de Massas/métodos , Gasolina/análise , Halogenação
11.
Anal Chim Acta ; 1209: 339847, 2022 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-35569851

RESUMO

Tile-based variance rank initiated-unsupervised sample indexing (VRI-USI) analysis is introduced for comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GC×GC-TOFMS). VRI-USI analysis addresses the challenge that irrelevant variables can often obscure true chemical variation when using other unsupervised chemometric tools. Implementation of VRI-USI analysis with GC×GC-TOFMS data incorporates the tile-based Fisher ratio (F-ratio) analysis software platform that mitigates the effects of retention shifting in both separation dimensions with an unsupervised variance metric (instead of the F-ratio metric) as the initial step of ranking the hitlist. Next, implementation of k-means clustering, k, per hit using the silhouette metric, Smax, is used to reveal to what extent recurring indexed sample clusters are uncovered. Finally, based upon a probability-based evaluation of how the individual samples cluster throughout the hitlist an unsupervised class membership is revealed. For a JP8 jet fuel dataset spiked with a sulfur-containing analyte mix at 30-ppm, 15-ppm, and neat, clustering by spike level at k = 3 was the most commonly re-occurring set of index assignments, occurring for 11 out of 14 spiked analytes. Upon application of these k-means index assignments to the entire hitlist, all 14 spiked hits had one way ANOVA p-values < 0.05, validating the presumption of classes. Next, application of VRI-USI to a 3-ppm spiked and neat JP8 jet fuel comparison exhibited similar performance to F-ratio analysis for analyte discovery. In the last study, for a dataset of J1800A, JP4, and JP8 jet fuel, each spiked with the sulfur-containing analyte mix at 30-ppm and neat, 453 out of 520 hits in the hitlist exhibited index assignments indicative of fuel type clustering, with the remaining 67 hits having contradictory assignments. Scrutinization of these 67 hits revealed nine hits with "split combinations" in index assignments, whereby the spiked and neat samples for a given fuel were in separate clusters. Eight of these hits were identified as spiked sulfur analytes. Interestingly, these hits also had large Smax indicative of a true sub-cluster. Thus, tile-based VRI-USI analysis appears to be a promising tool for unsupervised multi-class classification studies using GC×GC-TOFMS data.


Assuntos
Software , Enxofre , Análise por Conglomerados , Cromatografia Gasosa-Espectrometria de Massas/métodos
12.
Talanta ; 244: 123396, 2022 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-35354112

RESUMO

A computational method for the untargeted determination of cycling yeast metabolites using a comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GC×GC-TOFMS) dataset is presented. The yeast metabolomic cycle for the diploid yeast strain CEN.PK with a 5 h cycle period relative to the O2 concentration level is comprehensively examined to determine the metabolites that exhibit cycling. Samples were collected over only two cycles (10 h with a total of 24 time-point sampling intervals at 25 min each) as an experimental constraint. Due to the limited number of cycles expressed in the dataset, a computational method was devised to determine with statistical significance whether or not a given metabolite exhibited a temporal signal pattern that constituted cycling in the context of the 5 h cycle period. The computational method we report compares the experimentally obtained 24 time-point metabolite signal sequences to randomly generated signal sequences coupled with statistically based confidence level LOF metrics to determine whether or not a given metabolite expresses cycling, and if so, what is the phase of the cycling. Initially the GC×GC-TOFMS dataset was analyzed using tile-based Fisher ratio (F-ratio) analysis. Since there were 24 time-point intervals, this constituted 24 sample classes in the F-ratio calculation which produced 672 metabolite hits. Next, application of the computational method determined that there were 210 of the 672 metabolites exhibiting cycling: 55 identified metabolites and 155 unknown metabolites. Furthermore, the 210 cycling metabolites were categorized into four groups, and where applicable, a phase determined: 1 cycle/5 h period (106 metabolites), 2 cycles/5 h period (13 metabolites), spiky pattern (12 metabolites), or multimodal pattern (79 metabolites).


Assuntos
Metabolômica , Saccharomyces cerevisiae , Cromatografia Gasosa-Espectrometria de Massas/métodos , Sinais Direcionadores de Proteínas , Saccharomyces cerevisiae/metabolismo
13.
Anal Chem ; 94(14): 5658-5666, 2022 04 12.
Artigo em Inglês | MEDLINE | ID: mdl-35347985

RESUMO

A new tile-based pairwise analysis workflow, termed 1v1 analysis, is presented to discover and identify analytes that differentiate two chromatograms collected using comprehensive two-dimensional (2D) gas chromatography coupled with time-of-flight mass spectrometry (GC × GC-TOFMS). Tile-based 1v1 analysis easily discovered all 18 non-native analytes spiked in diesel fuel within the top 30 hits, outperforming standard pairwise chromatographic analyses. However, eight spiked analytes could not be identified with multivariate curve resolution-alternating least-squares (MCR-ALS) nor parallel factor analysis (PARAFAC) due to background contamination. Analyte identification was achieved with class comparison enabled-mass spectrum purification (CCE-MSP), which obtains a pure analyte spectrum by normalizing the spectra to an interferent mass channel (m/z) identified from 1v1 analysis and subtracting the two spectra. This report also details the development of CCE-MSP assisted MCR-ALS, which removes the identified interferent m/z from the data prior to decomposition. In total, 17 out of 18 spiked analytes had a match value (MV) > 800 with both versions of CCE-MSP. For example, MCR-ALS and PARAFAC were unable to decompose the pure spectrum of methyl decanoate (MVs < 200) due to its low 2D chromatographic resolution (∼0.34) and high interferent-to-analyte signal ratio (∼30:1). By leveraging information gained from 1v1 analysis, CCE-MSP and CCE-MSP assisted MCR-ALS obtained a pure spectrum with an average MV of 908 and 964, respectively. Furthermore, tile-based 1v1 analysis was applied to track moisture damage in cacao beans, where 86 analytes with at least a 2-fold concentration change were discovered between the unmolded and molded samples. This 1v1 analysis workflow is beneficial for studies where multiple replicates are either unavailable or undesirable to save analysis time.


Assuntos
Gasolina , Cromatografia Gasosa-Espectrometria de Massas/métodos , Gasolina/análise , Análise dos Mínimos Quadrados , Espectrometria de Massas
14.
J Chromatogr A ; 1667: 462868, 2022 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-35182910

RESUMO

Integration of rice and fish farming, eg., pacu fish in Argentina, has raised concern that herbicides used for rice paddies may adversely affect the fish metabolome. To study this issue, tile-based Fisher ratio (F-ratio) analysis was applied to comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometry (GC × GC-TOFMS) data of pacu fish raised in an integrated rice-fish farming system (farmed class) versus fish raised in tanks (tank-raised class) to discover class-distinguishing analytes. F-ratio analysis resulted in hit lists initially dominated by artifact peaks from the sample derivatization process, as well as some redundant hits. These challenges were addressed by developing an automated artifact removal algorithm and an improvement to redundant hit removal in the tile-based F-ratio analysis using a pooled farmed fish sample, either spiked with 29 metabolites (spiked class) or unspiked (i.e., the background serving as the control class). Of the 29 spiked metabolites, 23 were discovered by standard F-ratio analysis, improving to 28 discovered using control-normalized F-ratio analysis. Standard and control-normalized F-ratio hit lists initially with 185 and 246 hits, were reduced to 56 and 49 hits, respectively, after artifact removal and removing redundant hits. Next, we returned to the F-ratio analysis of the farmed fish versus the control fish. Here, we introduce a minimum variance optimized (MVO) F-ratio calculation (MVOF-ratio) that provides a comprehensive hit list ranking. The initial MVOF-ratio analysis hit list of 537 hits was reduced to 110 hits following artifact removal. Of the 110 hits, 70 expressed a concentration ratio statistically different than 1 (p < 0.05). The MVOF-ratio hit list discovered more true positives compared to the standard, tank-normalized, and farm-normalized F-ratio hit lists, providing the combination of results from the farm-normalized and tank-normalized hit lists. A majority of analytes (54 out of 70) important for normal biological functioning of pacu fish were significantly downregulated in the farmed fish, suggesting the integrated farming system may negatively impact pacu fish quality.


Assuntos
Algoritmos , Metaboloma , Cromatografia Gasosa-Espectrometria de Massas/métodos
15.
J Chromatogr A ; 1662: 462735, 2022 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-34936905

RESUMO

The volatile fraction of food, also called the food volatilome, is increasingly used to develop new fingerprinting approaches. The characterization of the food volatilome is important to achieve desired flavor profiles in food production processes, or to differentiate different products, with winemaking being one popular area of interest. In the present research, headspace solid-phase microextraction (HS SPME) coupled to flow-modulated comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometry (FM GC×GC-TOFMS) was used to characterize geographical-based differences in the volatilome of five white "Grillo" wines (of Sicilian origin), comprising the five sample classes. All wines were produced with the same vinification method in 2019. To minimize the influence of minor bottle-to-bottle differences, three bottles of the same wine were randomly selected, and three samples were collected per bottle, resulting in nine sample replicates per wine. Particular emphasis was devoted to the operational conditions of a novel low duty cycle flow modulator. A fast FM GC×GC-TOFMS method with a modulation period of 700 ms and a re-injection period of 80 ms was developed. Following, the instrumental software was exploited to identify class-distinguishing analytes in the dataset via tile-based Fisher ratio analysis (i.e., ChromaTOF Tile). A tile size of 10 modulations (7 s) on the first dimension and 45 spectra (300 ms) on the second dimension was used to encompass average peak widths and to account for minor retention time shifting. Off-line software was used to apply an ANOVA test. A p-value of 0.01 was applied in order to select the most important class-distinguishing analytes, which were input to principal component analysis (PCA). The PCA scores plot showed distinct clustering of the wines according to geographical origin, although the loadings revealed that only a few analytes were necessary to differentiate the wines. However, a comprehensive flavor profile assessment underscored the importance of all the information output by the ChromaTOF Tile software.


Assuntos
Compostos Orgânicos Voláteis , Vinho , Cromatografia Gasosa-Espectrometria de Massas , Espectrometria de Massas , Microextração em Fase Sólida , Compostos Orgânicos Voláteis/análise , Vinho/análise
16.
Talanta ; 236: 122844, 2022 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-34635234

RESUMO

Tile-based Fisher ratio (F-ratio) analysis is emerging as a versatile data analysis tool for supervised discovery-based experimentation using comprehensive two-dimensional (2D) gas chromatography coupled with time-of-flight mass spectrometry (GC × GC-TOFMS). None the less, analyte identification can often be marred by poor 2D resolution and low analyte abundance relative to overlapping compounds. Linear algebra-based chemometric methods, in particular multivariate curve resolution alternating least squares (MCR-ALS), parallel factor analysis (PARAFAC) and PARAFAC2, are often applied in an effort to address this situation. However, these chemometric methods can fail to produce an accurate spectrum when the analyte is at low 2D resolution and/or in low relative abundance. To address this challenge, we introduce class comparison enabled mass spectrum purification (CCE-MSP), a method that utilizes the underlying requirement for signal consistency of the background interference compounds between the two classes in the F-ratio analysis to purify the mass spectrum of the analyte hits. CCE-MSP is validated using a dataset obtained for a neat JP-8 jet fuel spiked with 14 sulfur containing compounds at two levels (15 ppm and 30 ppm), using the p-value and lack-of-fit (LOF) for each analyte hit as consistency metrics. A purified mass spectrum was produced for each spiked analyte hit and their mass spectrum match value (MV) was compared to the MV obtained by MCR-ALS, PARAFAC, and PARAFAC2. The resulting MV for CCE-MSP were found to be as good or better than these chemometric methods, eg., for 2-butyl-5-ethylthiophene with an analyte-to-interference relative signal abundance of 1:87 and a 2D resolution of 0.2, CCE-MSP produced a MV of 831, compared to 476 for MCR-ALS, 403 for PARAFAC, and 336 for PARAFAC2. CCE-MSP is also extended to obtain the purified spectrum for more than one analyte, eg., two analyte hits in overlapping hit locations. The spectra produced by CCE-MSP can also be utilized as estimates to facilitate quantitative signal decomposition using MCR-ALS.


Assuntos
Espectrometria de Massas , Análise Fatorial , Cromatografia Gasosa-Espectrometria de Massas , Análise dos Mínimos Quadrados
17.
J Chromatogr A ; 1652: 462358, 2021 Aug 30.
Artigo em Inglês | MEDLINE | ID: mdl-34237483

RESUMO

A baseline correction method is developed for comprehensive two-dimensional (2D) chromatography (GC × GC) with flame-ionization detection (FID) using dynamic pressure gradient modulation (DPGM). The DPGM-GC × GC-FID utilized porous layer open tubular (PLOT) columns in both dimensions to focus on light hydrocarbon separations. Since DPGM is nominally a stop-flow modulation technique, a rhythmic baseline disturbance is observed in the FID signal that cycles with the modulation period (PM). This baseline disturbance needs to be corrected to optimize trace analysis. The baseline correction method has three steps: collection of a background "blank" chromatogram and multiplying it by an optimized normalization factor, subtraction of the normalization-optimized background chromatogram from a sample chromatogram, and application of Savitzky-Golay smoothing. An alkane standard solution, containing pentane, hexane and heptane was used for method development, producing linear calibration curves (r2 > 0.991) over a broad concentration range (7.8 ppm - 4000 ppm). Further, the limit-of-detection (LOD) and limit-of-quantification (LOQ) were determined for pentane (LOD = 2.5 ppm, LOQ = 8.2 ppm), hexane (LOD = 0.9 ppm, LOQ = 3.0 ppm), and heptane (LOD = 1.9 ppm, LOQ = 6.4 ppm). A natural gas sample separation illustrated method applicability, whereby the DPGM produced a signal enhancement (SE) of 30 for isopentane, where SE is defined as the height of the tallest 2D peak in the modulated chromatogram for the analyte divided by the height of the unmodulated 1D peak. The 30-fold SE resulted in about a 10-fold improvement in the signal-to-noise ratio (S/N) for isopentane. Additional versatility of the baseline correction method for more complicated samples was demonstrated for an unleaded gasoline sample, which enabled the detection (and visual appearance) of trace components.


Assuntos
Ionização de Chama/métodos , Alcanos/química , Gasolina/análise , Hidrocarbonetos/isolamento & purificação , Limite de Detecção , Gás Natural/análise , Pentanos/análise
18.
Talanta ; 233: 122495, 2021 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-34215113

RESUMO

Traditional non-targeted chemometric workflows for gas chromatography-mass spectrometry (GC-MS) data rely on using supervised methods, which requires a priori knowledge of sample class membership. Herein, we propose a simple, unsupervised chemometric workflow known as variance rank initiated-unsupervised sample indexing (VRI-USI). VRI-USI discovers analyte peaks exhibiting high relative variance across all samples, followed by k-means clustering on the individual peaks. Based upon how the samples cluster for a given peak, a sample index assignment is provided. Using a probabilistic argument, if the same sample index assignment appears for several discovered peaks, then this outcome strongly suggests that the samples are properly classified by that particular sample index assignment. Thus, relevant chemical differences between the samples have been discovered in an unsupervised fashion. The VRI-USI workflow is demonstrated on three, increasingly difficult datasets: simulations, yeast metabolomics, and human cancer metabolomics. For simulated GC-MS datasets, VRI-USI discovered 85-90% of analytes modeled to vary between sample classes. Nineteen out of 53 peaks in the peak table developed for the yeast metabolome dataset had the same sample index assignments, indicating that those indices are most likely due to class-distinguishing chemical differences. A t-test revealed that 22 out of 53 peaks were statistically significant (p < 0.05) when using those sample index assignments. Likewise, for the human cancer metabolomics study, VRI-USI discovered 25 analytes that were statistically different (p < 0.05) using the sample index assignments determined to highlight meaningful sample-based differences. For all datasets, the sample index assignments that were deduced from VRI-USI were the correct class-based difference when using prior knowledge. VRI-USI holds promise as an exploratory data analysis workflow for studies in which analysts do not readily have a priori class information or want to uncover the underlying nature of their dataset.


Assuntos
Metaboloma , Metabolômica , Análise por Conglomerados , Cromatografia Gasosa-Espectrometria de Massas , Humanos , Fluxo de Trabalho
19.
Anal Chem ; 93(24): 8526-8535, 2021 06 22.
Artigo em Inglês | MEDLINE | ID: mdl-34097388

RESUMO

We investigate the extent to which comprehensive three-dimensional gas chromatography (GC3) provides a signal enhancement (SE) and a signal-to-noise ratio enhancement (S/NRel) relative to one-dimensional (1D)-GC. Specifically, the SE is defined as the ratio of the tallest 3D peak height from the GC3 separation to the 1D peak height from the unmodulated 1D-GC separation. A model is proposed which allows the analyst to predict the theoretically attainable SE (SET) based upon the peak width and sampling density inputs. The model is validated via comparison of the SET to the experimentally measured SE (SEM) obtained using total-transfer GC3 (100% duty cycle for both modulators) with time-of-flight mass spectrometry detection. Two experimental conditions were studied using the same GC3 column set, differing principally in the modulation period from the 1D to 2D columns: 4 s versus 8 s. Under the first set of conditions, the average SEM was 97 (±22), in excellent agreement with the SET of 97 (±18). The second set of conditions improved the average SEM to 181 (±27), also in agreement with the average SET of 176 (±26). The average S/NRel following correction for the mass spectrum acquisition frequency was 38.8 (±11.2) and 59.0 (±27.2) for the two sets of conditions. The enhancement in S/N is largely attributed to moving the signal to a higher frequency domain where the impact of "low frequency" noise is less detrimental. The findings here provide strong evidence that GC3 separations can provide enhanced detectability relative to 1D-GC and comprehensive two-dimensional gas chromatography (GC×GC) separations.


Assuntos
Cromatografia Gasosa-Espectrometria de Massas , Espectrometria de Massas , Razão Sinal-Ruído
20.
J Chromatogr A ; 1644: 462092, 2021 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-33823385

RESUMO

Comprehensive two-dimensional gas chromatography coupled with time-of-flight mass spectrometry (GC×GC-TOFMS) is followed by tile-based Fisher ratio (F-ratio) analysis to investigate the "limit of discovery" for low concentration levels of sulfur-containing compounds in JP8 jet fuel. A mixture of 14 sulfur-containing compounds was spiked at 30 ppm, 15 ppm, 3 ppm and 1.5 ppm into the neat fuel prior to GC×GC-TOFMS analysis with a "reversed" column format (aka polar first dimension (1D) and non-polar second dimension (2D) column). Prior standard implementation of tile-based F-ratio analysis utilized an average F-ratio requiring a minimum of 3 mass channels (m/z) with the highest F-ratios. Herein, we explore the notion that use of the top F-ratio m/z for hitlist ranking is superior to the standard implementation for analytes near their limit-of-quantitation (LOQ), defined as an analyte concentration that produces a signal equal to ten times the standard deviation of the baseline noise (10σn). Hitlist ranking comparisons revealed that using only the top F-ratio m/z resulted in impressive improvements in discoverability for the low concentration comparisons. Specifically, for the 3 ppm versus neat hitlist, 1,4-oxathiane (LOQ = 2.5 ppm) improved from hit 114 via standard F-ratio analysis, to hit 25. For the 1.5 ppm versus neat hitlist, 2-propylthiophene (LOQ = 0.64 ppm) improved from hit 59 to 17, benzo[b]thiophene (LOQ = 1.1 ppm) from hit 98 to 28, and 2,5-dimethylthiophene (LOQ = 1.3 ppm) from hit 262 to 39. Additional hitlist ranking comparisons revealed the importance of proper tile size selection, as analyte discoverability deteriorated upon using either an inappropriately too small or too large of a tile.


Assuntos
Cromatografia Gasosa-Espectrometria de Massas/métodos , Limite de Detecção , Hidrocarbonetos/análise , Enxofre/análise , Tiofenos/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA