Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 70
Filtrar
1.
J Am Soc Mass Spectrom ; 35(2): 266-274, 2024 Feb 07.
Artículo en Inglés | MEDLINE | ID: mdl-38271611

RESUMEN

Calculating spectral similarity is a fundamental step in MS/MS data analysis in untargeted metabolomics experiments, as it facilitates the identification of related spectra and the annotation of compounds. To improve matching accuracy when querying an experimental mass spectrum against a spectral library, previous approaches have proposed increasing peak intensities for high m/z ranges. These high m/z values tend to be smaller in magnitude, yet they offer more crucial information for identifying the chemical structure. Here, we evaluate the impact of using these weights for identifying structurally related compounds and mass spectral library searches. Additionally, we propose a weighting approach that (i) takes into account the frequency of the m/z values within a spectral library in order to assign higher importance to the most common peaks and (ii) increases the intensity of lower peaks, similar to previous approaches. To demonstrate our approach, we applied weighting preprocessing to modified cosine, entropy, and fidelity distance metrics and benchmarked it against previously reported weights. Our results demonstrate how weighting-based preprocessing can assist in annotating the structure of unknown spectra as well as identifying structurally similar compounds. Finally, we examined scenarios in which the utilization of weights resulted in diminished performance, pinpointing spectral features where the application of weights might be detrimental.


Asunto(s)
Metabolómica , Espectrometría de Masas en Tándem , Metabolómica/métodos , Iones
2.
J Chem Inf Model ; 62(18): 4403-4410, 2022 09 26.
Artículo en Inglés | MEDLINE | ID: mdl-36107950

RESUMEN

Here, we provide an algorithm that introduces excited states into the molecular dynamics prediction of the 70 eV electron ionization mass spectra. To decide the contributions of different electronic states, the ionization cross section associated with relevant molecular orbitals was calculated by the binary-encounter-Bethe (BEB) model. We used a fast orthogonalization model/single and double state configuration interaction (OM2/CISD) method to implement excited states calculations and combined this with the GFN1-xTB semiempirical model. Demonstrated by predicting the mass spectrum of urocanic acid, we showed better accuracies to experimental spectra using excited-state molecular dynamics than calculations that only used the ground-state occupation. For several histidine pathway intermediates, we found that excited-state corrections yielded an average of 73% more true positive ions compared to the OM2 method when matching to experimental spectra and 16% more true positive ions compared to the GFN method. Importantly, the exited state models also correctly predict several fragmentation reactions that were missing from both ground-state methods. Overall, for 48 calculated molecules, we found the best average mass spectral similarity scores for the mixed excited-state method compared to the ground-state methods using either cosine, weighted dot score, or entropy similarity calculations. Therefore, we recommend adding excited-state calculations for predicting the electron ionization mass spectra of small molecules in metabolomics.


Asunto(s)
Electrones , Ácido Urocánico , Histidina , Iones , Simulación de Dinámica Molecular , Teoría Cuántica
3.
J Chem Inf Model ; 62(17): 4049-4056, 2022 09 12.
Artículo en Inglés | MEDLINE | ID: mdl-36043939

RESUMEN

Competitive Fragmentation Modeling for Metabolite Identification (CFM-ID) is a machine learning tool to predict in silico tandem mass spectra (MS/MS) for known or suspected metabolites for which chemical reference standards are not available. As a machine learning tool, it relies on both an underlying statistical model and an explicit training set that encompasses experimental mass spectra for specific compounds. Such mass spectra depend on specific parameters such as collision energies, instrument types, and adducts which are accumulated in libraries. Yet, ultimately prediction tools that are meant to cover wide expanses of entities must be validated on cases that were not included in the initial training and testing sets. Hence, we here benchmarked the performance of CFM-ID 4.0 to correctly predict MS/MS spectra for spectra that were not included in the CFM-ID training set and for different mass spectrometry conditions. We used 609,456 experimental tandem spectra from the NIST20 mass spectral library that were newly added to the previous NIST17 library version. We found that CFM-ID's highest energy prediction output would maximize the capacity for library generation. Matching the experimental collision energy with CFM-ID's prediction energy produced the best results, even for HCD-Orbitrap instruments. For benzenoids, better MS/MS predictions were achieved than for heterocyclic compounds. However, when exploring CFM-ID's performance on 8,305 compounds at 40 eV HCD-Orbitrap collision energy, >90% of the 20/80 split test compounds showed <700 MS/MS similarity score. Instead of a stand-alone tool, CFM-ID 4.0 might be useful to boost candidate structures in the greater context of identification workflows.


Asunto(s)
Benchmarking , Espectrometría de Masas en Tándem , Biblioteca de Genes , Modelos Estadísticos , Espectrometría de Masas en Tándem/métodos
4.
Commun Biol ; 5(1): 334, 2022 04 07.
Artículo en Inglés | MEDLINE | ID: mdl-35393526

RESUMEN

Identifying the genetic determinants of inter-individual variation in lipid species (lipidome) may provide deeper understanding and additional insight into the mechanistic effect of complex lipidomic pathways in CVD risk and progression beyond simple traditional lipids. Previous studies have been largely population based and thus only powered to discover associations with common genetic variants. Founder populations represent a powerful resource to accelerate discovery of previously unknown biology associated with rare population alleles that have risen to higher frequency due to genetic drift. We performed a genome-wide association scan of 355 lipid species in 650 individuals from the Amish founder population including 127 lipid species not previously tested. To the best of our knowledge, we report for the first time the lipid species associated with two rare-population but Amish-enriched lipid variants: APOB_rs5742904 and APOC3_rs76353203. We also identified novel associations for 3 rare-population Amish-enriched loci with several sphingolipids and with proposed potential functional/causal variant in each locus including GLTPD2_rs536055318, CERS5_rs771033566, and AKNA_rs531892793. We replicated 7 previously known common loci including novel associations with two sterols: androstenediol with UGT locus and estriol with SLC22A8/A24 locus. Our results show the double power of founder populations and detailed lipidome to discover novel trait-associated variants.


Asunto(s)
Amish , Efecto Fundador , Genética de Población , Lipidómica , Amish/genética , Proteínas de Unión al ADN/genética , Estudio de Asociación del Genoma Completo , Humanos , Lípidos , Proteínas Nucleares/genética , Factores de Transcripción/genética
5.
Anal Chem ; 94(6): 2732-2739, 2022 02 15.
Artículo en Inglés | MEDLINE | ID: mdl-35119811

RESUMEN

Acyl-coenzyme A derivatives (acyl-CoAs) are core molecules in the fatty acid and energy metabolism across all species. However, in vivo, many other carboxylic acids can form xenobiotic acyl-CoA esters, including drugs. More than 2467 acyl-CoAs are known from the published literature. In addition, more than 300 acyl-CoAs are covered in pathway databases, but as of October 2020, only 53 experimental acyl-CoA tandem mass spectra are present in NIST20 and MoNA libraries to enable annotation of the mass spectra in untargeted metabolomics studies. The experimental spectra originated from low-resolution ion trap and triple quadrupole mass spectrometers as well as high-resolution quadrupole-time of flight and orbital ion trap instruments at various collision energies. We used MassFrontier software and the literature to annotate fragment ions to generate fragmentation rules and intensities for the different instruments and collision energies. These rules were then applied to 1562 unique species based on [M+H]+ and [M-H]- precursor ions to generate two mass spectra per instrument platform and collision energy, amassing an in silico library of 10,934 accurate mass MS/MS spectra that are freely available at github.com/urikeshet/CoA-Blast. The spectra can be imported into a commercial or freely available mass spectral search tool. We used the libraries to annotate 23 acyl-CoA esters in mouse liver, including 8 novel species.


Asunto(s)
Acilcoenzima A , Espectrometría de Masas en Tándem , Acilcoenzima A/metabolismo , Animales , Hígado/metabolismo , Metabolómica , Ratones , Programas Informáticos
6.
Metabolites ; 12(1)2022 Jan 12.
Artículo en Inglés | MEDLINE | ID: mdl-35050190

RESUMEN

Mass spectrometry is the most commonly used method for compound annotation in metabolomics. However, most mass spectra in untargeted assays cannot be annotated with specific compound structures because reference mass spectral libraries are far smaller than the complement of known molecules. Theoretically predicted mass spectra might be used as a substitute for experimental spectra especially for compounds that are not commercially available. For example, the Quantum Chemistry Electron Ionization Mass Spectra (QCEIMS) method can predict 70 eV electron ionization mass spectra from any given input molecular structure. In this work, we investigated the accuracy of QCEIMS predictions of electron ionization (EI) mass spectra for 80 purine and pyrimidine derivatives in comparison to experimental data in the NIST 17 database. Similarity scores between every pair of predicted and experimental spectra revealed that 45% of the compounds were found as the correct top hit when QCEIMS predicted spectra were matched against the NIST17 library of >267,000 EI spectra, and 74% of the compounds were found within the top 10 hits. We then investigated the impact of matching, missing, and additional fragment ions in predicted EI mass spectra versus ion abundances in MS similarity scores. We further include detailed studies of fragmentation pathways such as retro Diels-Alder reactions to predict neutral losses of (iso)cyanic acid, hydrogen cyanide, or cyanamide in the mass spectra of purines and pyrimidines. We describe how trends in prediction accuracy correlate with the chemistry of the input compounds to better understand how mechanisms of QCEIMS predictions could be improved in future developments. We conclude that QCEIMS is useful for generating large-scale predicted mass spectral libraries for identification of compounds that are absent from experimental libraries and that are not commercially available.

7.
Anal Chem ; 94(3): 1559-1566, 2022 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-35006668

RESUMEN

Chemical derivatization, especially silylation, is widely used in gas chromatography coupled to mass spectrometry (GC-MS). By introducing the trimethylsilyl (TMS) group to substitute active hydrogens in the molecule, thermostable volatile compounds are created that can be easily analyzed. While large GC-MS libraries are available, the number of spectra for TMS-derivatized compounds is comparatively small. In addition, many metabolites cannot be purchased to produce authentic library spectra. Therefore, computationally generated in silico mass spectral databases need to take TMS derivatizations into account for metabolomics. The quantum chemistry method QCEIMS is an automatic method to generate electron ionization (EI) mass spectra directly from compound structures. To evaluate the performance of the QCEIMS method for TMS-derivatized compounds, we chose 816 trimethylsilyl derivatives of organic acids, alcohols, amides, amines, and thiols to compare in silico-generated spectra against the experimental EI mass spectra from the NIST17 library. Overall, in silico spectra showed a weighted dot score similarity (1000 is maximum) of 635 compared to the NIST17 experimental spectra. Aromatic compounds yielded a better prediction accuracy with an average similarity score of 808, while oxygen-containing molecules showed lower accuracy with only an average score of 609. Such similarity scores are useful for annotation of small molecules in untargeted GC-MS-based metabolomics, suggesting that QCEIMS methods can be extended to compounds that are not present in experimental databases. Despite this overall success, 37% of all experimentally observed ions were not found in QCEIMS predictions. We investigated QCEIMS trajectories in detail and found missed fragmentations in specific rearrangement reactions. Such findings open the way forward for future improvements to the QCEIMS software.


Asunto(s)
Electrones , Metabolómica , Cromatografía de Gases y Espectrometría de Masas/métodos , Espectrometría de Masas , Metabolómica/métodos , Programas Informáticos
8.
Nat Methods ; 18(12): 1524-1531, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34857935

RESUMEN

Compound identification in small-molecule research, such as untargeted metabolomics or exposome research, relies on matching tandem mass spectrometry (MS/MS) spectra against experimental or in silico mass spectral libraries. Most software programs use dot product similarity scores. Here we introduce the concept of MS/MS spectral entropy to improve scoring results in MS/MS similarity searches via library matching. Entropy similarity outperformed 42 alternative similarity algorithms, including dot product similarity, when searching 434,287 spectra against the high-quality NIST20 library. Entropy similarity scores proved to be highly robust even when we added different levels of noise ions. When we applied entropy levels to 37,299 experimental spectra of natural products, false discovery rates of less than 10% were observed at entropy similarity score 0.75. Experimental human gut metabolome data were used to confirm that entropy similarity largely improved the accuracy of MS-based annotations in small-molecule research to false discovery rates below 10%, annotated new compounds and provided the basis to automatically flag poor-quality, noisy spectra.


Asunto(s)
Biología Computacional/métodos , Intestinos/metabolismo , Metabolómica/métodos , Espectrometría de Masas en Tándem/métodos , Algoritmos , Cromatografía Liquida/métodos , Simulación por Computador , Entropía , Reacciones Falso Positivas , Humanos , Metaboloma , Curva ROC , Reproducibilidad de los Resultados , Programas Informáticos
9.
Nutrients ; 13(11)2021 Nov 10.
Artículo en Inglés | MEDLINE | ID: mdl-34836252

RESUMEN

Postprandial lipemia (PPL) is an important risk factor for cardiovascular disease. Inter-individual variation in the dietary response to a meal is known to be influenced by genetic factors, yet genes that dictate variation in postprandial lipids are not completely characterized. Genetic studies of the plasma lipidome can help to better understand postprandial metabolism by isolating lipid molecular species which are more closely related to the genome. We measured the plasma lipidome at fasting and 6 h after a standardized high-fat meal in 668 participants from the Genetics of Lipid-Lowering Drugs and Diet Network study (GOLDN) using ultra-performance liquid chromatography coupled to (quadrupole) time-of-flight mass spectrometry. A total of 413 unique lipids were identified. Heritable and responsive lipid species were examined for association with single-nucleotide polymorphisms (SNPs) genotyped on the Affymetrix 6.0 array. The most statistically significant SNP findings were replicated in the Amish Heredity and Phenotype Intervention (HAPI) Heart Study. We further followed up findings from GOLDN with a regional analysis of cytosine-phosphate-guanine (CpGs) sites measured on the Illumina HumanMethylation450 array. A total of 132 lipids were both responsive to the meal challenge and heritable in the GOLDN study. After correction for multiple testing of 132 lipids (α = 5 × 10-8/132 = 4 × 10-10), no SNP was statistically significantly associated with any lipid response. Four SNPs in the region of a known lipid locus (fatty acid desaturase 1 and 2/FADS1 and FADS2) on chromosome 11 had p < 8.0 × 10-7 for arachidonic acid FA(20:4). Those SNPs replicated in HAPI Heart with p < 3.3 × 10-3. CpGs around the FADS1/2 region were associated with arachidonic acid and the relationship of one SNP was partially mediated by a CpG (p = 0.005). Both SNPs and CpGs from the fatty acid desaturase region on chromosome 11 contribute jointly and independently to the diet response to a high-fat meal.


Asunto(s)
Genómica , Hipolipemiantes/farmacología , Lipidómica , Periodo Posprandial/efectos de los fármacos , Periodo Posprandial/genética , Adulto , Anciano , delta-5 Desaturasa de Ácido Graso/genética , Ácido Graso Desaturasas/genética , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Lípidos , Masculino , Comidas , Persona de Mediana Edad , Fenotipo , Plasma , Polimorfismo de Nucleótido Simple
10.
Medicine (Baltimore) ; 100(30): e26588, 2021 Jul 30.
Artículo en Inglés | MEDLINE | ID: mdl-34397689

RESUMEN

ABSTRACT: High cardiovascular disease risk in people living with HIV is partly attributed to antiretroviral therapy (ART). Lipid response to ART has been extensively studied, yet, little is known how small molecule lipids respond to Integrase inhibitor-based (INSTI-based) compared to Protease inhibitor-based (PI-based) ART regimens.Ancillary study to a phase 3, randomized, open-label trial [AIDS Clinical Trial Group A5257 Study] in treatment-naive HIV-infected patients randomized in a 1:1:1 ratio to receive ritonavir-boosted atazanavir (ATV/r), ritonavir-boosted darunavir (DRV/r) (both PI-based), or raltegravir with Tenofovir Disoproxil Fumarate-TDF plus emtricitabine (RAL, INSTI-based).We examined small molecule lipid response in a subcohort of 75 participants. Lipidomic assays of plasma samples collected pre- and post-ART treatment (48 weeks) were conducted using ultra-performance liquid chromatography coupled to time-of-flight mass spectrometry. The effect of ART regimens was regressed on lipid species response adjusting for the baseline covariates (lipids, age, sex, race, CD4 level, BMI, and smoking). Results were validated in the Centers for AIDS Research Network of Integrated Clinical Systems study (N = 16).Out of 417 annotated lipids, glycerophospholipids (P = .007) and sphingolipids (P = .028) had a higher response to ATV/r and DRV/r compared to RAL. The lysophosphatidylcholine (LPCs(16:1),(17:1),(20:3)) and phosphophatidylcholine species (PCs(40:7),(38:4)) had an opposite response to RAL versus ATV/r in the discovery and validation cohort. The INSTI-based regimen had an opposite response of ceramide species ((d38:1), (d42:2)), PCs((35:2), (38:4)), phosphatidylethanolamines (PEs(38:4), (38:6)), and sphingomyelin(SMd38:1) species compared with the PI-based regimens. There were no differences observed between 2 PI-based regimens.We observed differences in response of small molecule lipid species by ART regimens in treatment-naive people living with HIV.


Asunto(s)
Antirretrovirales/efectos adversos , Lipidómica/métodos , Adulto , Antirretrovirales/uso terapéutico , Sulfato de Atazanavir/efectos adversos , Sulfato de Atazanavir/uso terapéutico , Distribución de Chi-Cuadrado , Darunavir/efectos adversos , Darunavir/uso terapéutico , Femenino , Infecciones por VIH/tratamiento farmacológico , Infecciones por VIH/fisiopatología , Humanos , Lipidómica/estadística & datos numéricos , Lípidos , Masculino , Espectrometría de Masas/métodos , Persona de Mediana Edad , Plasma/efectos de los fármacos , Raltegravir Potásico/efectos adversos , Raltegravir Potásico/uso terapéutico , Tenofovir/efectos adversos , Tenofovir/uso terapéutico
11.
Chem Rev ; 121(10): 5633-5670, 2021 05 26.
Artículo en Inglés | MEDLINE | ID: mdl-33979149

RESUMEN

A primary goal of metabolomics studies is to fully characterize the small-molecule composition of complex biological and environmental samples. However, despite advances in analytical technologies over the past two decades, the majority of small molecules in complex samples are not readily identifiable due to the immense structural and chemical diversity present within the metabolome. Current gold-standard identification methods rely on reference libraries built using authentic chemical materials ("standards"), which are not available for most molecules. Computational quantum chemistry methods, which can be used to calculate chemical properties that are then measured by analytical platforms, offer an alternative route for building reference libraries, i.e., in silico libraries for "standards-free" identification. In this review, we cover the major roadblocks currently facing metabolomics and discuss applications where quantum chemistry calculations offer a solution. Several successful examples for nuclear magnetic resonance spectroscopy, ion mobility spectrometry, infrared spectroscopy, and mass spectrometry methods are reviewed. Finally, we consider current best practices, sources of error, and provide an outlook for quantum chemistry calculations in metabolomics studies. We expect this review will inspire researchers in the field of small-molecule identification to accelerate adoption of in silico methods for generation of reference libraries and to add quantum chemistry calculations as another tool at their disposal to characterize complex samples.


Asunto(s)
Metabolómica , Teoría Cuántica
12.
Lipids Health Dis ; 20(1): 30, 2021 Apr 03.
Artículo en Inglés | MEDLINE | ID: mdl-33812378

RESUMEN

BACKGROUND: Developing an understanding of the biochemistry of aging in both sexes is critical for managing disease throughout the lifespan. Lipidomic associations with age and sex have been reported, but prior studies are limited by measurements in serum rather than plasma or by participants taking lipid-lowering medications. METHODS: Our study included lipidomic data from 980 participants aged 18-87 years old from the Genetics of Lipid-Lowering Drugs and Diet Network (GOLDN). Participants were off lipid-lowering medications for at least 4 weeks, and signal intensities of 413 known lipid species were measured in plasma. We examined linear age and sex associations with signal intensity of (a) 413 lipid species; (b) 6 lipid classes (glycerolipids, glycerophospholipids, sphingolipids, sterol lipids, fatty acids, and acylcarnitines); and (c) 15 lipid subclasses; as well as with the particle sizes of three lipoproteins. RESULTS: Significant age associations were identified in 4 classes, 11 subclasses, 147 species, and particle size of one lipoprotein while significant sex differences were identified in 5 classes, 12 subclasses, 248 species, and particle sizes of two lipoproteins. For many lipid species (n = 97), age-related associations were significantly different between males and females. Age*sex interaction effects were most prevalent among phosphatidylcholines, sphingomyelins, and triglycerides. CONCLUSION: We identified several lipid species, subclasses, and classes that differ by age and sex; these lipid phenotypes may serve as useful biomarkers for lipid changes and associated cardiovascular risk with aging in the future. Future studies of age-related changes throughout the adult lifespan of both sexes are warranted. TRIAL REGISTRATION: ClinicalTrials.gov NCT00083369 ; May 21, 2004.


Asunto(s)
Lipidómica , Lípidos/sangre , Caracteres Sexuales , Adolescente , Adulto , Factores de Edad , Anciano , Anciano de 80 o más Años , Femenino , Humanos , Lípidos/clasificación , Lipoproteínas/química , Masculino , Persona de Mediana Edad , Tamaño de la Partícula , Adulto Joven
13.
J Cheminform ; 12(1): 63, 2020 Oct 20.
Artículo en Inglés | MEDLINE | ID: mdl-33372633

RESUMEN

Compound identification by mass spectrometry needs reference mass spectra. While there are over 102 million compounds in PubChem, less than 300,000 curated electron ionization (EI) mass spectra are available from NIST or MoNA mass spectral databases. Here, we test quantum chemistry methods (QCEIMS) to generate in silico EI mass spectra (MS) by combining molecular dynamics (MD) with statistical methods. To test the accuracy of predictions, in silico mass spectra of 451 small molecules were generated and compared to experimental spectra from the NIST 17 mass spectral library. The compounds covered 43 chemical classes, ranging up to 358 Da. Organic oxygen compounds had a lower matching accuracy, while computation time exponentially increased with molecular size. The parameter space was probed to increase prediction accuracy including initial temperatures, the number of MD trajectories and impact excess energy (IEE). Conformational flexibility was not correlated to the accuracy of predictions. Overall, QCEIMS can predict 70 eV electron ionization spectra of chemicals from first principles. Improved methods to calculate potential energy surfaces (PES) are still needed before QCEIMS mass spectra of novel molecules can be generated at large scale.

14.
Lipids Health Dis ; 19(1): 153, 2020 Jun 25.
Artículo en Inglés | MEDLINE | ID: mdl-32586392

RESUMEN

BACKGROUND: The lipoprotein insulin resistance (LPIR) score was shown to predict insulin resistance (IR) and type 2 diabetes (T2D) in healthy adults. However, the molecular basis underlying the LPIR utility for classification remains unclear. OBJECTIVE: To identify small molecule lipids associated with variation in the LPIR score, a weighted index of lipoproteins measured by nuclear magnetic resonance, in the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) study (n = 980). METHODS: Linear mixed effects models were used to test the association between the LPIR score and 413 lipid species and their principal component analysis-derived groups. Significant associations were tested for replication with homeostatic model assessment-IR (HOMA-IR), a phenotype correlated with the LPIR score (r = 0.48, p <  0.001), in the Heredity and Phenotype Intervention (HAPI) Heart Study (n = 590). RESULTS: In GOLDN, 319 lipids were associated with the LPIR score (false discovery rate-adjusted p-values ranging from 4.59 × 10- 161 to 49.50 × 10- 3). Factors 1 (triglycerides and diglycerides/storage lipids) and 3 (mixed lipids) were positively (ß = 0.025, p = 4.52 × 10- 71 and ß = 0.021, p = 5.84 × 10- 41, respectively) and factor 2 (phospholipids/non-storage lipids) was inversely (ß = - 0.013, p = 2.28 × 10- 18) associated with the LPIR score. These findings were replicated for HOMA-IR in the HAPI Heart Study (ß = 0.10, p = 1.21 × 10- 02 for storage, ß = - 0.13, p = 3.14 × 10- 04 for non-storage, and ß = 0.19, p = 8.40 × 10- 07 for mixed lipids). CONCLUSIONS: Non-storage lipidomics species show a significant inverse association with the LPIR metabolic dysfunction score and present a promising focus for future therapeutic and prevention studies.


Asunto(s)
Resistencia a la Insulina/fisiología , Lípidos/sangre , Adulto , Anciano , Índice de Masa Corporal , Diabetes Mellitus Tipo 2/sangre , Femenino , Humanos , Lipidómica , Lipoproteínas/sangre , Masculino , Persona de Mediana Edad , Triglicéridos/sangre , Circunferencia de la Cintura
15.
Anal Chem ; 92(11): 7515-7522, 2020 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-32390414

RESUMEN

Unidentified peaks remain a major problem in untargeted metabolomics by LC-MS/MS. Confidence in peak annotations increases by combining MS/MS matching and retention time. We here show how retention times can be predicted from molecular structures. Two large, publicly available data sets were used for model training in machine learning: the Fiehn hydrophilic interaction liquid chromatography data set (HILIC) of 981 primary metabolites and biogenic amines,and the RIKEN plant specialized metabolome annotation (PlaSMA) database of 852 secondary metabolites that uses reversed-phase liquid chromatography (RPLC). Five different machine learning algorithms have been integrated into the Retip R package: the random forest, Bayesian-regularized neural network, XGBoost, light gradient-boosting machine (LightGBM), and Keras algorithms for building the retention time prediction models. A complete workflow for retention time prediction was developed in R. It can be freely downloaded from the GitHub repository (https://www.retip.app). Keras outperformed other machine learning algorithms in the test set with minimum overfitting, verified by small error differences between training, test, and validation sets. Keras yielded a mean absolute error of 0.78 min for HILIC and 0.57 min for RPLC. Retip is integrated into the mass spectrometry software tools MS-DIAL and MS-FINDER, allowing a complete compound annotation workflow. In a test application on mouse blood plasma samples, we found a 68% reduction in the number of candidate structures when searching all isomers in MS-FINDER compound identification software. Retention time prediction increases the identification rate in liquid chromatography and subsequently leads to an improved biological interpretation of metabolomics data.


Asunto(s)
Aprendizaje Automático , Metabolómica , Compuestos Orgánicos/sangre , Cromatografía Liquida , Humanos , Espectrometría de Masas en Tándem , Factores de Tiempo
16.
Anal Chem ; 92(8): 5960-5968, 2020 04 21.
Artículo en Inglés | MEDLINE | ID: mdl-32202765

RESUMEN

Fatty acid esters of hydroxy fatty acids (FAHFAs) are a family of recently discovered lipids with important physiological functions in mammals and plants. However, low detection sensitivity in negative ionization mode mass spectrometry makes low-abundance FAHFA challenging to analyze. A 2-dimethylaminoethylamine (DMED) based chemical derivatization strategy was recently reported to improve the MS sensitivity of FAHFAs by labeling FAHFAs with a positively ionizable tertiary amine group. To facilitate reliable, high-throughput, and automatic annotation of these compounds, a DMED-FAHFA in silico library containing 4290 high-resolution tandem mass spectra covering 264 different FAHFA classes was developed. The construction of the library was based on the heuristic information from MS/MS fragmentation patterns of DMED-FAHFA authentic standards, and then, the patterns were applied to computer-generated DMED-FAHFAs. The developed DMED-FAHFA in silico library was demonstrated to be compatible with library search software NIST MS Search and the LC-MS/MS data processing tool MS-DIAL to guarantee high-throughput and automatic annotations. Applying the in silico library in Arabidopsis thaliana samples for profiling FAHFAs by high-resolution LC-MS/MS enabled the annotation of 19 DMED-FAHFAs from 16 families, including 3 novel compounds. Using the in silico library largely decreased the false-positive annotation rate in comparison to low-resolution LC-MS/MS. The developed library, MS/MS spectra, and development templates are freely available for commercial and noncommercial use at https://zenodo.org/record/3606905.


Asunto(s)
Ésteres/análisis , Etilaminas/química , Ácidos Grasos/análisis , Estructura Molecular , Espectrometría de Masas en Tándem
17.
Metabolites ; 9(5)2019 May 22.
Artículo en Inglés | MEDLINE | ID: mdl-31121816

RESUMEN

Mouse knockouts facilitate the study ofgene functions. Often, multiple abnormal phenotypes are induced when a gene is inactivated. The International Mouse Phenotyping Consortium (IMPC) has generated thousands of mouse knockouts and catalogued their phenotype data. We have acquired metabolomics data from 220 plasma samples from 30 unique mouse gene knockouts and corresponding wildtype mice from the IMPC. To acquire comprehensive metabolomics data, we have used liquid chromatography (LC) combined with mass spectrometry (MS) for detecting polar and lipophilic compounds in an untargeted approach. We have also used targeted methods to measure bile acids, steroids and oxylipins. In addition, we have used gas chromatography GC-TOFMS for measuring primary metabolites. The metabolomics dataset reports 832 unique structurally identified metabolites from 124 chemical classes as determined by ChemRICH software. The GCMS and LCMS raw data files, intermediate and finalized data matrices, R-Scripts, annotation databases, and extracted ion chromatograms are provided in this data descriptor. The dataset can be used for subsequent studies to link genetic variants with molecular mechanisms and phenotypes.

18.
Anal Chem ; 91(5): 3590-3596, 2019 03 05.
Artículo en Inglés | MEDLINE | ID: mdl-30758187

RESUMEN

Large-scale untargeted lipidomics experiments involve the measurement of hundreds to thousands of samples. Such data sets are usually acquired on one instrument over days or weeks of analysis time. Such extensive data acquisition processes introduce a variety of systematic errors, including batch differences, longitudinal drifts, or even instrument-to-instrument variation. Technical data variance can obscure the true biological signal and hinder biological discoveries. To combat this issue, we present a novel normalization approach based on using quality control pool samples (QC). This method is called systematic error removal using random forest (SERRF) for eliminating the unwanted systematic variations in large sample sets. We compared SERRF with 15 other commonly used normalization methods using six lipidomics data sets from three large cohort studies (832, 1162, and 2696 samples). SERRF reduced the average technical errors for these data sets to 5% relative standard deviation. We conclude that SERRF outperforms other existing methods and can significantly reduce the unwanted systematic variation, revealing biological variance of interest.


Asunto(s)
Conjuntos de Datos como Asunto/normas , Lipidómica/normas , Control de Calidad , Error Científico Experimental/estadística & datos numéricos
19.
ACS Chem Neurosci ; 10(3): 1369-1379, 2019 03 20.
Artículo en Inglés | MEDLINE | ID: mdl-30698015

RESUMEN

The classical small molecule neurotransmitters are essential for cell-cell signaling in the nervous system for regulation of behaviors and physiological functions. Metabolomics approaches are ideal for quantitative analyses of neurotransmitter profiles but have not yet been achieved for the repertoire of 14 classical neurotransmitters. Therefore, this study developed targeted metabolomics analyses by full scan gas chromatography/time-of-flight mass spectrometry (GC-TOF) and hydrophilic interaction chromatography-QTRAP mass spectrometry (HILIC-MS/MS) operated in positive ionization mode for identification and quantitation of 14 neurotransmitters consisting of acetylcholine, adenosine, anandamide, aspartate, dopamine, epinephrine, GABA, glutamate, glycine, histamine, melatonin, norepinephrine, serine, and serotonin. GC-TOF represents a new metabolomics method for neurotransmitter analyses. Sensitive measurements of 11 neurotransmitters were achieved by GC-TOF, and three neurotransmitters were analyzed by LC-MS/MS (acetylcholine, anandamide, and melatonin). The limits of detection (LOD) and limits of quantitation (LOQ) were assessed for linearity for GC-TOF and LC-MS/MS protocols. In neurotransmitter-containing dense core secretory vesicles of adrenal medulla, known as chromaffin granules (CG), metabolomics measured the concentrations of 9 neurotransmitters consisting of the catecholamines dopamine, norepinephrine, and epinephrine, combined with glutamate, serotonin, adenosine, aspartate, glycine, and serine. The CG neurotransmitters were constitutively secreted from sympathoadrenal chromaffin cells in culture. Nicotine- and KCl-stimulated release of the catecholamines and adenosine. Lithium, a drug used for the treatment of bipolar disorder, decreased the constitutive secretion of dopamine and norepinephrine and decreased nicotine-stimulated secretion of epinephrine. Lithium had no effect on other secreted neurotransmitters. Overall, the newly developed GC-TOF with LC-MS/MS metabolomics methods for analyses of 14 neurotransmitters will benefit investigations of neurotransmitter regulation in biological systems and in human disease conditions related to drug treatments.


Asunto(s)
Comunicación Celular/fisiología , Células Cromafines/química , Litio/farmacología , Metabolómica/métodos , Neurotransmisores/análisis , Espectrometría de Masas en Tándem/métodos , Glándulas Suprarrenales/química , Glándulas Suprarrenales/efectos de los fármacos , Glándulas Suprarrenales/metabolismo , Animales , Bovinos , Comunicación Celular/efectos de los fármacos , Células Cromafines/efectos de los fármacos , Células Cromafines/metabolismo , Cromatografía de Gases/métodos , Cromatografía Liquida/métodos , Neurotransmisores/metabolismo , Paraganglios Cromafines/química , Paraganglios Cromafines/efectos de los fármacos , Paraganglios Cromafines/metabolismo , Transducción de Señal/efectos de los fármacos , Transducción de Señal/fisiología
20.
Anal Chem ; 91(3): 2155-2162, 2019 02 05.
Artículo en Inglés | MEDLINE | ID: mdl-30608141

RESUMEN

Urine metabolites are used in many clinical and biomedical studies but usually only for a few classic compounds. Metabolomics detects vastly more metabolic signals that may be used to precisely define the health status of individuals. However, many compounds remain unidentified, hampering biochemical conclusions. Here, we annotate all metabolites detected by two untargeted metabolomic assays, hydrophilic interaction chromatography (HILIC)-Q Exactive HF mass spectrometry and charged surface hybrid (CSH)-Q Exactive HF mass spectrometry. Over 9,000 unique metabolite signals were detected, of which 42% triggered MS/MS fragmentations in data-dependent mode. On the highest Metabolomics Standards Initiative (MSI) confidence level 1, we identified 175 compounds using authentic standards with precursor mass, retention time, and MS/MS matching. An additional 578 compounds were annotated by precursor accurate mass and MS/MS matching alone, MSI level 2, including a novel library specifically geared at acylcarnitines (CarniBlast). The rest of the metabolome is usually left unannotated. To fill this gap, we used the in silico fragmentation tool CSI:FingerID and the new NIST hybrid search to annotate all further compounds (MSI level 3). Testing the top-ranked metabolites in CSI:Finger ID annotations yielded 40% accuracy when applied to the MSI level 1 identified compounds. We classified all MSI level 3 annotations by the NIST hybrid search using the ClassyFire ontology into 21 superclasses that were further distinguished into 184 chemical classes. ClassyFire annotations showed that the previously unannotated urine metabolome consists of 28% derivatives of organic acids, 16% heterocyclics, and 16% lipids as major classes.


Asunto(s)
Carnitina/metabolismo , Metabolómica , Carnitina/análogos & derivados , Carnitina/orina , Cromatografía Líquida de Alta Presión , Humanos , Interacciones Hidrofóbicas e Hidrofílicas , Espectrometría de Masas , Fenotipo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...