Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Nat Methods ; 18(12): 1524-1531, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34857935

RESUMEN

Compound identification in small-molecule research, such as untargeted metabolomics or exposome research, relies on matching tandem mass spectrometry (MS/MS) spectra against experimental or in silico mass spectral libraries. Most software programs use dot product similarity scores. Here we introduce the concept of MS/MS spectral entropy to improve scoring results in MS/MS similarity searches via library matching. Entropy similarity outperformed 42 alternative similarity algorithms, including dot product similarity, when searching 434,287 spectra against the high-quality NIST20 library. Entropy similarity scores proved to be highly robust even when we added different levels of noise ions. When we applied entropy levels to 37,299 experimental spectra of natural products, false discovery rates of less than 10% were observed at entropy similarity score 0.75. Experimental human gut metabolome data were used to confirm that entropy similarity largely improved the accuracy of MS-based annotations in small-molecule research to false discovery rates below 10%, annotated new compounds and provided the basis to automatically flag poor-quality, noisy spectra.


Asunto(s)
Biología Computacional/métodos , Intestinos/metabolismo , Metabolómica/métodos , Espectrometría de Masas en Tándem/métodos , Algoritmos , Cromatografía Liquida/métodos , Simulación por Computador , Entropía , Reacciones Falso Positivas , Humanos , Metaboloma , Curva ROC , Reproducibilidad de los Resultados , Programas Informáticos
2.
Nat Methods ; 15(1): 53-56, 2018 01.
Artículo en Inglés | MEDLINE | ID: mdl-29176591

RESUMEN

Novel metabolites distinct from canonical pathways can be identified through the integration of three cheminformatics tools: BinVestigate, which queries the BinBase gas chromatography-mass spectrometry (GC-MS) metabolome database to match unknowns with biological metadata across over 110,000 samples; MS-DIAL 2.0, a software tool for chromatographic deconvolution of high-resolution GC-MS or liquid chromatography-mass spectrometry (LC-MS); and MS-FINDER 2.0, a structure-elucidation program that uses a combination of 14 metabolome databases in addition to an enzyme promiscuity library. We showcase our workflow by annotating N-methyl-uridine monophosphate (UMP), lysomonogalactosyl-monopalmitin, N-methylalanine, and two propofol derivatives.


Asunto(s)
Proteínas Sanguíneas/metabolismo , Biología Computacional/métodos , Bases de Datos Factuales , Cromatografía de Gases y Espectrometría de Masas/métodos , Metaboloma , Metabolómica/métodos , Programas Informáticos , Bacterias/metabolismo , Cromatografía Liquida , Heces/química , Humanos
3.
Anal Chem ; 91(3): 2155-2162, 2019 02 05.
Artículo en Inglés | MEDLINE | ID: mdl-30608141

RESUMEN

Urine metabolites are used in many clinical and biomedical studies but usually only for a few classic compounds. Metabolomics detects vastly more metabolic signals that may be used to precisely define the health status of individuals. However, many compounds remain unidentified, hampering biochemical conclusions. Here, we annotate all metabolites detected by two untargeted metabolomic assays, hydrophilic interaction chromatography (HILIC)-Q Exactive HF mass spectrometry and charged surface hybrid (CSH)-Q Exactive HF mass spectrometry. Over 9,000 unique metabolite signals were detected, of which 42% triggered MS/MS fragmentations in data-dependent mode. On the highest Metabolomics Standards Initiative (MSI) confidence level 1, we identified 175 compounds using authentic standards with precursor mass, retention time, and MS/MS matching. An additional 578 compounds were annotated by precursor accurate mass and MS/MS matching alone, MSI level 2, including a novel library specifically geared at acylcarnitines (CarniBlast). The rest of the metabolome is usually left unannotated. To fill this gap, we used the in silico fragmentation tool CSI:FingerID and the new NIST hybrid search to annotate all further compounds (MSI level 3). Testing the top-ranked metabolites in CSI:Finger ID annotations yielded 40% accuracy when applied to the MSI level 1 identified compounds. We classified all MSI level 3 annotations by the NIST hybrid search using the ClassyFire ontology into 21 superclasses that were further distinguished into 184 chemical classes. ClassyFire annotations showed that the previously unannotated urine metabolome consists of 28% derivatives of organic acids, 16% heterocyclics, and 16% lipids as major classes.


Asunto(s)
Carnitina/metabolismo , Metabolómica , Carnitina/análogos & derivados , Carnitina/orina , Cromatografía Líquida de Alta Presión , Humanos , Interacciones Hidrofóbicas e Hidrofílicas , Espectrometría de Masas , Fenotipo
4.
Mass Spectrom Rev ; 37(4): 513-532, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-28436590

RESUMEN

Tandem mass spectral library search (MS/MS) is the fastest way to correctly annotate MS/MS spectra from screening small molecules in fields such as environmental analysis, drug screening, lipid analysis, and metabolomics. The confidence in MS/MS-based annotation of chemical structures is impacted by instrumental settings and requirements, data acquisition modes including data-dependent and data-independent methods, library scoring algorithms, as well as post-curation steps. We critically discuss parameters that influence search results, such as mass accuracy, precursor ion isolation width, intensity thresholds, centroiding algorithms, and acquisition speed. A range of publicly and commercially available MS/MS databases such as NIST, MassBank, MoNA, LipidBlast, Wiley MSforID, and METLIN are surveyed. In addition, software tools including NIST MS Search, MS-DIAL, Mass Frontier, SmileMS, Mass++, and XCMS2 to perform fast MS/MS search are discussed. MS/MS scoring algorithms and challenges during compound annotation are reviewed. Advanced methods such as the in silico generation of tandem mass spectra using quantum chemistry and machine learning methods are covered. Community efforts for curation and sharing of tandem mass spectra that will allow for faster distribution of scientific discoveries are discussed.


Asunto(s)
Aprendizaje Automático , Bibliotecas de Moléculas Pequeñas/aislamiento & purificación , Programas Informáticos , Espectrometría de Masas en Tándem/estadística & datos numéricos , Simulación por Computador , Bases de Datos de Compuestos Químicos , Humanos , Difusión de la Información , Modelos Químicos , Teoría Cuántica , Espectrometría de Masas en Tándem/instrumentación , Espectrometría de Masas en Tándem/métodos
5.
Anal Chem ; 90(18): 10758-10764, 2018 09 18.
Artículo en Inglés | MEDLINE | ID: mdl-30096227

RESUMEN

Unknown metabolites represent a bottleneck in untargeted metabolomics research. Ion mobility-mass spectrometry (IM-MS) facilitates lipid identification because it yields collision cross section (CCS) information that is independent from mass or lipophilicity. To date, only a few CCS values are publicly available for complex lipids such as phosphatidylcholines, sphingomyelins, or triacylglycerides. This scarcity of data limits the use of CCS values as an identification parameter that is orthogonal to mass, MS/MS, or retention time. A combination of lipid descriptors was used to train five different machine learning algorithms for automatic lipid annotations, combining accurate mass ( m/ z), retention time (RT), CCS values, carbon number, and unsaturation level. Using a training data set of 429 true positive lipid annotations from four lipid classes, 92.7% correct annotations overall were achieved using internal cross-validation. The trained prediction model was applied to an unknown milk lipidomics data set and allowed for class 3 level annotations of most features detected in this application set according to Metabolomics Standards Initiative (MSI) reporting guidelines.


Asunto(s)
Cromatografía Liquida/métodos , Espectrometría de Movilidad Iónica/métodos , Lípidos/química , Algoritmos , Animales , Bovinos , Bases de Datos Factuales , Metabolómica , Leche/química , Reproducibilidad de los Resultados
6.
Anal Chem ; 89(6): 3250-3255, 2017 03 21.
Artículo en Inglés | MEDLINE | ID: mdl-28225594

RESUMEN

Untargeted metabolomics by liquid chromatography-mass spectrometry generates data-rich chromatograms in the form of m/z-retention time features. Managing such datasets is a bottleneck. Many popular data processing tools, including XCMS-online and MZmine2, yield numerous false-positive peak detections. Flagging and removing such false peaks manually is a time-consuming task and prone to human error. We present a web application, Mass Spectral Feature List Optimizer (MS-FLO), to improve the quality of feature lists after initial processing to expedite the process of data curation. The tool utilizes retention time alignments, accurate mass tolerances, Pearson's correlation analysis, and peak height similarity to identify ion adducts, duplicate peak reports, and isotopic features of the main monoisotopic metabolites. Removing such erroneous peaks reduces the overall number of metabolites in data reports and improves the quality of subsequent statistical investigations. To demonstrate the effectiveness of MS-FLO, we processed 28 biological studies and uploaded raw and results data to the Metabolomics Workbench website ( www.metabolomicsworkbench.org ), encompassing 1481 chromatograms produced by two different data processing programs used in-house (MZmine2 and later MS-DIAL). Post-processing of datasets with MS-FLO yielded a 7.8% automated reduction of total peak features and flagged an additional 7.9% of features, per dataset, for review by the user. When manually curated, 87% of these additional flagged features were verified false positives. MS-FLO is an open source web application that is freely available for use at http://msflo.fiehnlab.ucdavis.edu .


Asunto(s)
Metabolómica , Programas Informáticos , Cromatografía Liquida , Reacciones Falso Positivas , Humanos , Espectrometría de Masas
7.
Nat Commun ; 12(1): 6021, 2021 10 15.
Artículo en Inglés | MEDLINE | ID: mdl-34654818

RESUMEN

The mammalian brain relies on neurochemistry to fulfill its functions. Yet, the complexity of the brain metabolome and its changes during diseases or aging remain poorly understood. Here, we generate a metabolome atlas of the aging wildtype mouse brain from 10 anatomical regions spanning from adolescence to old age. We combine data from three assays and structurally annotate 1,547 metabolites. Almost all metabolites significantly differ between brain regions or age groups, but not by sex. A shift in sphingolipid patterns during aging related to myelin remodeling is accompanied by large changes in other metabolic pathways. Functionally related brain regions (brain stem, cerebrum and cerebellum) are also metabolically similar. In cerebrum, metabolic correlations markedly weaken between adolescence and adulthood, whereas at old age, cross-region correlation patterns reflect decreased brain segregation. We show that metabolic changes can be mapped to existing gene and protein brain atlases. The brain metabolome atlas is publicly available ( https://mouse.atlas.metabolomics.us/ ) and serves as a foundation dataset for future metabolomic studies.


Asunto(s)
Envejecimiento/metabolismo , Encéfalo/metabolismo , Metaboloma , Animales , Cerebelo/metabolismo , Femenino , Masculino , Redes y Vías Metabólicas , Metabolómica , Ratones , Esfingolípidos
8.
J Cheminform ; 9(1): 32, 2017 May 25.
Artículo en Inglés | MEDLINE | ID: mdl-29086039

RESUMEN

In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA