RESUMO
We report XCMS-MRM and METLIN-MRM ( http://xcmsonline-mrm.scripps.edu/ and http://metlin.scripps.edu/ ), a cloud-based data-analysis platform and a public multiple-reaction monitoring (MRM) transition repository for small-molecule quantitative tandem mass spectrometry. This platform provides MRM transitions for more than 15,500 molecules and facilitates data sharing across different instruments and laboratories.
Assuntos
Computação em Nuvem , Bibliotecas de Moléculas Pequenas/química , Cromatografia Líquida/métodos , Biologia Computacional , Metabolômica , Espectrometria de Massas em TandemRESUMO
Comprehensive metabolomic data can be achieved using multiple orthogonal separation and mass spectrometry (MS) analytical techniques. However, drawing biologically relevant conclusions from this data and combining it with additional layers of information collected by other omic technologies present a significant bioinformatic challenge. To address this, a data processing approach was designed to automate the comprehensive prediction of dysregulated metabolic pathways/networks from multiple data sources. The platform autonomously integrates multiple MS-based metabolomics data types without constraints due to different sample preparation/extraction, chromatographic separation, or MS detection method. This multimodal analysis streamlines the extraction of biological information from the metabolomics data as well as the contextualization within proteomics and transcriptomics data sets. As a proof of concept, this multimodal analysis approach was applied to a colorectal cancer (CRC) study, in which complementary liquid chromatography-mass spectrometry (LC-MS) data were combined with proteomic and transcriptomic data. Our approach provided a highly resolved overview of colon cancer metabolic dysregulation, with an average 17% increase of detected dysregulated metabolites per pathway and an increase in metabolic pathway prediction confidence. Moreover, 95% of the altered metabolic pathways matched with the dysregulated genes and proteins, providing additional validation at a systems level. The analysis platform is currently available via the XCMS Online ( XCMSOnline.scripps.edu ).
Assuntos
Neoplasias Colorretais/metabolismo , Redes e Vias Metabólicas , Metabolômica/métodos , Biologia de Sistemas/métodos , Cromatografia Líquida/métodos , Neoplasias Colorretais/genética , Biologia Computacional/métodos , Genômica/métodos , Humanos , Espectrometria de Massas em Tandem/métodos , TranscriptomaRESUMO
The speed and throughput of analytical platforms has been a driving force in recent years in the "omics" technologies and while great strides have been accomplished in both chromatography and mass spectrometry, data analysis times have not benefited at the same pace. Even though personal computers have become more powerful, data transfer times still represent a bottleneck in data processing because of the increasingly complex data files and studies with a greater number of samples. To meet the demand of analyzing hundreds to thousands of samples within a given experiment, we have developed a data streaming platform, XCMS Stream, which capitalizes on the acquisition time to compress and stream recently acquired data files to data processing servers, mimicking just-in-time production strategies from the manufacturing industry. The utility of this XCMS Online-based technology is demonstrated here in the analysis of T cell metabolism and other large-scale metabolomic studies. A large scale example on a 1000 sample data set demonstrated a 10â¯000-fold time savings, reducing data analysis time from days to minutes. Further, XCMS Stream has the capability to increase the efficiency of downstream biochemical dependent data acquisition (BDDA) analysis by initiating data conversion and data processing on subsets of data acquired, expanding its application beyond data transfer to smart preliminary data decision-making prior to full acquisition.
Assuntos
Compressão de Dados/métodos , Mineração de Dados/métodos , Metabolômica/métodos , Linfócitos T/metabolismo , Compressão de Dados/economia , Mineração de Dados/economia , Humanos , Metabolômica/economia , Software , Fatores de Tempo , Fluxo de TrabalhoRESUMO
Concurrent exposure to a wide variety of xenobiotics and their combined toxic effects can play a pivotal role in health and disease, yet are largely unexplored. Investigating the totality of these exposures, i.e., the "exposome", and their specific biological effects constitutes a new paradigm for environmental health but still lacks high-throughput, user-friendly technology. We demonstrate the utility of mass spectrometry-based global exposure metabolomics combined with tailored database queries and cognitive computing for comprehensive exposure assessment and the straightforward elucidation of biological effects. The METLIN Exposome database has been redesigned to help identify environmental toxicants, food contaminants and supplements, drugs, and antibiotics as well as their biotransformation products, through its expansion with over 700 000 chemical structures to now include more than 950 000 unique small molecules. More importantly, we demonstrate how the XCMS/METLIN platform now allows for the readout of the biological effect of a toxicant through metabolomic-derived pathway analysis, and further, artificial intelligence provides a means of assessing the role of a potential toxicant. The presented workflow addresses many of the methodological challenges current exposomics research is facing and will serve to gain a deeper understanding of the impact of environmental exposures and combinatory toxic effects on human health.
Assuntos
Inteligência Artificial , Metabolômica/métodos , Bases de Dados Genéticas , Genômica , Humanos , MasculinoRESUMO
Active data screening is an integral part of many scientific activities, and mobile technologies have greatly facilitated this process by minimizing the reliance on large hardware instrumentation. In order to meet with the increasingly growing field of metabolomics and heavy workload of data processing, we designed the first remote metabolomic data screening platform for mobile devices. Two mobile applications (apps), XCMS Mobile and METLIN Mobile, facilitate access to XCMS and METLIN, which are the most important components in the computer-based XCMS Online platforms. These mobile apps allow for the visualization and analysis of metabolic data throughout the entire analytical process. Specifically, XCMS Mobile and METLIN Mobile provide the capabilities for remote monitoring of data processing, real time notifications for the data processing, visualization and interactive analysis of processed data (e.g., cloud plots, principle component analysis, box-plots, extracted ion chromatograms, and hierarchical cluster analysis), and database searching for metabolite identification. These apps, available on Apple iOS and Google Android operating systems, allow for the migration of metabolomic research onto mobile devices for better accessibility beyond direct instrument operation. The utility of XCMS Mobile and METLIN Mobile functionalities was developed and is demonstrated here through the metabolomic LC-MS analyses of stem cells, colon cancer, aging, and bacterial metabolism.
Assuntos
Internet , Metabolômica , Aplicativos Móveis , Smartphone , Cromatografia Líquida , Interpretação Estatística de Dados , Humanos , Espectrometria de Massas , Análise de Componente PrincipalRESUMO
MOTIVATION: Metabolite databases provide a unique window into metabolome research allowing the most commonly searched biomarkers to be catalogued. Omic scale metabolite profiling, or metabolomics, is finding increased utility in biomarker discovery largely driven by improvements in analytical technologies and the concurrent developments in bioinformatics. However, the successful translation of biomarkers into clinical or biologically relevant indicators is limited. RESULTS: With the aim of improving the discovery of translatable metabolite biomarkers, we present search analytics for over one million METLIN metabolite database queries. The most common metabolites found in METLIN were cross-correlated against XCMS Online, the widely used cloud-based data processing and pathway analysis platform. Analysis of the METLIN and XCMS common metabolite data has two primary implications: these metabolites, might indicate a conserved metabolic response to stressors and, this data may be used to gauge the relative uniqueness of potential biomarkers. AVAILABILITY AND IMPLEMENTATION: METLIN can be accessed by logging on to: https://metlin.scripps.edu CONTACT: siuzdak@scripps.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Metaboloma , Metabolômica , Biomarcadores/metabolismo , Bases de Dados Factuais , Humanos , Espectrometria de MassasRESUMO
An autonomous metabolomic workflow combining mass spectrometry analysis with tandem mass spectrometry data acquisition was designed to allow for simultaneous data processing and metabolite characterization. Although previously tandem mass spectrometry data have been generated on the fly, the experiments described herein combine this technology with the bioinformatic resources of XCMS and METLIN. As a result of this unique integration, we can analyze large profiling datasets and simultaneously obtain structural identifications. Validation of the workflow on bacterial samples allowed the profiling on the order of a thousand metabolite features with simultaneous tandem mass spectra data acquisition. The tandem mass spectrometry data acquisition enabled automatic search and matching against the METLIN tandem mass spectrometry database, shortening the current workflow from days to hours. Overall, the autonomous approach to untargeted metabolomics provides an efficient means of metabolomic profiling, and will ultimately allow the more rapid integration of comparative analyses, metabolite identification, and data analysis at a systems biology level.
Assuntos
Biologia Computacional , Desulfovibrio vulgaris/metabolismo , Processamento Eletrônico de Dados/métodos , Metabolômica/métodos , Cromatografia Líquida/métodos , Bases de Dados Factuais , Desulfovibrio vulgaris/crescimento & desenvolvimento , Software , Espectrometria de Massas em Tandem/métodosRESUMO
XCMS Online (xcmsonline.scripps.edu) is a cloud-based informatic platform designed to process and visualize mass-spectrometry-based, untargeted metabolomic data. Initially, the platform was developed for two-group comparisons to match the independent, "control" versus "disease" experimental design. Here, we introduce an enhanced XCMS Online interface that enables users to perform dependent (paired) two-group comparisons, meta-analysis, and multigroup comparisons, with comprehensive statistical output and interactive visualization tools. Newly incorporated statistical tests cover a wide array of univariate analyses. Multigroup comparison allows for the identification of differentially expressed metabolite features across multiple classes of data while higher order meta-analysis facilitates the identification of shared metabolic patterns across multiple two-group comparisons. Given the complexity of these data sets, we have developed an interactive platform where users can monitor the statistical output of univariate (cloud plots) and multivariate (PCA plots) data analysis in real time by adjusting the threshold and range of various parameters. On the interactive cloud plot, metabolite features can be filtered out by their significance level (p-value), fold change, mass-to-charge ratio, retention time, and intensity. The variation pattern of each feature can be visualized on both extracted-ion chromatograms and box plots. The interactive principal component analysis includes scores, loadings, and scree plots that can be adjusted depending on scaling criteria. The utility of XCMS functionalities is demonstrated through the metabolomic analysis of bacterial stress response and the comparison of lymphoblastic leukemia cell lines.
Assuntos
Interpretação Estatística de Dados , Espectrometria de Massas , Metabolômica/métodos , Interface Usuário-Computador , Sangue/metabolismo , Bases de Dados Factuais , Desulfovibrio/metabolismo , Feminino , Humanos , Internet , Linfoma/metabolismo , Masculino , Metanálise como Assunto , Análise Multivariada , Análise de Componente Principal , SoftwareRESUMO
Global metabolomics describes the comprehensive analysis of small molecules in a biological system without bias. With mass spectrometry-based methods, global metabolomic data sets typically comprise thousands of peaks, each of which is associated with a mass-to-charge ratio, retention time, fold change, p-value, and relative intensity. Although several visualization schemes have been used for metabolomic data, most commonly used representations exclude important data dimensions and therefore limit interpretation of global data sets. Given that metabolite identification through tandem mass spectrometry data acquisition is a time-limiting step of the untargeted metabolomic workflow, simultaneous visualization of these parameters from large sets of data could facilitate compound identification and data interpretation. Here, we present such a visualization scheme of global metabolomic data using a so-called "cloud plot" to represent multidimensional data from septic mice. While much attention has been dedicated to lipid compounds as potential biomarkers for sepsis, the cloud plot shows that alterations in hydrophilic metabolites may provide an early signature of the disease prior to the onset of clinical symptoms. The cloud plot is an effective representation of global mass spectrometry-based metabolomic data, and we describe how to extract it as standard output from our XCMS metabolomic software.
Assuntos
Sepse/metabolismo , Animais , Biomarcadores/sangue , Biomarcadores/metabolismo , Lipídeos/sangue , Espectrometria de Massas , Metabolômica , Camundongos , Camundongos Endogâmicos C57BL , Sepse/sangue , SoftwareRESUMO
Recently, interest in untargeted metabolomics has become prevalent in the general scientific community among an increasing number of investigators. The majority of these investigators, however, do not have the bioinformatic expertise that has been required to process metabolomic data by using command-line driven software programs. Here we introduce a novel platform to process untargeted metabolomic data that uses an intuitive graphical interface and does not require installation or technical expertise. This platform, called XCMS Online, is a web-based version of the widely used XCMS software that allows users to easily upload and process liquid chromatography/mass spectrometry data with only a few mouse clicks. XCMS Online provides a solution for the complete untargeted metabolomic workflow including feature detection, retention time correction, alignment, annotation, statistical analysis, and data visualization. Results can be browsed online in an interactive, customizable table showing statistics, chromatograms, and putative METLIN identities for each metabolite. Additionally, all results and images can be downloaded as zip files for offline analysis and publication. XCMS Online is available at https://xcmsonline.scripps.edu.
Assuntos
Cromatografia Líquida/estatística & dados numéricos , Processamento Eletrônico de Dados/métodos , Espectrometria de Massas/estatística & dados numéricos , Metabolômica , Software , Humanos , Internet , PlantasRESUMO
Archived metabolomics data represent a broad resource for the scientific community. However, the absence of tools for the meta-analysis of heterogeneous data types makes it challenging to perform direct comparisons in a single and cohesive workflow. Here we present a framework for the meta-analysis of metabolic pathways and interpretation with proteomic and transcriptomic data. This framework facilitates the comparison of heterogeneous types of metabolomics data from online repositories (e.g., XCMS Online, Metabolomics Workbench, GNPS, and MetaboLights) representing tens of thousands of studies, as well as locally acquired data. As a proof of concept, we apply the workflow for the meta-analysis of i) independent colon cancer studies, further interpreted with proteomics and transcriptomics data, ii) multimodal data from Alzheimer's disease and mild cognitive impairment studies, demonstrating its high-throughput capability for the systems level interpretation of metabolic pathways. Moreover, the platform has been modified for improved knowledge dissemination through a collaboration with Metabolomics Workbench and LIPID MAPS. We envision that this meta-analysis tool will help overcome the primary bottleneck in analyzing diverse datasets and facilitate the full exploitation of archival metabolomics data for addressing a broad array of questions in metabolism research and systems biology.
RESUMO
Systems biology is the study of complex living organisms, and as such, analysis on a systems-wide scale involves the collection of information-dense data sets that are representative of an entire phenotype. To uncover dynamic biological mechanisms, bioinformatics tools have become essential to facilitating data interpretation in large-scale analyses. Global metabolomics is one such method for performing systems biology, as metabolites represent the downstream functional products of ongoing biological processes. We have developed XCMS Online, a platform that enables online metabolomics data processing and interpretation. A systems biology workflow recently implemented within XCMS Online enables rapid metabolic pathway mapping using raw metabolomics data for investigating dysregulated metabolic processes. In addition, this platform supports integration of multi-omic (such as genomic and proteomic) data to garner further systems-wide mechanistic insight. Here, we provide an in-depth procedure showing how to effectively navigate and use the systems biology workflow within XCMS Online without a priori knowledge of the platform, including uploading liquid chromatography (LC)-mass spectrometry (MS) data from metabolite-extracted biological samples, defining the job parameters to identify features, correcting for retention time deviations, conducting statistical analysis of features between sample classes and performing predictive metabolic pathway analysis. Additional multi-omics data can be uploaded and overlaid with previously identified pathways to enhance systems-wide analysis of the observed dysregulations. We also describe unique visualization tools to assist in elucidation of statistically significant dysregulated metabolic pathways. Parameter input takes 5-10 min, depending on user experience; data processing typically takes 1-3 h, and data analysis takes â¼30 min.
Assuntos
Biologia Computacional/métodos , Processamento Eletrônico de Dados/métodos , Metabolismo , Metabolômica/métodos , Biologia de Sistemas/métodos , Internet , SoftwareRESUMO
Cloud-based bioinformatic platforms address the fundamental demands of creating a flexible scientific environment, facilitating data processing and general accessibility independent of a countries' affluence. These platforms have a multitude of advantages as demonstrated by omics technologies, helping to support both government and scientific mandates of a more open environment.
Assuntos
Computação em Nuvem , Armazenamento e Recuperação da Informação , Metabolômica/métodosRESUMO
Heat maps are a commonly used visualization tool for metabolomic data where the relative abundance of ions detected in each sample is represented with color intensity. A limitation of applying heat maps to global metabolomic data, however, is the large number of ions that have to be displayed and the lack of information provided about important metabolomic parameters such as m/z and retention time. Here we address these challenges by introducing the interactive cluster heat map in the data-processing software XCMS Online. XCMS Online (xcmsonline.scripps.edu) is a cloud-based informatic platform designed to process, statistically evaluate, and visualize mass-spectrometry based metabolomic data. An interactive heat map is provided for all data processed by XCMS Online. The heat map is clickable, allowing users to zoom and explore specific metabolite metadata (EICs, Box-and-whisker plots, mass spectra) that are linked to the METLIN metabolite database. The utility of the XCMS interactive heat map is demonstrated on metabolomic data set generated from different anatomical regions of the mouse brain.