RESUMEN
Molecular networking has become a key method to visualize and annotate the chemical space in non-targeted mass spectrometry data. We present feature-based molecular networking (FBMN) as an analysis method in the Global Natural Products Social Molecular Networking (GNPS) infrastructure that builds on chromatographic feature detection and alignment tools. FBMN enables quantitative analysis and resolution of isomers, including from ion mobility spectrometry.
Asunto(s)
Productos Biológicos/química , Espectrometría de Masas , Biología Computacional/métodos , Bases de Datos Factuales , Metabolómica/métodos , Programas InformáticosRESUMEN
Data-independent acquisition (DIA) is becoming a leading analysis method in biomedical mass spectrometry. The main advantages include greater reproducibility and sensitivity and a greater dynamic range compared with data-dependent acquisition (DDA). However, the data analysis is complex and often requires expert knowledge when dealing with large-scale data sets. Here we present DIAproteomics, a multifunctional, automated, high-throughput pipeline implemented in the Nextflow workflow management system that allows one to easily process proteomics and peptidomics DIA data sets on diverse compute infrastructures. The central components are well-established tools such as the OpenSwathWorkflow for the DIA spectral library search and PyProphet for the false discovery rate assessment. In addition, it provides options to generate spectral libraries from existing DDA data and to carry out the retention time and chromatogram alignment. The output includes annotated tables and diagnostic visualizations from the statistical postprocessing and computation of fold-changes across pairwise conditions, predefined in an experimental design. DIAproteomics is well documented open-source software and is available under a permissive license to the scientific community at https://www.openms.de/diaproteomics/.
Asunto(s)
Análisis de Datos , Proteómica , Espectrometría de Masas , Reproducibilidad de los Resultados , Programas InformáticosRESUMEN
Technological advances in high-resolution mass spectrometry (MS) vastly increased the number of samples that can be processed in a life science experiment, as well as volume and complexity of the generated data. To address the bottleneck of high-throughput data processing, we present SmartPeak (https://github.com/AutoFlowResearch/SmartPeak), an application that encapsulates advanced algorithms to enable fast, accurate, and automated processing of capillary electrophoresis-, gas chromatography-, and liquid chromatography (LC)-MS(/MS) data and high-pressure LC data for targeted and semitargeted metabolomics, lipidomics, and fluxomics experiments. The application allows for an approximate 100-fold reduction in the data processing time compared to manual processing while enhancing quality and reproducibility of the results.
Asunto(s)
Procesamiento Automatizado de Datos/métodos , Metabolómica/métodos , Automatización , Cromatografía Liquida , Electroforesis Capilar , Espectrometría de Masas en Tándem , Factores de TiempoRESUMEN
BACKGROUND: Aberrant hedgehog (HH) signaling is implicated in the development of various cancer entities such as medulloblastoma. Activation of GLI transcription factors was revealed as the driving force upon pathway activation. Increased phosphorylation of essential effectors such as Smoothened (SMO) and GLI proteins by kinases including Protein Kinase A, Casein Kinase 1, and Glycogen Synthase Kinase 3 ß controls effector activity, stability and processing. However, a deeper and more comprehensive understanding of phosphorylation in the signal transduction remains unclear, particularly during early response processes involved in SMO activation and preceding GLI target gene regulation. METHODS: We applied temporal quantitative phosphoproteomics to reveal phosphorylation dynamics underlying the short-term chemical activation and inhibition of early hedgehog signaling in HH responsive human medulloblastoma cells. Medulloblastoma cells were treated for 5.0 and 15 min with Smoothened Agonist (SAG) to induce and with vismodegib to inhibit the HH pathway. RESULTS: Our phosphoproteomic profiling resulted in the quantification of 7700 and 10,000 phosphosites after 5.0 and 15 min treatment, respectively. The data suggest a central role of phosphorylation in the regulation of ciliary assembly, trafficking, and signal transduction already after 5.0 min treatment. ERK/MAPK signaling, besides Protein Kinase A signaling and mTOR signaling, were differentially regulated after short-term treatment. Activation of Polo-like Kinase 1 and inhibition of Casein Kinase 2A1 were characteristic for vismodegib treatment, while SAG treatment induced Aurora Kinase A activity. Distinctive phosphorylation of central players of HH signaling such as SMO, SUFU, GLI2 and GLI3 was observed only after 15 min treatment. CONCLUSIONS: This study provides evidence that phosphorylation triggered in response to SMO modulation dictates the localization of hedgehog pathway components within the primary cilium and affects the regulation of the SMO-SUFU-GLI axis. The data are relevant for the development of targeted therapies of HH-associated cancers including sonic HH-type medulloblastoma. A deeper understanding of the mechanisms of action of SMO inhibitors such as vismodegib may lead to the development of compounds causing fewer adverse effects and lower frequencies of drug resistance. Video Abstract.
Asunto(s)
Neoplasias Cerebelosas/metabolismo , Proteínas Hedgehog/metabolismo , Meduloblastoma/metabolismo , Proteómica , Transducción de Señal , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Anilidas/farmacología , Proteína BRCA1/metabolismo , Quinasa de la Caseína II/antagonistas & inhibidores , Quinasa de la Caseína II/metabolismo , Proteínas de Ciclo Celular/metabolismo , Neoplasias Cerebelosas/genética , Neoplasias Cerebelosas/patología , Cilios/efectos de los fármacos , Cilios/metabolismo , Proteínas del Citoesqueleto/metabolismo , Activación Enzimática/efectos de los fármacos , Regulación Neoplásica de la Expresión Génica/efectos de los fármacos , Humanos , Meduloblastoma/genética , Meduloblastoma/patología , Fosfopéptidos/metabolismo , Fosforilación/efectos de los fármacos , Proteínas Serina-Treonina Quinasas/metabolismo , Proteoma/metabolismo , Proteínas Proto-Oncogénicas/metabolismo , Piridinas/farmacología , Transducción de Señal/efectos de los fármacos , Especificidad por Sustrato/efectos de los fármacos , Quinasa Tipo Polo 1RESUMEN
Mass spectrometry (MS) is one of the primary techniques used for large-scale analysis of small molecules in metabolomics studies. To date, there has been little data format standardization in this field, as different software packages export results in different formats represented in XML or plain text, making data sharing, database deposition, and reanalysis highly challenging. Working within the consortia of the Metabolomics Standards Initiative, Proteomics Standards Initiative, and the Metabolomics Society, we have created mzTab-M to act as a common output format from analytical approaches using MS on small molecules. The format has been developed over several years, with input from a wide range of stakeholders. mzTab-M is a simple tab-separated text format, but importantly, the structure is highly standardized through the design of a detailed specification document, tightly coupled to validation software, and a mandatory controlled vocabulary of terms to populate it. The format is able to represent final quantification values from analyses, as well as the evidence trail in terms of features measured directly from MS (e.g., LC-MS, GC-MS, DIMS, etc.) and different types of approaches used to identify molecules. mzTab-M allows for ambiguity in the identification of molecules to be communicated clearly to readers of the files (both people and software). There are several implementations of the format available, and we anticipate widespread adoption in the field.
Asunto(s)
Metabolómica/métodos , Programas Informáticos , Bases de Datos Factuales , Espectrometría de MasasRESUMEN
The application of ketogenic diet (KD) (high fat/low carbohydrate/adequate protein) as an auxiliary cancer therapy is a field of growing attention. KD provides sufficient energy supply for healthy cells, while possibly impairing energy production in highly glycolytic tumor cells. Moreover, KD regulates insulin and tumor related growth factors (like insulin growth factor-1, IGF-1). In order to provide molecular evidence for the proposed additional inhibition of tumor growth when combining chemotherapy with KD, we applied untargeted quantitative metabolome analysis on a spontaneous breast cancer xenograft mouse model, using MDA-MB-468 cells. Healthy mice and mice bearing breast cancer xenografts and receiving cyclophosphamide chemotherapy were compared after treatment with control diet and KD. Metabolomic profiling was performed on plasma samples, applying high-performance liquid chromatography coupled to tandem mass spectrometry. Statistical analysis revealed metabolic fingerprints comprising numerous significantly regulated features in the group of mice bearing breast cancer. This fingerprint disappeared after treatment with KD, resulting in recovery to the metabolic status observed in healthy mice receiving control diet. Moreover, amino acid metabolism as well as fatty acid transport were found to be affected by both the tumor and the applied KD. Our results provide clear evidence of a significant molecular effect of adjuvant KD in the context of tumor growth inhibition and suggest additional mechanisms of tumor suppression beyond the proposed constrain in energy supply of tumor cells.
Asunto(s)
Dieta Cetogénica , Metaboloma , Metabolómica , Neoplasias/metabolismo , Acetilación , Aminoácidos/biosíntesis , Aminoácidos/metabolismo , Animales , Línea Celular Tumoral , Cromatografía Líquida de Alta Presión , Modelos Animales de Enfermedad , Ácidos Grasos/metabolismo , Xenoinjertos , Humanos , Metabolómica/métodos , Ratones , Neoplasias/patología , Espectrometría de Masas en TándemRESUMEN
The relatively new research discipline of Eco-Metabolomics is the application of metabolomics techniques to ecology with the aim to characterise biochemical interactions of organisms across different spatial and temporal scales. Metabolomics is an untargeted biochemical approach to measure many thousands of metabolites in different species, including plants and animals. Changes in metabolite concentrations can provide mechanistic evidence for biochemical processes that are relevant at ecological scales. These include physiological, phenotypic and morphological responses of plants and communities to environmental changes and also interactions with other organisms. Traditionally, research in biochemistry and ecology comes from two different directions and is performed at distinct spatiotemporal scales. Biochemical studies most often focus on intrinsic processes in individuals at physiological and cellular scales. Generally, they take a bottom-up approach scaling up cellular processes from spatiotemporally fine to coarser scales. Ecological studies usually focus on extrinsic processes acting upon organisms at population and community scales and typically study top-down and bottom-up processes in combination. Eco-Metabolomics is a transdisciplinary research discipline that links biochemistry and ecology and connects the distinct spatiotemporal scales. In this review, we focus on approaches to study chemical and biochemical interactions of plants at various ecological levels, mainly plantâ»organismal interactions, and discuss related examples from other domains. We present recent developments and highlight advancements in Eco-Metabolomics over the last decade from various angles. We further address the five key challenges: (1) complex experimental designs and large variation of metabolite profiles; (2) feature extraction; (3) metabolite identification; (4) statistical analyses; and (5) bioinformatics software tools and workflows. The presented solutions to these challenges will advance connecting the distinct spatiotemporal scales and bridging biochemistry and ecology.
Asunto(s)
Ecología , Metabolómica/tendencias , Plantas/genética , Plantas/metabolismoRESUMEN
Metabolomics experiments generate highly complex datasets, which are time and work-intensive, sometimes even error-prone if inspected manually. Therefore, new methods for automated, fast, reproducible, and accurate data processing and dereplication are required. Here, we present UmetaFlow, a computational workflow for untargeted metabolomics that combines algorithms for data pre-processing, spectral matching, molecular formula and structural predictions, and an integration to the GNPS workflows Feature-Based Molecular Networking and Ion Identity Molecular Networking for downstream analysis. UmetaFlow is implemented as a Snakemake workflow, making it easy to use, scalable, and reproducible. For more interactive computing, visualization, as well as development, the workflow is also implemented in Jupyter notebooks using the Python programming language and a set of Python bindings to the OpenMS algorithms (pyOpenMS). Finally, UmetaFlow is also offered as a web-based Graphical User Interface for parameter optimization and processing of smaller-sized datasets. UmetaFlow was validated with in-house LC-MS/MS datasets of actinomycetes producing known secondary metabolites, as well as commercial standards, and it detected all expected features and accurately annotated 76% of the molecular formulas and 65% of the structures. As a more generic validation, the publicly available MTBLS733 and MTBLS736 datasets were used for benchmarking, and UmetaFlow detected more than 90% of all ground truth features and performed exceptionally well in quantification and discriminating marker selection. We anticipate that UmetaFlow will provide a useful platform for the interpretation of large metabolomics datasets.
RESUMEN
Neuroblastoma, the most common childhood solid tumor, originates from primitive sympathetic nervous system cells. Epoxyazadiradione (EAD) is a limonoid derived from Azadirachta indica, belonging to the family Meliaceae. In this study, we isolated the EAD from Azadirachta indica seed and studied the anti-cancer potential against neuroblastoma. Herein, EAD demonstrated significant efficacy against neuroblastoma by suppressing cell proliferation, enhancing the rate of apoptosis and cycle arrest at the SubG0 and G2/M phases. EAD enhanced the pro-apoptotic Caspase 3 and Caspase 9 and inhibited the NF-kß translocation in a dose-dependent manner. In order to identify the specific EAD target, a gel-free quantitative proteomics study on SH-SY5Y cells using Liquid Chromatography with tandem mass spectrometry was done in a dose-dependent manner, followed by detailed bioinformatics analysis to identify effects on protein. Proteomics data identified that Enolase1 and HSP90 were up-regulated in neuroblastoma. EAD inhibited the expression of Enolase1 and HSP90, validated by mRNA expression, immunoblotting, Enolase1 and HSP90 kit and flow-cytometry based bioassay. Molecular docking study, Molecular dynamic simulation, and along with molecular mechanics/Poisson-Boltzmann surface area analysis also suggested that EAD binds at the active site of the proteins and were stable throughout the 100 ns Molecular dynamic simulation study. Overall, this study suggested EAD exhibited anti-cancer activity against neuroblastoma by targeting Enolase1 and HSP90 pathways.Communicated by Ramaswamy H. Sarma.
RESUMEN
The extraction of meaningful biological knowledge from high-throughput mass spectrometry data relies on limiting false discoveries to a manageable amount. For targeted approaches in metabolomics a main challenge is the detection of false positive metabolic features in the low signal-to-noise ranges of data-independent acquisition results and their filtering. Another factor is that the creation of assay libraries for data-independent acquisition analysis and the processing of extracted ion chromatograms have not been automated in metabolomics. Here we present a fully automated open-source workflow for high-throughput metabolomics that combines data-dependent and data-independent acquisition for library generation, analysis, and statistical validation, with rigorous control of the false-discovery rate while matching manual analysis regarding quantification accuracy. Using an experimentally specific data-dependent acquisition library based on reference substances allows for accurate identification of compounds and markers from data-independent acquisition data in low concentrations, facilitating biomarker quantification.
Asunto(s)
Metabolómica , Biomarcadores , Espectrometría de Masas , Metabolómica/métodos , Flujo de TrabajoRESUMEN
Mitochondria are key organelles for cellular energetics, metabolism, signaling, and quality control and have been linked to various diseases. Different views exist on the composition of the human mitochondrial proteome. We classified >8,000 proteins in mitochondrial preparations of human cells and defined a mitochondrial high-confidence proteome of >1,100 proteins (MitoCoP). We identified interactors of translocases, respiratory chain, and ATP synthase assembly factors. The abundance of MitoCoP proteins covers six orders of magnitude and amounts to 7% of the cellular proteome with the chaperones HSP60-HSP10 being the most abundant mitochondrial proteins. MitoCoP dynamics spans three orders of magnitudes, with half-lives from hours to months, and suggests a rapid regulation of biosynthesis and assembly processes. 460 MitoCoP genes are linked to human diseases with a strong prevalence for the central nervous system and metabolism. MitoCoP will provide a high-confidence resource for placing dynamics, functions, and dysfunctions of mitochondria into the cellular context.
Asunto(s)
Mitocondrias , Proteoma , Humanos , Mitocondrias/metabolismo , Membranas Mitocondriales/metabolismo , Proteínas Mitocondriales/metabolismo , Proteoma/metabolismoRESUMEN
This chapter describes the open-source tool suite OpenMS. OpenMS contains more than 180 tools which can be combined to build complex and flexible data-processing workflows. The broad range of functionality and the interoperability of these tools enable complex, complete, and reproducible data analysis workflows in computational proteomics and metabolomics. We introduce the key concepts of OpenMS and illustrate its capabilities with a complete workflow for the analysis of untargeted metabolomics data, including metabolite quantification and identification.
Asunto(s)
Biología Computacional/métodos , Interpretación Estadística de Datos , Metabolómica , Programas Informáticos , Algoritmos , Bases de Datos Factuales , Humanos , Metabolómica/métodos , Proteómica/métodos , Navegador Web , Flujo de TrabajoRESUMEN
The determination of cocoa shell content (Theobroma cacao L.) in cocoa products using a metabolomics approach was accomplished via high performance liquid chromatography quadrupole time-of-flight mass spectrometry (HPLC-QTOF-MS). The developed method was used to separately analyze the polar and non-polar metabolome of the cocoa testa (cocoa shell) and the cocoa cotyledons (cocoa nibs) of cocoa samples from 15 different geographic origins, harvest years, and varieties in positive and negative ion mode. Potential key metabolites were selected which are exclusively contained in the cocoa shell or with significant higher concentration in the cocoa shell than in the cocoa nibs. The pool of potential key metabolites was filtered by established selection criteria, such as temperature stability, fermentations stability, and independence from the geographic origin. Based on these key metabolites an inverse sparse partial least square regression (SPLS) was used for the prediction of the cocoa shell content.
Asunto(s)
Cacao/metabolismo , Chocolate/análisis , Espectrometría de Masas , Metaboloma , Calibración , Cromatografía Líquida de Alta Presión , Fermentación , Frutas/metabolismo , Geografía , Metabolómica , Estándares de Referencia , Espectrometría de Masa por Ionización de Electrospray , TemperaturaRESUMEN
BACKGROUND: In recent years, several mass spectrometry-based omics technologies emerged to investigate qualitative and quantitative changes within thousands of biologically active components such as proteins, lipids and metabolites. The research enabled through these methods potentially contributes to the diagnosis and pathophysiology of human diseases as well as to the clarification of structures and interactions between biomolecules. Simultaneously, technological advances in the field of mass spectrometry leading to an ever increasing amount of data, demand high standards in efficiency, accuracy and reproducibility of potential analysis software. RESULTS: This article presents the current state and ongoing developments in OpenMS, a versatile open-source framework aimed at enabling reproducible analyses of high-throughput mass spectrometry data. It provides implementations of frequently occurring processing operations on MS data through a clean application programming interface in C++ and Python. A collection of 185 tools and ready-made workflows for typical MS-based experiments enable convenient analyses for non-developers and facilitate reproducible research without losing flexibility. CONCLUSIONS: OpenMS will continue to increase its ease of use for developers as well as users with improved continuous integration/deployment strategies, regular trainings with updated training materials and multiple sources of support. The active developer community ensures the incorporation of new features to support state of the art research.