RESUMO
Lipidomics studies suffer from analytical and annotation challenges because of the great structural similarity of many of the lipid species. To improve lipid characterization and annotation capabilities beyond those afforded by traditional mass spectrometry (MS)-based methods, multidimensional separation methods such as those integrating liquid chromatography, ion mobility spectrometry, collision-induced dissociation and MS (LC-IMS-CID-MS) may be used. Although LC-IMS-CID-MS and other multidimensional methods offer valuable hydrophobicity, structural and mass information, the files are also complex and difficult to assess. Thus, the development of software tools to rapidly process and facilitate confident lipid annotations is essential. In this Protocol Extension, we use the freely available, vendor-neutral and open-source software Skyline to process and annotate multidimensional lipidomic data. Although Skyline ( https://skyline.ms/skyline.url ) was established for targeted processing of LC-MS-based proteomics data, it has since been extended such that it can be used to analyze small-molecule data as well as data containing the IMS dimension. This protocol uses Skyline's recently expanded capabilities, including small-molecule spectral libraries, indexed retention time and ion mobility filtering, and provides a step-by-step description for importing data, predicting retention times, validating lipid annotations, exporting results and editing our manually validated 500+ lipid library. Although the time required to complete the steps outlined here varies on the basis of multiple factors such as dataset size and familiarity with Skyline, this protocol takes ~5.5 h to complete when annotations are rigorously verified for maximum confidence.
Assuntos
Espectrometria de Mobilidade Iônica , Lipidômica , Cromatografia Líquida/métodos , Espectrometria de Massas/métodos , LipídeosRESUMO
RING-between-RING (RBR) E3 ligases mediate ubiquitin transfer through an obligate E3-ubiquitin thioester intermediate prior to substrate ubiquitination. Although RBRs share a conserved catalytic module, substrate recruitment mechanisms remain enigmatic, and the relevant domains have yet to be identified for any member of the class. Here we characterize the interaction between the auto-inhibited RBR, HHARI (AriH1), and its target protein, 4EHP, using a combination of XL-MS, HDX-MS, NMR, and biochemical studies. The results show that (1) a di-aromatic surface on the catalytic HHARI Rcat domain forms a binding platform for substrates and (2) a phosphomimetic mutation on the auto-inhibitory Ariadne domain of HHARI promotes release and reorientation of Rcat for transthiolation and substrate modification. The findings identify a direct binding interaction between a RING-between-RING ligase and its substrate and suggest a general model for RBR substrate recognition.
Assuntos
Proteínas Culina , Ubiquitina , Domínio Catalítico , Proteínas Culina/metabolismo , Ubiquitina/metabolismo , Ubiquitina-Proteína Ligases/química , UbiquitinaçãoRESUMO
The implication of lipid dysregulation in diseases, toxic exposure outcomes, and inflammation has brought great interest to lipidomic studies. However, lipids have proven to be analytically challenging due to their highly isomeric nature and vast concentration ranges in biological matrices. Therefore, multidimensional techniques such as those integrating liquid chromatography, ion mobility spectrometry, collision-induced dissociation, and mass spectrometry (LC-IMS-CID-MS) have been implemented to separate lipid isomers as well as provide structural information and increased identification confidence. These data sets are however extremely large and complex, resulting in challenges for data processing and annotation. Here, we have overcome these challenges by developing sample-specific multidimensional lipid libraries using the freely available software Skyline. Specifically, the human plasma library developed for this work contains over 500 unique lipids and is combined with adapted Skyline functions such as indexed retention time (iRT) for retention time prediction and IMS drift time filtering for enhanced selectivity. For comparison with other studies, this database was used to annotate LC-IMS-CID-MS data from a NIST SRM 1950 extract. The same workflow was then utilized to assess plasma and bronchoalveolar lavage fluid (BALF) samples from patients with varying degrees of smoke inhalation injury to identify lipid-based patient prognostic and diagnostic markers.
Assuntos
Lipidômica , Lesão por Inalação de Fumaça , Cromatografia Líquida , Humanos , Espectrometria de Mobilidade Iônica , LipídeosRESUMO
Searching tandem mass spectra against a peptide database requires accurate knowledge of various experimental parameters, including machine settings and details of the sample preparation protocol. In some cases, such as in reanalysis of public data sets, this experimental metadata may be missing or inaccurate. We describe a method for automatically inferring the presence of various types of modifications, including stable-isotope and isobaric labeling and tandem mass tags as well as the enrichment of phosphorylated peptides, directly from a given set of mass spectra. We demonstrate the sensitivity and specificity of the proposed approach, and we provide open-source Python and C++ implementations in a new version of the software tool Param-Medic.
Assuntos
Marcação por Isótopo , Proteoma/análise , Proteômica/métodos , Software , Espectrometria de Massas em Tandem , Bases de Dados de Proteínas , Fosfopeptídeos/análiseRESUMO
Decoy database search with target-decoy competition (TDC) provides an intuitive, easy-to-implement method for estimating the false discovery rate (FDR) associated with spectrum identifications from shotgun proteomics data. However, the procedure can yield different results for a fixed data set analyzed with different decoy databases, and this decoy-induced variability is particularly problematic for smaller FDR thresholds, data sets, or databases. The average TDC (aTDC) protocol combats this problem by exploiting multiple independently shuffled decoy databases to provide an FDR estimate with reduced variability. We provide a tutorial introduction to aTDC, describe an improved variant of the protocol that offers increased statistical power, and discuss how to deploy aTDC in practice using the Crux software toolkit.
Assuntos
Bases de Dados de Proteínas/normas , Proteômica/métodos , Software , Conjuntos de Dados como Assunto , Humanos , Modelos Estatísticos , Reprodutibilidade dos TestesRESUMO
In shotgun proteomics analysis, user-specified parameters are critical to database search performance and therefore to the yield of confident peptide-spectrum matches (PSMs). Two of the most important parameters are related to the accuracy of the mass spectrometer. Precursor mass tolerance defines the peptide candidates considered for each spectrum. Fragment mass tolerance or bin size determines how close observed and theoretical fragments must be to be considered a match. For either of these two parameters, too wide a setting yields randomly high-scoring false PSMs, whereas too narrow a setting erroneously excludes true PSMs, in both cases, lowering the yield of peptides detected at a given false discovery rate. We describe a strategy for inferring optimal search parameters by assembling and analyzing pairs of spectra that are likely to have been generated by the same peptide ion to infer precursor and fragment mass error. This strategy does not rely on a database search, making it usable in a wide variety of settings. In our experiments on data from a variety of instruments including Orbitrap and Q-TOF acquisitions, this strategy yields more high-confidence PSMs than using settings based on instrument defaults or determined by experts. Param-Medic is open-source and cross-platform. It is available as a standalone tool ( http://noble.gs.washington.edu/proj/param-medic/ ) and has been integrated into the Crux proteomics toolkit ( http://crux.ms ), providing automatic parameter selection for the Comet and Tide search engines.
Assuntos
Bases de Dados de Proteínas , Peptídeos/isolamento & purificação , Proteômica , Espectrometria de Massas em Tandem/métodos , Algoritmos , Mapeamento de Peptídeos , Peptídeos/genética , Ferramenta de Busca , SoftwareRESUMO
Efficiently and accurately analyzing big protein tandem mass spectrometry data sets requires robust software that incorporates state-of-the-art computational, machine learning, and statistical methods. The Crux mass spectrometry analysis software toolkit ( http://cruxtoolkit.sourceforge.net ) is an open source project that aims to provide users with a cross-platform suite of analysis tools for interpreting protein mass spectrometry data.
Assuntos
Proteínas/química , Espectrometria de Massas em Tandem/métodos , Bases de Dados de Proteínas , InternetRESUMO
UNLABELLED: Skyline is a Windows client application for targeted proteomics method creation and quantitative data analysis. The Skyline document model contains extensive mass spectrometry data from targeted proteomics experiments performed using selected reaction monitoring, parallel reaction monitoring and data-independent and data-dependent acquisition methods. Researchers have developed software tools that perform statistical analysis of the experimental data contained within Skyline documents. The new external tools framework allows researchers to integrate their tools into Skyline without modifying the Skyline codebase. Installed tools provide point-and-click access to downstream statistical analysis of data processed in Skyline. The framework also specifies a uniform interface to format tools for installation into Skyline. Tool developers can now easily share their tools with proteomics researchers using Skyline. AVAILABILITY AND IMPLEMENTATION: Skyline is available as a single-click self-updating web installation at http://skyline.maccosslab.org. This Web site also provides access to installable external tools and documentation. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.