RESUMEN
Rice crops are often subject to multiple abiotic stresses simultaneously in both natural and cultivated environments, resulting in yield reductions beyond those expected from single stress. We report physiological changes after a 4 day exposure to combined drought, salt and extreme temperature treatments, following a 2 day salinity pre-treatment in two rice genotypes-Nipponbare (a paddy rice) and IAC1131 (an upland landrace). Stomata closed after two days of combined stresses, causing intercellular CO2 concentrations and assimilation rates to diminish rapidly. Abscisic acid (ABA) levels increased at least five-fold but did not differ significantly between the genotypes. Tandem Mass Tag isotopic labelling quantitative proteomics revealed 6215 reproducibly identified proteins in mature leaves across the two genotypes and three time points (0, 2 and 4 days of stress). Of these, 987 were differentially expressed due to stress (cf. control plants), including 41 proteins that changed significantly in abundance in all stressed plants. Heat shock proteins, late embryogenesis abundant proteins and photosynthesis-related proteins were consistently responsive to stress in both Nipponbare and IAC1131. Remarkably, even after 2 days of stress there were almost six times fewer proteins differentially expressed in IAC1131 than Nipponbare. This contrast in the translational response to multiple stresses is consistent with the known tolerance of IAC1131 to dryland conditions.
Asunto(s)
Oryza/fisiología , Estrés Fisiológico/fisiología , Ácido Abscísico/metabolismo , Gases/metabolismo , Regulación de la Expresión Génica de las Plantas , Genotipo , Oryza/genética , Fotosíntesis , Hojas de la Planta/metabolismo , Proteínas de Plantas/metabolismo , Proteoma/metabolismo , Proteómica , Estrés Fisiológico/genéticaRESUMEN
Credible detection and quantification of low abundance proteins from human blood plasma is a major challenge in precision medicine biomarker discovery when using mass spectrometry (MS). In this proof-of-concept study, we employed a mixture of selected recombinant proteins in DDA libraries to subsequently identify (not quantify) cancer-associated low abundance plasma proteins using SWATH/DIA. The exemplar DDA recombinant protein spectral library (rPSL) was derived from tryptic digestion of 36 recombinant human proteins that had been previously implicated as possible cancer biomarkers from both our own and other studies. The rPSL was then used to identify proteins from nondepleted colorectal cancer (CRC) EDTA plasmas by SWATH-MS. Most (32/36) of the proteins used in the rPSL were reliably identified from CRC plasma samples, including 8 proteins (i.e., BTC, CXCL10, IL1B, IL6, ITGB6, TGFα, TNF, TP53) not previously detected using high-stringency protein inference MS according to PeptideAtlas. The rPSL SWATH-MS protocol was compared to DDA-MS using MARS-depleted and postdigestion peptide fractionated plasmas (here referred to as a human plasma DDA library). Of the 32 proteins identified using rPSL SWATH, only 12 could be identified using DDA-MS. The 20 additional proteins exclusively identified using the rPSL SWATH approach were almost exclusively lower abundance (i.e., <10 ng/mL) proteins. To mitigate justified FDR concerns, and to replicate a more typical library creation approach, the DDA rPSL library was merged with a human plasma DDA library and SWATH identification repeated using such a merged library. The majority (33/36) of the low abundance plasma proteins added from the rPSL were still able to be identified using such a merged library when high-stringency HPP Guidelines v3.0 protein inference criteria were applied to our data set. The MS data set has been deposited to ProteomeXchange Consortium via the PRIDE partner repository (PXD022361).
Asunto(s)
Proteoma , Proteómica , Biomarcadores , Proteínas Sanguíneas , Bases de Datos de Proteínas , Humanos , Proteínas RecombinantesRESUMEN
We describe a useful workflow for characterizing proteomics experiments incorporating many conditions and abundance data using the popular weighted gene correlation network analysis (WGCNA) approach and functional annotation with the PloGO2 R package, the latter of which we have extended and made available to Bioconductor. The approach can use quantitative data from labeled or label-free experiments and was developed to handle multiple files stemming from data partition or multiple pairwise comparisons. The WGCNA approach can similarly produce a potentially large number of clusters of interest, which can also be functionally characterized using PloGO2. Enrichment analysis will identify clusters or subsets of proteins of interest, and the WGCNA network topology scores will produce a ranking of proteins within these clusters or subsets. This can naturally lead to prioritized proteins to be considered for further analysis or as candidates of interest for validation in the context of complex experiments. We demonstrate the use of the package on two published data sets using two different biological systems (plant and human plasma) and proteomics platforms (sequential window acquisition of all theoretical fragment-ion spectra (SWATH) and tandem mass tag (TMT)): an analysis of the effect of drought on rice over time generated using TMT and a pediatric plasma sample data set generated using SWATH. In both, the automated workflow recapitulates key insights or observations of the published papers and provides additional suggestions for further investigation. These findings indicate that the data set analysis using WGCNA combined with the updated PloGO2 package is a powerful method to gain biological insights from complex multifaceted proteomics experiments.
Asunto(s)
Proteínas , Proteómica , Niño , Correlación de Datos , Humanos , Plasma , Programas Informáticos , Flujo de TrabajoRESUMEN
Summary: Large-scale peptide mass spectrometry (MS)/MS reference libraries are essential for the comprehensive analysis of data-independent acquisition (DIA) MS datasets, providing a comprehensive set of spectra for identification and quantification of proteins. We have developed a novel web-based R-package (iSwathX) for combining reference libraries that is compatible with different DIA analysis software. This open-source web GUI automates the process of normalization and combination of spectral libraries and provides a user-friendly method for performing library format conversions, analysis and visualizations, with no need for programing familiarity. Availability and implementation: iSwathX is freely accessible at https://biolinfo.shinyapps.io/iSwathX with the R-package and Shiny source code available from GitHub (https://github.com/znoor/iSwathX). Supplementary information: Supplementary data are available at Bioinformatics online.
Asunto(s)
Biblioteca de Péptidos , Programas Informáticos , Biología Computacional , Internet , Espectrometría de Masas en TándemRESUMEN
BACKGROUND: One of the most significant challenges in colorectal cancer (CRC) management is the use of compliant early stage population-based diagnostic tests as adjuncts to confirmatory colonoscopy. Despite the near curative nature of early clinical stage surgical resection, mortality remains unacceptably high-as the majority of patients diagnosed by faecal haemoglobin followed by colonoscopy occur at latter stages. Additionally, current population-based screens reliant on fecal occult blood test (FOBT) have low compliance (~ 40%) and tests suffer low sensitivities. Therefore, blood-based diagnostic tests offer survival benefits from their higher compliance (≥ 97%), if they can at least match the sensitivity and specificity of FOBTs. However, discovery of low abundance plasma biomarkers is difficult due to occupancy of a high percentage of proteomic discovery space by many high abundance plasma proteins (e.g., human serum albumin). METHODS: A combination of high abundance protein ultradepletion (e.g., MARS-14 and an in-house IgY depletion columns) strategies, extensive peptide fractionation methods (SCX, SAX, High pH and SEC) and SWATH-MS were utilized to uncover protein biomarkers from a cohort of 100 plasma samples (i.e., pools of 20 healthy and 20 stages I-IV CRC plasmas). The differentially expressed proteins were analyzed using ANOVA and pairwise t-tests (p < 0.05; fold-change > 1.5), and further examined with a neural network classification method using in silico augmented 5000 patient datasets. RESULTS: Ultradepletion combined with peptide fractionation allowed for the identification of a total of 513 plasma proteins, 8 of which had not been previously reported in human plasma (based on PeptideAtlas database). SWATH-MS analysis revealed 37 protein biomarker candidates that exhibited differential expression across CRC stages compared to healthy controls. Of those, 7 candidates (CST3, GPX3, CFD, MRC1, COMP, PON1 and ADAMDEC1) were validated using Western blotting and/or ELISA. The neural network classification narrowed down candidate biomarkers to 5 proteins (SAA2, APCS, APOA4, F2 and AMBP) that had maintained accuracy which could discern early (I/II) from late (III/IV) stage CRC. CONCLUSION: MS-based proteomics in combination with ultradepletion strategies have an immense potential of identifying diagnostic protein biosignature.
RESUMEN
The use of data-independent acquisition methods such as SWATH for mass spectrometry based proteomics is usually performed with peptide MS/MS assay libraries which enable identification and quantitation of peptide peak areas. Reference assay libraries can be generated locally through information dependent acquisition, or obtained from community data repositories for commonly studied organisms. However, there have been no studies performed to systematically evaluate how locally generated or repository-based assay libraries affect SWATH performance for proteomic studies. To undertake this analysis, we developed a software workflow, SwathXtend, which generates extended peptide assay libraries by integration with a local seed library and delivers statistical analysis of SWATH-quantitative comparisons. We designed test samples using peptides from a yeast extract spiked into peptides from human K562 cell lysates at three different ratios to simulate protein abundance change comparisons. SWATH-MS performance was assessed using local and external assay libraries of varying complexities and proteome compositions. These experiments demonstrated that local seed libraries integrated with external assay libraries achieve better performance than local assay libraries alone, in terms of the number of identified peptides and proteins and the specificity to detect differentially abundant proteins. Our findings show that the performance of extended assay libraries is influenced by the MS/MS feature similarity of the seed and external libraries, while statistical analysis using multiple testing corrections increases the statistical rigor needed when searching against large extended assay libraries.
Asunto(s)
Proteómica/métodos , Espectrometría de Masas en Tándem/métodos , Cromatografía Liquida , Humanos , Células K562 , Biblioteca de Péptidos , Programas InformáticosRESUMEN
Post-translational modifications (PTMs) can occur soon after translation or at any stage in the lifecycle of a given protein, and they may help regulate protein folding, stability, cellular localisation, activity, or the interactions proteins have with other proteins or biomolecular species. PTMs are crucial to our functional understanding of biology, and new quantitative mass spectrometry (MS) and bioinformatics workflows are maturing both in labelled multiplexed and label-free techniques, offering increasing coverage and new opportunities to study human health and disease. Techniques such as Data Independent Acquisition (DIA) are emerging as promising approaches due to their re-mining capability. Many bioinformatics tools have been developed to support the analysis of PTMs by mass spectrometry, from prediction and identifying PTM site assignment, open searches enabling better mining of unassigned mass spectra-many of which likely harbour PTMs-through to understanding PTM associations and interactions. The remaining challenge lies in extracting functional information from clinically relevant PTM studies. This review focuses on canvassing the options and progress of PTM analysis for large quantitative studies, from choosing the platform, through to data analysis, with an emphasis on clinically relevant samples such as plasma and other body fluids, and well-established tools and options for data interpretation.
Asunto(s)
Biología Computacional , Procesamiento Proteico-Postraduccional , Proteínas/metabolismo , Algoritmos , Líquidos Corporales/metabolismo , Biología Computacional/métodos , Bases de Datos Genéticas , Humanos , Espectrometría de Masas , Péptidos/metabolismo , Fosforilación , Proteómica/métodos , Reproducibilidad de los Resultados , Programas Informáticos , Flujo de TrabajoRESUMEN
Protein quantification using data-independent acquisition methods such as SWATH-MS most commonly relies on spectral matching to a reference MS/MS assay library. To enable deep proteome coverage and efficient use of existing data, in silico approaches have been described to use archived or publicly available large reference spectral libraries for spectral matching. Since implicit in the use of larger libraries is the increasing likelihood of false-discoveries, new workflows are needed to ensure high confidence in protein matching under these conditions. We present a workflow which introduces a range of filters and thresholds aimed at increasing confidence that the resulting proteins are reliably detected and their quantitation is consistent and reproducible. We demonstrated the workflow using extended libraries with SWATH data from human plasma samples and yeast-spiked human K562 cell lysate digest.
Asunto(s)
Proteínas Sanguíneas/metabolismo , Biblioteca de Péptidos , Proteoma/análisis , Proteómica/métodos , Espectrometría de Masas en Tándem/normas , Adolescente , Adulto , Niño , Preescolar , Cromatografía Liquida/normas , Humanos , Lactante , Recién Nacido , Estándares de Referencia , Programas Informáticos , Flujo de Trabajo , Adulto JovenRESUMEN
Multiple testing corrections are a useful tool for restricting the FDR, but can be blunt in the context of low power, as we demonstrate by a series of simple simulations. Unfortunately, in proteomics experiments low power can be common, driven by proteomics-specific issues like small effects due to ratio compression, and few replicates due to reagent high cost, instrument time availability and other issues; in such situations, most multiple testing corrections methods, if used with conventional thresholds, will fail to detect any true positives even when many exist. In this low power, medium scale situation, other methods such as effect size considerations or peptide-level calculations may be a more effective option, even if they do not offer the same theoretical guarantee of a low FDR. Thus, we aim to highlight in this article that proteomics presents some specific challenges to the standard multiple testing corrections methods, which should be employed as a useful tool but not be regarded as a required rubber stamp.
Asunto(s)
Biología Computacional/métodos , Proteómica/métodos , Algoritmos , Proteínas/análisis , Espectrometría de Masas en TándemRESUMEN
A standardized procedure for label-free nano-LC-SRM analysis of 32 high-medium abundance proteins from nondepleted human plasma was established and SRM data were acquired on 45 separate days for a control sample that was independently prepared on 39 distinct dates over an 18-month period (542 days). This case study enabled us to assess quantitative variance associated with nano-LC-SRM plasma analysis, mimicking experimental conditions that would be experienced with clinical trial biomarker studies. We assessed sample preparation variability attributed to different technicians and sample storage stability. Instrument performance varied over the 18-month period requiring ion path cleaning, so we assessed the impact of declining performance on specific peptide ion sensitivity and evaluated how various data normalization strategies could compensate for these changes. Our analysis demonstrated that while sample preparation was the main contributor for data variances when MS data were acquired within days, variability in SRM sensitivity was a far greater source of variance when data were acquired over a long period. The overall median multiplexed assay CV was 13% over the 18-month period. This case study is illustrative of large-scale plasma biomarker studies using nano-LC-SRM over extended periods and highlights aspects of bioanalysis that require careful attention to ensure reliable quantitation.
Asunto(s)
Biomarcadores/sangre , Cromatografía Liquida/métodos , Espectrometría de Masas/métodos , Proteoma/análisis , HumanosRESUMEN
In this protocol we describe our workflow for analyzing complex, multi-condition quantitative proteomic experiments, with the aim to extract biological insights. The tool we use is an R package, PloGO2, contributed to Bioconductor, which we can optionally precede by running correlation network analysis with WGCNA. We describe the data required and the steps we take, including detailed code examples and outputs explanation. The package was designed to generate gene ontology or pathway summaries for many data subsets at the same time, visualize protein abundance summaries for each biological category examined, help determine enriched protein subsets by comparing them all to a reference set, and suggest key highly correlated hub proteins, if the optional network analysis is employed.
Asunto(s)
Proteínas , Proteómica , Proteómica/métodos , Ontología de Genes , Flujo de TrabajoRESUMEN
Extracellular deposits of the amyloid-beta peptide (Aß) are known as the main pathological hallmark of Alzheimer's disease. In Alzheimer's disease, neurons are injured and die throughout the brain, a process in which Aß neurotoxicity is considered to play an important role. However, the molecular mechanisms underlying Aß toxicity that lead to neurodegeneration are not clearly established. Here we have elucidated the molecular pathways and networks which are impacted by Aß in neurons using SH-SY5Y human neuroblastoma cells as a model. These cells were treated with Aß1-42 peptides to study changes in biochemical networks using tandem mass tag labeled quantitative proteomic technique followed by computational analysis of the data. The molecular impacts of Aß on cells were evident in a time- and dose-dependent manner, albeit the duration of treatment induced greater differential changes in cellular proteome compared to the effects of concentration. Aß induced early changes in proteins associated with lysosomes, collagen chain trimerization and extracellular matrix receptor interaction, complement and coagulation cascade, oxidative stress induced senescence, ribosome biogenesis, regulation of insulin-like growth factor transport and uptake by insulin-like growth factor-binding protein. These novel findings provide molecular insights on the effects of Aß on neurons, with implications for better understanding the impacts of Aß on early neurodegeneration in Alzheimer's disease pathology.
RESUMEN
Amyloid ß (Aß) accumulation and its aggregation is characteristic molecular feature of the development of Alzheimer's disease (AD). More recently, Aß has been suggested to be associated with retinal pathology associated with AD, glaucoma and drusen deposits in age related macular degeneration (AMD). In this study, we investigated the proteins and biochemical networks that are affected by Aß in the 661 W photoreceptor cells in culture. Time and dose dependent effects of Aß on the photoreceptor cells were determined utilizing tandem mass tag (TMT) labeling-based quantitative mass-spectrometric approach. Bioinformatic analysis of the data revealed concentration and time dependent effects of the Aß peptide stimulation on various key biochemical pathways that might be involved in mediating the toxicity effects of the peptide. We identified increased Tau phosphorylation, GSK3ß dysregulation and reduced cell viability in cells treated with Aß in a dose and time dependent manner. This study has delineated for the first-time molecular networks in photoreceptor cells that are impacted early upon Aß treatment and contrasted the findings with a longer-term treatment effect. Proteins associated with ribosomal machinery homeostasis, mitochondrial function and cytoskeletal organization were affected in the initial stages of Aß exposure, which may provide key insights into AD effects on the photoreceptors and specific molecular changes induced by Aß peptide.
RESUMEN
Increased amyloid ß (Aß) aggregation is a hallmark feature of Alzheimer's disease (AD) pathology. The APP/PS1 mouse model of AD exhibits accumulation of Aß in the retina and demonstrates reduced retinal function and other degenerative changes. The overall molecular effects of AD pathology on the retina remain undetermined. Using a proteomics approach, this study assessed the molecular effects of Aß accumulation and progression of AD pathology on the retina. Retinal tissues from younger (2.5 months) and older 8-month APP/PS1 mice were analysed for protein expression changes. A multiplexed proteomics approach using chemical isobaric tandem mass tags was applied followed by functional and protein-protein interaction analyses using Ingenuity pathway (IPA) and STRING computational tools. We identified approximately 2000 proteins each in the younger (upregulated 50; downregulated 36) and older set of APP/PS1 (upregulated 85; downregulated 79) mice retinas. Amyloid precursor protein (APP) was consistently upregulated two to threefold in both younger and older retinas (p < 0.0001). Mass spectrometry data further revealed that older APP/PS1 mice retinas had elevated levels of proteolytic enzymes cathepsin D, presenilin 2 and nicastrin that are associated with APP processing. Increased levels of proteasomal proteins Psma5, Psmd3 and Psmb2 were also observed in the older AD retinas. In contrast to the younger animals, significant downregulation of protein synthesis and elongation associated proteins such as Eef1a1, Rpl35a, Mrpl2 and Eef1e1 (p < 0.04) was identified in the older mice retinas. This study reports for the first time that not only old but also young APP/PS1 animals demonstrate increased amyloid protein levels in their retinas. Quantitative proteomics reveals new molecular insights which may represent a cellular response to clear amyloid build-up. Further, downregulation of ribosomal proteins involved in protein biosynthesis was observed which might be considered a toxicity effect.
Asunto(s)
Enfermedad de Alzheimer/metabolismo , Enfermedad de Alzheimer/patología , Biosíntesis de Proteínas , Proteolisis , Retina/metabolismo , Retina/patología , Regulación hacia Arriba , Envejecimiento/patología , Precursor de Proteína beta-Amiloide/metabolismo , Animales , Modelos Animales de Enfermedad , Regulación hacia Abajo , Humanos , Ratones Endogámicos C57BL , Ratones Transgénicos , Presenilina-1/genética , Complejo de la Endopetidasa Proteasomal/metabolismo , Mapas de Interacción de Proteínas , Proteómica , Cadena B de alfa-Cristalina/metabolismoRESUMEN
In this chapter we describe the workflow we use for labeled quantitative proteomics analysis using tandem mass tags (TMT) starting with the sample preparation and ending with the multivariate analysis of the resulting data. We detail the step-by-step process from sample processing, labeling, fractionation, and data processing using Proteome Discoverer through to data analysis and interpretation in the context of a multi-run experiment. The final analysis and data interpretation rely on an R package we call TMTPrepPro, which are deployed on a local GenePattern server, and used for generating various outputs which are also outlined herein.