Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Cell ; 184(19): 5031-5052.e26, 2021 09 16.
Article in English | MEDLINE | ID: mdl-34534465

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with poor patient survival. Toward understanding the underlying molecular alterations that drive PDAC oncogenesis, we conducted comprehensive proteogenomic analysis of 140 pancreatic cancers, 67 normal adjacent tissues, and 9 normal pancreatic ductal tissues. Proteomic, phosphoproteomic, and glycoproteomic analyses were used to characterize proteins and their modifications. In addition, whole-genome sequencing, whole-exome sequencing, methylation, RNA sequencing (RNA-seq), and microRNA sequencing (miRNA-seq) were performed on the same tissues to facilitate an integrated proteogenomic analysis and determine the impact of genomic alterations on protein expression, signaling pathways, and post-translational modifications. To ensure robust downstream analyses, tumor neoplastic cellularity was assessed via multiple orthogonal strategies using molecular features and verified via pathological estimation of tumor cellularity based on histological review. This integrated proteogenomic characterization of PDAC will serve as a valuable resource for the community, paving the way for early detection and identification of novel therapeutic targets.


Subject(s)
Adenocarcinoma/genetics , Carcinoma, Pancreatic Ductal/genetics , Pancreatic Neoplasms/genetics , Proteogenomics , Adenocarcinoma/diagnosis , Adult , Aged , Aged, 80 and over , Algorithms , Carcinoma, Pancreatic Ductal/diagnosis , Cohort Studies , Endothelial Cells/metabolism , Epigenesis, Genetic , Female , Gene Dosage , Genome, Human , Glycolysis , Glycoproteins/biosynthesis , Humans , Male , Middle Aged , Molecular Targeted Therapy , Pancreatic Neoplasms/diagnosis , Phenotype , Phosphoproteins/metabolism , Phosphorylation , Prognosis , Protein Kinases/metabolism , Proteome/metabolism , Substrate Specificity , Transcriptome/genetics
2.
Mol Cell Proteomics ; 23(1): 100687, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38029961

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal cancer types, partly because it is frequently identified at an advanced stage, when surgery is no longer feasible. Therefore, early detection using minimally invasive methods such as blood tests may improve outcomes. However, studies to discover molecular signatures for the early detection of PDAC using blood tests have only been marginally successful. In the current study, a quantitative glycoproteomic approach via data-independent acquisition mass spectrometry was utilized to detect glycoproteins in 29 patient-matched PDAC tissues and sera. A total of 892 N-linked glycopeptides originating from 141 glycoproteins had PDAC-associated changes beyond normal variation. We further evaluated the specificity of these serum-detectable glycoproteins by comparing their abundance in 53 independent PDAC patient sera and 65 cancer-free controls. The PDAC tissue-associated glycoproteins we have identified represent an inventory of serum-detectable PDAC-associated glycoproteins as candidate biomarkers that can be potentially used for the detection of PDAC using blood tests.


Subject(s)
Carcinoma, Pancreatic Ductal , Pancreatic Neoplasms , Humans , Biomarkers, Tumor/metabolism , Pancreatic Neoplasms/metabolism , Carcinoma, Pancreatic Ductal/metabolism , Glycoproteins , Mass Spectrometry
3.
Anal Chem ; 96(25): 10145-10151, 2024 06 25.
Article in English | MEDLINE | ID: mdl-38869158

ABSTRACT

Rapid development and wide adoption of mass spectrometry-based glycoproteomic technologies have empowered scientists to study proteins and protein glycosylation in complex samples on a large scale. This progress has also created unprecedented challenges for individual laboratories to store, manage, and analyze proteomic and glycoproteomic data, both in the cost for proprietary software and high-performance computing and in the long processing time that discourages on-the-fly changes of data processing settings required in explorative and discovery analysis. We developed an open-source, cloud computing-based pipeline, MS-PyCloud, with graphical user interface (GUI), for proteomic and glycoproteomic data analysis. The major components of this pipeline include data file integrity validation, MS/MS database search for spectral assignments to peptide sequences, false discovery rate estimation, protein inference, quantitation of global protein levels, and specific glycan-modified glycopeptides as well as other modification-specific peptides such as phosphorylation, acetylation, and ubiquitination. To ensure the transparency and reproducibility of data analysis, MS-PyCloud includes open-source software tools with comprehensive testing and versioning for spectrum assignments. Leveraging public cloud computing infrastructure via Amazon Web Services (AWS), MS-PyCloud scales seamlessly based on analysis demand to achieve fast and efficient performance. Application of the pipeline to the analysis of large-scale LC-MS/MS data sets demonstrated the effectiveness and high performance of MS-PyCloud. The software can be downloaded at https://github.com/huizhanglab-jhu/ms-pycloud.


Subject(s)
Proteomics , Proteomics/methods , Software , Tandem Mass Spectrometry/methods , Cloud Computing , Glycoproteins/analysis , Humans
4.
Clin Proteomics ; 21(1): 60, 2024 Oct 24.
Article in English | MEDLINE | ID: mdl-39443867

ABSTRACT

BACKGROUND: Spatial proteomics seeks to understand the spatial organization of proteins in tissues or at different subcellular localization in their native environment. However, capturing the spatial organization of proteins is challenging. Here, we present an innovative approach termed Spatial Proteomics through On-site Tissue-protein-labeling (SPOT), which combines the direct labeling of tissue proteins in situ on a slide and quantitative mass spectrometry for the profiling of spatially-resolved proteomics. MATERIALS AND METHODS: Efficacy of direct TMT labeling was investigated using seven types of sagittal mouse brain slides, including frozen tissues without staining, formalin-fixed paraffin-embedded (FFPE) tissues without staining, deparaffinized FFPE tissues, deparaffinized and decrosslinked FFPE tissues, and tissues with hematoxylin & eosin (H&E) staining, hematoxylin (H) staining, eosin (E) staining. The ability of SPOT to profile proteomes at a spatial resolution was further evaluated on a horizontal mouse brain slide with direct TMT labeling at eight different mouse brain regions. Finally, SPOT was applied to human prostate cancer tissues as well as a tissue microarray (TMA), where TMT tags were meticulously applied to confined regions based on the pathological annotations. After on-site direct tissue-protein-labeling, tissues were scraped off the slides and subject to standard TMT-based quantitative proteomics analysis. RESULTS: Tissue proteins on different types of mouse brain slides could be directly labeled with TMT tags. Moreover, the versatility of our direct-labeling approach extended to discerning specific mouse brain regions based on quantitative outcomes. The SPOT was further applied on both frozen tissues on slides and FFPE tissues on TMAs from prostate cancer tissues, where a distinct proteomic profile was observed among the regions with different Gleason scores. CONCLUSIONS: SPOT is a robust and versatile technique that allows comprehensive profiling of spatially-resolved proteomics across diverse types of tissue slides to advance our understanding of intricate molecular landscapes.

5.
Anal Chem ; 92(2): 1680-1686, 2020 01 21.
Article in English | MEDLINE | ID: mdl-31859482

ABSTRACT

Aberrant glycosylation has been shown to associate with disease progression, and with glycoproteins representing the major protein component of biological fluids this makes them attractive targets for disease monitoring. Leveraging glycoproteomic analysis via mass spectrometry (MS) could provide the insight into the altered glycosylation patterns that relate to disease progression. However, investigation of large sample cohorts requires rapid, efficient, and highly reproducible sample preparation. To address the limitation, we developed a high-throughput method for characterizing glycans, glycosites, and intact glycopeptides (IGPs) derived from N-linked glycoproteins. We combined disparate peptide enrichment strategies (i.e., hydrophilic and hydrophobic) and a liquid handling platform allowing for a high throughput and rapid enrichment of IGP in a 96-well plate format. The C18/MAX-Tip workflow reduced sample processing time and facilitated the selective enrichment of IGPs from complex samples. Furthermore, our approach enabled the analysis of deglycosylated peptides and glycans from enriched IGPs following PNGase F digest. Following development and optimization of the C18/MAX-Tip methodology using the standard glycoprotein, fetuin, we investigated normal urine samples to obtain N-linked glycoprotein information. Together, our method enables a high-throughput enrichment of glycan, glycosites, and IGPs from biological samples.


Subject(s)
Glycopeptides/urine , Glycoproteins/chemistry , Polysaccharides/urine , Automation , Glycosylation , Humans , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization , Tandem Mass Spectrometry
6.
Nucleic Acids Res ; 44(W1): W575-80, 2016 Jul 08.
Article in English | MEDLINE | ID: mdl-27084943

ABSTRACT

MAGIC-web is the first web server, to the best of our knowledge, that performs both untargeted and targeted analyses of mass spectrometry-based glycoproteomics data for site-specific N-linked glycoprotein identification. The first two modules, MAGIC and MAGIC+, are designed for untargeted and targeted analysis, respectively. MAGIC is implemented with our previously proposed novel Y1-ion pattern matching method, which adequately detects Y1- and Y0-ion without prior information of proteins and glycans, and then generates in silico MS(2) spectra that serve as input to a database search engine (e.g. Mascot) to search against a large-scale protein sequence database. On top of that, the newly implemented MAGIC+ allows users to determine glycopeptide sequences using their own protein sequence file. The third module, Reports Integrator, provides the service of combining protein identification results from Mascot and glycan-related information from MAGIC-web to generate a complete site-specific protein-glycan summary report. The last module, Glycan Search, is designed for the users who are interested in finding possible glycan structures with specific numbers and types of monosaccharides. The results from MAGIC, MAGIC+ and Reports Integrator can be downloaded via provided links whereas the annotated spectra and glycan structures can be visualized in the browser. MAGIC-web is accessible from http://ms.iis.sinica.edu.tw/MAGIC-web/index.html.


Subject(s)
Glycoproteins/analysis , Glycoproteins/chemistry , Internet , Polysaccharides/analysis , Polysaccharides/chemistry , Software , Computer Simulation , Databases, Protein , Glycopeptides/analysis , Glycopeptides/chemistry , Humans , Mass Spectrometry , Proteomics , Search Engine , User-Computer Interface , Web Browser
7.
Anal Chem ; 87(4): 2143-51, 2015 Feb 17.
Article in English | MEDLINE | ID: mdl-25543920

ABSTRACT

Metabolite identification remains a bottleneck in mass spectrometry (MS)-based metabolomics. Currently, this process relies heavily on tandem mass spectrometry (MS/MS) spectra generated separately for peaks of interest identified from previous MS runs. Such a delayed and labor-intensive procedure creates a barrier to automation. Further, information embedded in MS data has not been used to its full extent for metabolite identification. Multimers, adducts, multiply charged ions, and fragments of given metabolites occupy a substantial proportion (40-80%) of the peaks of a quantitation result. However, extensive information on these derivatives, especially fragments, may facilitate metabolite identification. We propose a procedure with automation capability to group and annotate peaks associated with the same metabolite in the quantitation results of opposite modes and to integrate this information for metabolite identification. In addition to the conventional mass and isotope ratio matches, we would match annotated fragments with low-energy MS/MS spectra in public databases. For identification of metabolites without accessible MS/MS spectra, we have developed characteristic fragment and common substructure matches. The accuracy and effectiveness of the procedure were evaluated using one public and two in-house liquid chromatography-mass spectrometry (LC-MS) data sets. The procedure accurately identified 89% of 28 standard metabolites with derivative ions in the data sets. With respect to effectiveness, the procedure confidently identified the correct chemical formula of at least 42% of metabolites with derivative ions via MS/MS spectrum, characteristic fragment, and common substructure matches. The confidence level was determined according to the fulfilled identification criteria of various matches and relative retention time.


Subject(s)
Metabolomics/methods , Tandem Mass Spectrometry/methods , Animals , Chromatography, Liquid/methods , Diabetes Mellitus, Experimental/metabolism , Diet , Ions/analysis , Ions/metabolism , Metabolome , Mice , Rats
8.
Anal Chem ; 87(4): 2466-73, 2015 Feb 17.
Article in English | MEDLINE | ID: mdl-25629585

ABSTRACT

Glycosylation is a highly complex modification influencing the functions and activities of proteins. Interpretation of intact glycopeptide spectra is crucial but challenging. In this paper, we present a mass spectrometry-based automated glycopeptide identification platform (MAGIC) to identify peptide sequences and glycan compositions directly from intact N-linked glycopeptide collision-induced-dissociation spectra. The identification of the Y1 (peptideY0 + GlcNAc) ion is critical for the correct analysis of unknown glycoproteins, especially without prior knowledge of the proteins and glycans present in the sample. To ensure accurate Y1-ion assignment, we propose a novel algorithm called Trident that detects a triplet pattern corresponding to [Y0, Y1, Y2] or [Y0-NH3, Y0, Y1] from the fragmentation of the common trimannosyl core of N-linked glycopeptides. To facilitate the subsequent peptide sequence identification by common database search engines, MAGIC generates in silico spectra by overwriting the original precursor with the naked peptide m/z and removing all of the glycan-related ions. Finally, MAGIC computes the glycan compositions and ranks them. For the model glycoprotein horseradish peroxidase (HRP) and a 5-glycoprotein mixture, a 2- to 31-fold increase in the relative intensities of the peptide fragments was achieved, which led to the identification of 7 tryptic glycopeptides from HRP and 16 glycopeptides from the mixture via Mascot. In the HeLa cell proteome data set, MAGIC processed over a thousand MS(2) spectra in 3 min on a PC and reported 36 glycopeptides from 26 glycoproteins. Finally, a remarkable false discovery rate of 0 was achieved on the N-glycosylation-free Escherichia coli data set. MAGIC is available at http://ms.iis.sinica.edu.tw/COmics/Software_MAGIC.html .


Subject(s)
Algorithms , Computational Biology , Glycopeptides/analysis , Software , Automation , Databases, Factual , Escherichia coli/chemistry , Glycopeptides/chemistry , HeLa Cells , Humans
9.
bioRxiv ; 2024 Jul 07.
Article in English | MEDLINE | ID: mdl-39005476

ABSTRACT

In order to advance our understanding of precancers in the pancreas, 69 pancreatic intraductal papillary neoplasms (IPNs), including 64 intraductal papillary mucinous neoplasms (IPMNs) and 5 intraductal oncocytic papillary neoplasms (IOPNs), 32 pancreatic cyst fluid samples, 104 invasive pancreatic ductal adenocarcinomas (PDACs), 43 normal adjacent tissues (NATs), and 76 macro-dissected normal pancreatic ducts (NDs) were analyzed by mass spectrometry. A total of 10,246 proteins and 22,284 glycopeptides were identified in all tissue samples, and 756 proteins with more than 1.5-fold increase in abundance in IPMNs relative to NDs were identified, 45% of which were also identified in cyst fluids. The over-expression of selected proteins was validated by immunolabeling. Proteins and glycoproteins overexpressed in IPMNs included those involved in glycan biosynthesis and the immune system. In addition, multiomics clustering identified two subtypes of IPMNs. This study provides a foundation for understanding tumor progression and targets for earlier detection and therapies. Significance: This multilevel characterization of intraductal papillary neoplasms of the pancreas provides a foundation for understanding the changes in protein and glycoprotein expression during the progression from normal duct to intraductal papillary neoplasm, and to invasive pancreatic carcinoma, providing a foundation for informed approaches to earlier detection and treatment.

10.
Cell Rep ; 42(5): 112409, 2023 05 30.
Article in English | MEDLINE | ID: mdl-37074911

ABSTRACT

Clear cell renal cell carcinoma (ccRCC), a common form of RCC, is responsible for the high mortality rate of kidney cancer. Dysregulations of glycoproteins have been shown to associate with ccRCC. However, the molecular mechanism has not been well characterized. Here, a comprehensive glycoproteomic analysis is conducted using 103 tumors and 80 paired normal adjacent tissues. Altered glycosylation enzymes and corresponding protein glycosylation are observed, while two of the major ccRCC mutations, BAP1 and PBRM1, show distinct glycosylation profiles. Additionally, inter-tumor heterogeneity and cross-correlation between glycosylation and phosphorylation are observed. The relation of glycoproteomic features to genomic, transcriptomic, proteomic, and phosphoproteomic changes shows the role of glycosylation in ccRCC development with potential for therapeutic interventions. This study reports a large-scale tandem mass tag (TMT)-based quantitative glycoproteomic analysis of ccRCC that can serve as a valuable resource for the community.


Subject(s)
Carcinoma, Renal Cell , Carcinoma , Kidney Neoplasms , Humans , Carcinoma, Renal Cell/metabolism , Proteomics , Kidney Neoplasms/metabolism , Genomics , Phosphorylation
11.
Nat Commun ; 13(1): 3910, 2022 07 07.
Article in English | MEDLINE | ID: mdl-35798744

ABSTRACT

Core fucosylation of N-linked glycoproteins has been linked to the functions of glycoproteins in physiological and pathological processes. However, quantitative characterization of core fucosylation remains challenging due to the complexity and heterogeneity of N-linked glycosylation. Here we report a mass spectrometry-based method that employs sequential treatment of intact glycopeptides with enzymes (STAGE) to analyze site-specific core fucosylation of glycoproteins. The STAGE method utilizes Endo F3 followed by PNGase F treatment to generate mass signatures for glycosites that are formerly modified by core fucosylated N-linked glycans. We benchmark the STAGE method and use it to characterize site specific core fucosylation of glycoproteins from human hepatocellular carcinoma and pancreatic ductal adenocarcinoma, resulting in the identification of 1130 and 782 core fucosylated glycosites, respectively. These results indicate that our STAGE method enables quantitative characterization of core fucosylation events from complex protein mixtures, which may benefit our understanding of core fucosylation functions in various diseases.


Subject(s)
Glycopeptides , Liver Neoplasms , Fucose/metabolism , Glycopeptides/chemistry , Glycoproteins/metabolism , Glycosylation , Humans , Mass Spectrometry/methods
12.
Theranostics ; 10(26): 11892-11907, 2020.
Article in English | MEDLINE | ID: mdl-33204318

ABSTRACT

Background: There is an urgent need for the detection of aggressive prostate cancer. Glycoproteins play essential roles in cancer development, while urine is a noninvasive and easily obtainable biological fluid that contains secretory glycoproteins from the urogenital system. Therefore, here we aimed to identify urinary glycoproteins that are capable of differentiating aggressive from non-aggressive prostate cancer. Methods: Quantitative mass spectrometry data of glycopeptides from a discovery cohort comprised of 74 aggressive (Gleason score ≥8) and 68 non-aggressive (Gleason score = 6) prostate cancer urine specimens were acquired via a data independent acquisition approach. The glycopeptides showing distinct expression profiles in aggressive relative to non-aggressive prostate cancer were further evaluated for their performance in distinguishing the two groups either individually or in combination with others using repeated 5-fold cross validation with logistic regression to build predictive models. Predictive models showing good performance from the discovery cohort were further evaluated using a validation cohort. Results: Among the 20 candidate glycoproteins, urinary ACPP outperformed the other candidates. Urinary ACPP can also serve as an adjunct to serum PSA to further improve the discrimination power for aggressive prostate cancer (AUC= 0.82, 95% confidence interval 0.75 to 0.89). A three-signature panel including urinary ACPP, urinary CLU, and serum PSA displayed the ability to distinguish aggressive prostate cancer from non-aggressive prostate cancer with an AUC of 0.86 (95% confidence interval 0.8 to 0.92). Another three-signature panel containing urinary ACPP, urinary LOX, and serum PSA also demonstrated its ability in recognizing aggressive prostate cancer (AUC=0.82, 95% confidence interval 0.75 to 0.9). Moreover, consistent performance was observed from each panel when evaluated using a validation cohort. Conclusion: We have identified glycopeptides of urinary glycoproteins associated with aggressive prostate cancer using a quantitative mass spectrometry-based glycoproteomic approach and demonstrated their potential to serve as noninvasive urinary glycoprotein biomarkers worthy of further validation by a multi-center study.


Subject(s)
Biomarkers, Tumor/urine , Glycoproteins/urine , Prostatic Neoplasms/diagnosis , Adult , Aged , Biomarkers, Tumor/blood , Cohort Studies , Digital Rectal Examination , Feasibility Studies , Humans , Kallikreins/blood , Male , Middle Aged , Neoplasm Grading , Prostate-Specific Antigen/blood , Prostatic Neoplasms/blood , Prostatic Neoplasms/urine , ROC Curve
13.
J Hematol Oncol ; 13(1): 170, 2020 12 07.
Article in English | MEDLINE | ID: mdl-33287876

ABSTRACT

BACKGROUND: Proteomic characterization of cancers is essential for a comprehensive understanding of key molecular aberrations. However, proteomic profiling of a large cohort of cancer tissues is often limited by the conventional approaches. METHODS: We present a proteomic landscape of 16 major types of human cancer, based on the analysis of 126 treatment-naïve primary tumor tissues, 94 tumor-matched normal adjacent tissues, and 12 normal tissues, using mass spectrometry-based data-independent acquisition approach. RESULTS: In our study, a total of 8527 proteins were mapped to brain, head and neck, breast, lung (both small cell and non-small cell lung cancers), esophagus, stomach, pancreas, liver, colon, kidney, bladder, prostate, uterus and ovary cancers, including 2458 tissue-enriched proteins. Our DIA-based proteomic approach has characterized major human cancers and identified universally expressed proteins as well as tissue-type-specific and cancer-type-specific proteins. In addition, 1139 therapeutic targetable proteins and 21 cancer/testis (CT) antigens were observed. CONCLUSIONS: Our discoveries not only advance our understanding of human cancers, but also have implications for the design of future large-scale cancer proteomic studies to assist the development of diagnostic and/or therapeutic targets in multiple cancers.


Subject(s)
Neoplasms/pathology , Proteins/analysis , Drug Discovery , Humans , Molecular Targeted Therapy , Neoplasms/diagnosis , Neoplasms/drug therapy , Neoplasms/metabolism , Proteins/metabolism , Proteome/analysis , Proteome/metabolism , Proteomics
14.
PLoS One ; 11(1): e0146112, 2016.
Article in English | MEDLINE | ID: mdl-26784691

ABSTRACT

Efficient and accurate quantitation of metabolites from LC-MS data has become an important topic. Here we present an automated tool, called iMet-Q (intelligent Metabolomic Quantitation), for label-free metabolomics quantitation from high-throughput MS1 data. By performing peak detection and peak alignment, iMet-Q provides a summary of quantitation results and reports ion abundance at both replicate level and sample level. Furthermore, it gives the charge states and isotope ratios of detected metabolite peaks to facilitate metabolite identification. An in-house standard mixture and a public Arabidopsis metabolome data set were analyzed by iMet-Q. Three public quantitation tools, including XCMS, MetAlign, and MZmine 2, were used for performance comparison. From the mixture data set, seven standard metabolites were detected by the four quantitation tools, for which iMet-Q had a smaller quantitation error of 12% in both profile and centroid data sets. Our tool also correctly determined the charge states of seven standard metabolites. By searching the mass values for those standard metabolites against Human Metabolome Database, we obtained a total of 183 metabolite candidates. With the isotope ratios calculated by iMet-Q, 49% (89 out of 183) metabolite candidates were filtered out. From the public Arabidopsis data set reported with two internal standards and 167 elucidated metabolites, iMet-Q detected all of the peaks corresponding to the internal standards and 167 metabolites. Meanwhile, our tool had small abundance variation (≤ 0.19) when quantifying the two internal standards and had higher abundance correlation (≥ 0.92) when quantifying the 167 metabolites. iMet-Q provides user-friendly interfaces and is publicly available for download at http://ms.iis.sinica.edu.tw/comics/Software_iMet-Q.html.


Subject(s)
Metabolome , Metabolomics/methods , Software , Arabidopsis/metabolism , Humans
SELECTION OF CITATIONS
SEARCH DETAIL