Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Anal Chem ; 2024 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-38869158

RESUMO

Rapid development and wide adoption of mass spectrometry-based glycoproteomic technologies have empowered scientists to study proteins and protein glycosylation in complex samples on a large scale. This progress has also created unprecedented challenges for individual laboratories to store, manage, and analyze proteomic and glycoproteomic data, both in the cost for proprietary software and high-performance computing and in the long processing time that discourages on-the-fly changes of data processing settings required in explorative and discovery analysis. We developed an open-source, cloud computing-based pipeline, MS-PyCloud, with graphical user interface (GUI), for proteomic and glycoproteomic data analysis. The major components of this pipeline include data file integrity validation, MS/MS database search for spectral assignments to peptide sequences, false discovery rate estimation, protein inference, quantitation of global protein levels, and specific glycan-modified glycopeptides as well as other modification-specific peptides such as phosphorylation, acetylation, and ubiquitination. To ensure the transparency and reproducibility of data analysis, MS-PyCloud includes open-source software tools with comprehensive testing and versioning for spectrum assignments. Leveraging public cloud computing infrastructure via Amazon Web Services (AWS), MS-PyCloud scales seamlessly based on analysis demand to achieve fast and efficient performance. Application of the pipeline to the analysis of large-scale LC-MS/MS data sets demonstrated the effectiveness and high performance of MS-PyCloud. The software can be downloaded at https://github.com/huizhanglab-jhu/ms-pycloud.

2.
Mol Cell Proteomics ; 23(1): 100687, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38029961

RESUMO

Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal cancer types, partly because it is frequently identified at an advanced stage, when surgery is no longer feasible. Therefore, early detection using minimally invasive methods such as blood tests may improve outcomes. However, studies to discover molecular signatures for the early detection of PDAC using blood tests have only been marginally successful. In the current study, a quantitative glycoproteomic approach via data-independent acquisition mass spectrometry was utilized to detect glycoproteins in 29 patient-matched PDAC tissues and sera. A total of 892 N-linked glycopeptides originating from 141 glycoproteins had PDAC-associated changes beyond normal variation. We further evaluated the specificity of these serum-detectable glycoproteins by comparing their abundance in 53 independent PDAC patient sera and 65 cancer-free controls. The PDAC tissue-associated glycoproteins we have identified represent an inventory of serum-detectable PDAC-associated glycoproteins as candidate biomarkers that can be potentially used for the detection of PDAC using blood tests.


Assuntos
Carcinoma Ductal Pancreático , Neoplasias Pancreáticas , Humanos , Biomarcadores Tumorais/metabolismo , Neoplasias Pancreáticas/metabolismo , Carcinoma Ductal Pancreático/metabolismo , Glicoproteínas , Espectrometria de Massas
3.
Cell Rep ; 42(5): 112409, 2023 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-37074911

RESUMO

Clear cell renal cell carcinoma (ccRCC), a common form of RCC, is responsible for the high mortality rate of kidney cancer. Dysregulations of glycoproteins have been shown to associate with ccRCC. However, the molecular mechanism has not been well characterized. Here, a comprehensive glycoproteomic analysis is conducted using 103 tumors and 80 paired normal adjacent tissues. Altered glycosylation enzymes and corresponding protein glycosylation are observed, while two of the major ccRCC mutations, BAP1 and PBRM1, show distinct glycosylation profiles. Additionally, inter-tumor heterogeneity and cross-correlation between glycosylation and phosphorylation are observed. The relation of glycoproteomic features to genomic, transcriptomic, proteomic, and phosphoproteomic changes shows the role of glycosylation in ccRCC development with potential for therapeutic interventions. This study reports a large-scale tandem mass tag (TMT)-based quantitative glycoproteomic analysis of ccRCC that can serve as a valuable resource for the community.


Assuntos
Carcinoma de Células Renais , Carcinoma , Neoplasias Renais , Humanos , Carcinoma de Células Renais/metabolismo , Proteômica , Neoplasias Renais/metabolismo , Genômica , Fosforilação
4.
Nat Commun ; 13(1): 3910, 2022 07 07.
Artigo em Inglês | MEDLINE | ID: mdl-35798744

RESUMO

Core fucosylation of N-linked glycoproteins has been linked to the functions of glycoproteins in physiological and pathological processes. However, quantitative characterization of core fucosylation remains challenging due to the complexity and heterogeneity of N-linked glycosylation. Here we report a mass spectrometry-based method that employs sequential treatment of intact glycopeptides with enzymes (STAGE) to analyze site-specific core fucosylation of glycoproteins. The STAGE method utilizes Endo F3 followed by PNGase F treatment to generate mass signatures for glycosites that are formerly modified by core fucosylated N-linked glycans. We benchmark the STAGE method and use it to characterize site specific core fucosylation of glycoproteins from human hepatocellular carcinoma and pancreatic ductal adenocarcinoma, resulting in the identification of 1130 and 782 core fucosylated glycosites, respectively. These results indicate that our STAGE method enables quantitative characterization of core fucosylation events from complex protein mixtures, which may benefit our understanding of core fucosylation functions in various diseases.


Assuntos
Glicopeptídeos , Neoplasias Hepáticas , Fucose/metabolismo , Glicopeptídeos/química , Glicoproteínas/metabolismo , Glicosilação , Humanos , Espectrometria de Massas/métodos
5.
Cell ; 184(19): 5031-5052.e26, 2021 09 16.
Artigo em Inglês | MEDLINE | ID: mdl-34534465

RESUMO

Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with poor patient survival. Toward understanding the underlying molecular alterations that drive PDAC oncogenesis, we conducted comprehensive proteogenomic analysis of 140 pancreatic cancers, 67 normal adjacent tissues, and 9 normal pancreatic ductal tissues. Proteomic, phosphoproteomic, and glycoproteomic analyses were used to characterize proteins and their modifications. In addition, whole-genome sequencing, whole-exome sequencing, methylation, RNA sequencing (RNA-seq), and microRNA sequencing (miRNA-seq) were performed on the same tissues to facilitate an integrated proteogenomic analysis and determine the impact of genomic alterations on protein expression, signaling pathways, and post-translational modifications. To ensure robust downstream analyses, tumor neoplastic cellularity was assessed via multiple orthogonal strategies using molecular features and verified via pathological estimation of tumor cellularity based on histological review. This integrated proteogenomic characterization of PDAC will serve as a valuable resource for the community, paving the way for early detection and identification of novel therapeutic targets.


Assuntos
Adenocarcinoma/genética , Carcinoma Ductal Pancreático/genética , Neoplasias Pancreáticas/genética , Proteogenômica , Adenocarcinoma/diagnóstico , Adulto , Idoso , Idoso de 80 Anos ou mais , Algoritmos , Carcinoma Ductal Pancreático/diagnóstico , Estudos de Coortes , Células Endoteliais/metabolismo , Epigênese Genética , Feminino , Dosagem de Genes , Genoma Humano , Glicólise , Glicoproteínas/biossíntese , Humanos , Masculino , Pessoa de Meia-Idade , Terapia de Alvo Molecular , Neoplasias Pancreáticas/diagnóstico , Fenótipo , Fosfoproteínas/metabolismo , Fosforilação , Prognóstico , Proteínas Quinases/metabolismo , Proteoma/metabolismo , Especificidade por Substrato , Transcriptoma/genética
6.
J Hematol Oncol ; 13(1): 170, 2020 12 07.
Artigo em Inglês | MEDLINE | ID: mdl-33287876

RESUMO

BACKGROUND: Proteomic characterization of cancers is essential for a comprehensive understanding of key molecular aberrations. However, proteomic profiling of a large cohort of cancer tissues is often limited by the conventional approaches. METHODS: We present a proteomic landscape of 16 major types of human cancer, based on the analysis of 126 treatment-naïve primary tumor tissues, 94 tumor-matched normal adjacent tissues, and 12 normal tissues, using mass spectrometry-based data-independent acquisition approach. RESULTS: In our study, a total of 8527 proteins were mapped to brain, head and neck, breast, lung (both small cell and non-small cell lung cancers), esophagus, stomach, pancreas, liver, colon, kidney, bladder, prostate, uterus and ovary cancers, including 2458 tissue-enriched proteins. Our DIA-based proteomic approach has characterized major human cancers and identified universally expressed proteins as well as tissue-type-specific and cancer-type-specific proteins. In addition, 1139 therapeutic targetable proteins and 21 cancer/testis (CT) antigens were observed. CONCLUSIONS: Our discoveries not only advance our understanding of human cancers, but also have implications for the design of future large-scale cancer proteomic studies to assist the development of diagnostic and/or therapeutic targets in multiple cancers.


Assuntos
Neoplasias/patologia , Proteínas/análise , Descoberta de Drogas , Humanos , Terapia de Alvo Molecular , Neoplasias/diagnóstico , Neoplasias/tratamento farmacológico , Neoplasias/metabolismo , Proteínas/metabolismo , Proteoma/análise , Proteoma/metabolismo , Proteômica
7.
Theranostics ; 10(26): 11892-11907, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33204318

RESUMO

Background: There is an urgent need for the detection of aggressive prostate cancer. Glycoproteins play essential roles in cancer development, while urine is a noninvasive and easily obtainable biological fluid that contains secretory glycoproteins from the urogenital system. Therefore, here we aimed to identify urinary glycoproteins that are capable of differentiating aggressive from non-aggressive prostate cancer. Methods: Quantitative mass spectrometry data of glycopeptides from a discovery cohort comprised of 74 aggressive (Gleason score ≥8) and 68 non-aggressive (Gleason score = 6) prostate cancer urine specimens were acquired via a data independent acquisition approach. The glycopeptides showing distinct expression profiles in aggressive relative to non-aggressive prostate cancer were further evaluated for their performance in distinguishing the two groups either individually or in combination with others using repeated 5-fold cross validation with logistic regression to build predictive models. Predictive models showing good performance from the discovery cohort were further evaluated using a validation cohort. Results: Among the 20 candidate glycoproteins, urinary ACPP outperformed the other candidates. Urinary ACPP can also serve as an adjunct to serum PSA to further improve the discrimination power for aggressive prostate cancer (AUC= 0.82, 95% confidence interval 0.75 to 0.89). A three-signature panel including urinary ACPP, urinary CLU, and serum PSA displayed the ability to distinguish aggressive prostate cancer from non-aggressive prostate cancer with an AUC of 0.86 (95% confidence interval 0.8 to 0.92). Another three-signature panel containing urinary ACPP, urinary LOX, and serum PSA also demonstrated its ability in recognizing aggressive prostate cancer (AUC=0.82, 95% confidence interval 0.75 to 0.9). Moreover, consistent performance was observed from each panel when evaluated using a validation cohort. Conclusion: We have identified glycopeptides of urinary glycoproteins associated with aggressive prostate cancer using a quantitative mass spectrometry-based glycoproteomic approach and demonstrated their potential to serve as noninvasive urinary glycoprotein biomarkers worthy of further validation by a multi-center study.


Assuntos
Biomarcadores Tumorais/urina , Glicoproteínas/urina , Neoplasias da Próstata/diagnóstico , Adulto , Idoso , Biomarcadores Tumorais/sangue , Estudos de Coortes , Exame Retal Digital , Estudos de Viabilidade , Humanos , Calicreínas/sangue , Masculino , Pessoa de Meia-Idade , Gradação de Tumores , Antígeno Prostático Específico/sangue , Neoplasias da Próstata/sangue , Neoplasias da Próstata/urina , Curva ROC
8.
Anal Chem ; 92(2): 1680-1686, 2020 01 21.
Artigo em Inglês | MEDLINE | ID: mdl-31859482

RESUMO

Aberrant glycosylation has been shown to associate with disease progression, and with glycoproteins representing the major protein component of biological fluids this makes them attractive targets for disease monitoring. Leveraging glycoproteomic analysis via mass spectrometry (MS) could provide the insight into the altered glycosylation patterns that relate to disease progression. However, investigation of large sample cohorts requires rapid, efficient, and highly reproducible sample preparation. To address the limitation, we developed a high-throughput method for characterizing glycans, glycosites, and intact glycopeptides (IGPs) derived from N-linked glycoproteins. We combined disparate peptide enrichment strategies (i.e., hydrophilic and hydrophobic) and a liquid handling platform allowing for a high throughput and rapid enrichment of IGP in a 96-well plate format. The C18/MAX-Tip workflow reduced sample processing time and facilitated the selective enrichment of IGPs from complex samples. Furthermore, our approach enabled the analysis of deglycosylated peptides and glycans from enriched IGPs following PNGase F digest. Following development and optimization of the C18/MAX-Tip methodology using the standard glycoprotein, fetuin, we investigated normal urine samples to obtain N-linked glycoprotein information. Together, our method enables a high-throughput enrichment of glycan, glycosites, and IGPs from biological samples.


Assuntos
Glicopeptídeos/urina , Glicoproteínas/química , Polissacarídeos/urina , Automação , Glicosilação , Humanos , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz , Espectrometria de Massas em Tandem
9.
Nucleic Acids Res ; 44(W1): W575-80, 2016 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-27084943

RESUMO

MAGIC-web is the first web server, to the best of our knowledge, that performs both untargeted and targeted analyses of mass spectrometry-based glycoproteomics data for site-specific N-linked glycoprotein identification. The first two modules, MAGIC and MAGIC+, are designed for untargeted and targeted analysis, respectively. MAGIC is implemented with our previously proposed novel Y1-ion pattern matching method, which adequately detects Y1- and Y0-ion without prior information of proteins and glycans, and then generates in silico MS(2) spectra that serve as input to a database search engine (e.g. Mascot) to search against a large-scale protein sequence database. On top of that, the newly implemented MAGIC+ allows users to determine glycopeptide sequences using their own protein sequence file. The third module, Reports Integrator, provides the service of combining protein identification results from Mascot and glycan-related information from MAGIC-web to generate a complete site-specific protein-glycan summary report. The last module, Glycan Search, is designed for the users who are interested in finding possible glycan structures with specific numbers and types of monosaccharides. The results from MAGIC, MAGIC+ and Reports Integrator can be downloaded via provided links whereas the annotated spectra and glycan structures can be visualized in the browser. MAGIC-web is accessible from http://ms.iis.sinica.edu.tw/MAGIC-web/index.html.


Assuntos
Glicoproteínas/análise , Glicoproteínas/química , Internet , Polissacarídeos/análise , Polissacarídeos/química , Software , Simulação por Computador , Bases de Dados de Proteínas , Glicopeptídeos/análise , Glicopeptídeos/química , Humanos , Espectrometria de Massas , Proteômica , Ferramenta de Busca , Interface Usuário-Computador , Navegador
10.
PLoS One ; 11(1): e0146112, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26784691

RESUMO

Efficient and accurate quantitation of metabolites from LC-MS data has become an important topic. Here we present an automated tool, called iMet-Q (intelligent Metabolomic Quantitation), for label-free metabolomics quantitation from high-throughput MS1 data. By performing peak detection and peak alignment, iMet-Q provides a summary of quantitation results and reports ion abundance at both replicate level and sample level. Furthermore, it gives the charge states and isotope ratios of detected metabolite peaks to facilitate metabolite identification. An in-house standard mixture and a public Arabidopsis metabolome data set were analyzed by iMet-Q. Three public quantitation tools, including XCMS, MetAlign, and MZmine 2, were used for performance comparison. From the mixture data set, seven standard metabolites were detected by the four quantitation tools, for which iMet-Q had a smaller quantitation error of 12% in both profile and centroid data sets. Our tool also correctly determined the charge states of seven standard metabolites. By searching the mass values for those standard metabolites against Human Metabolome Database, we obtained a total of 183 metabolite candidates. With the isotope ratios calculated by iMet-Q, 49% (89 out of 183) metabolite candidates were filtered out. From the public Arabidopsis data set reported with two internal standards and 167 elucidated metabolites, iMet-Q detected all of the peaks corresponding to the internal standards and 167 metabolites. Meanwhile, our tool had small abundance variation (≤ 0.19) when quantifying the two internal standards and had higher abundance correlation (≥ 0.92) when quantifying the 167 metabolites. iMet-Q provides user-friendly interfaces and is publicly available for download at http://ms.iis.sinica.edu.tw/comics/Software_iMet-Q.html.


Assuntos
Metaboloma , Metabolômica/métodos , Software , Arabidopsis/metabolismo , Humanos
11.
Anal Chem ; 87(4): 2466-73, 2015 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-25629585

RESUMO

Glycosylation is a highly complex modification influencing the functions and activities of proteins. Interpretation of intact glycopeptide spectra is crucial but challenging. In this paper, we present a mass spectrometry-based automated glycopeptide identification platform (MAGIC) to identify peptide sequences and glycan compositions directly from intact N-linked glycopeptide collision-induced-dissociation spectra. The identification of the Y1 (peptideY0 + GlcNAc) ion is critical for the correct analysis of unknown glycoproteins, especially without prior knowledge of the proteins and glycans present in the sample. To ensure accurate Y1-ion assignment, we propose a novel algorithm called Trident that detects a triplet pattern corresponding to [Y0, Y1, Y2] or [Y0-NH3, Y0, Y1] from the fragmentation of the common trimannosyl core of N-linked glycopeptides. To facilitate the subsequent peptide sequence identification by common database search engines, MAGIC generates in silico spectra by overwriting the original precursor with the naked peptide m/z and removing all of the glycan-related ions. Finally, MAGIC computes the glycan compositions and ranks them. For the model glycoprotein horseradish peroxidase (HRP) and a 5-glycoprotein mixture, a 2- to 31-fold increase in the relative intensities of the peptide fragments was achieved, which led to the identification of 7 tryptic glycopeptides from HRP and 16 glycopeptides from the mixture via Mascot. In the HeLa cell proteome data set, MAGIC processed over a thousand MS(2) spectra in 3 min on a PC and reported 36 glycopeptides from 26 glycoproteins. Finally, a remarkable false discovery rate of 0 was achieved on the N-glycosylation-free Escherichia coli data set. MAGIC is available at http://ms.iis.sinica.edu.tw/COmics/Software_MAGIC.html .


Assuntos
Algoritmos , Biologia Computacional , Glicopeptídeos/análise , Software , Automação , Bases de Dados Factuais , Escherichia coli/química , Glicopeptídeos/química , Células HeLa , Humanos
12.
Anal Chem ; 87(4): 2143-51, 2015 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-25543920

RESUMO

Metabolite identification remains a bottleneck in mass spectrometry (MS)-based metabolomics. Currently, this process relies heavily on tandem mass spectrometry (MS/MS) spectra generated separately for peaks of interest identified from previous MS runs. Such a delayed and labor-intensive procedure creates a barrier to automation. Further, information embedded in MS data has not been used to its full extent for metabolite identification. Multimers, adducts, multiply charged ions, and fragments of given metabolites occupy a substantial proportion (40-80%) of the peaks of a quantitation result. However, extensive information on these derivatives, especially fragments, may facilitate metabolite identification. We propose a procedure with automation capability to group and annotate peaks associated with the same metabolite in the quantitation results of opposite modes and to integrate this information for metabolite identification. In addition to the conventional mass and isotope ratio matches, we would match annotated fragments with low-energy MS/MS spectra in public databases. For identification of metabolites without accessible MS/MS spectra, we have developed characteristic fragment and common substructure matches. The accuracy and effectiveness of the procedure were evaluated using one public and two in-house liquid chromatography-mass spectrometry (LC-MS) data sets. The procedure accurately identified 89% of 28 standard metabolites with derivative ions in the data sets. With respect to effectiveness, the procedure confidently identified the correct chemical formula of at least 42% of metabolites with derivative ions via MS/MS spectrum, characteristic fragment, and common substructure matches. The confidence level was determined according to the fulfilled identification criteria of various matches and relative retention time.


Assuntos
Metabolômica/métodos , Espectrometria de Massas em Tandem/métodos , Animais , Cromatografia Líquida/métodos , Diabetes Mellitus Experimental/metabolismo , Dieta , Íons/análise , Íons/metabolismo , Metaboloma , Camundongos , Ratos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA