Pesquisa | BVS Economia da Saúde

Community Assessment of the Predictability of Cancer Protein and Phosphoprotein Levels from Genomics and Transcriptomics.

Yang, Mi; Petralia, Francesca; Li, Zhi; Li, Hongyang; Ma, Weiping; Song, Xiaoyu; Kim, Sunkyu; Lee, Heewon; Yu, Han; Lee, Bora; Bae, Seohui; Heo, Eunji; Kaczmarczyk, Jan; Stepniak, Piotr; Warchol, Michal; Yu, Thomas; Calinawan, Anna P; Boutros, Paul C; Payne, Samuel H; Reva, Boris; Boja, Emily; Rodriguez, Henry; Stolovitzky, Gustavo; Guan, Yuanfang; Kang, Jaewoo; Wang, Pei; Fenyö, David; Saez-Rodriguez, Julio.

Cell Syst ; 11(2): 186-195.e9, 2020 08 26.

Artigo em Inglês | MEDLINE | ID: mdl-32710834

RESUMO

Cancer is driven by genomic alterations, but the processes causing this disease are largely performed by proteins. However, proteins are harder and more expensive to measure than genes and transcripts. To catalyze developments of methods to infer protein levels from other omics measurements, we leveraged crowdsourcing via the NCI-CPTAC DREAM proteogenomic challenge. We asked for methods to predict protein and phosphorylation levels from genomic and transcriptomic data in cancer patients. The best performance was achieved by an ensemble of models, including as predictors transcript level of the corresponding genes, interaction between genes, conservation across tumor types, and phosphosite proximity for phosphorylation prediction. Proteins from metabolic pathways and complexes were the best and worst predicted, respectively. The performance of even the best-performing model was modest, suggesting that many proteins are strongly regulated through translational control and degradation. Our results set a reference for the limitations of computational inference in proteogenomics. A record of this paper's transparent peer review process is included in the Supplemental Information.

Assuntos

Crowdsourcing/métodos , Genômica/métodos , Aprendizado de Máquina/normas , Neoplasias/genética , Fosfoproteínas/metabolismo , Proteínas/genética , Proteômica/métodos , Transcriptoma/genética , Feminino , Humanos , Masculino

Reproducibility of Differential Proteomic Technologies in CPTAC Fractionated Xenografts.

Tabb, David L; Wang, Xia; Carr, Steven A; Clauser, Karl R; Mertins, Philipp; Chambers, Matthew C; Holman, Jerry D; Wang, Jing; Zhang, Bing; Zimmerman, Lisa J; Chen, Xian; Gunawardena, Harsha P; Davies, Sherri R; Ellis, Matthew J C; Li, Shunqiang; Townsend, R Reid; Boja, Emily S; Ketchum, Karen A; Kinsinger, Christopher R; Mesri, Mehdi; Rodriguez, Henry; Liu, Tao; Kim, Sangtae; McDermott, Jason E; Payne, Samuel H; Petyuk, Vladislav A; Rodland, Karin D; Smith, Richard D; Yang, Feng; Chan, Daniel W; Zhang, Bai; Zhang, Hui; Zhang, Zhen; Zhou, Jian-Ying; Liebler, Daniel C.

J Proteome Res ; 15(3): 691-706, 2016 Mar 04.

Artigo em Inglês | MEDLINE | ID: mdl-26653538

RESUMO

The NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) employed a pair of reference xenograft proteomes for initial platform validation and ongoing quality control of its data collection for The Cancer Genome Atlas (TCGA) tumors. These two xenografts, representing basal and luminal-B human breast cancer, were fractionated and analyzed on six mass spectrometers in a total of 46 replicates divided between iTRAQ and label-free technologies, spanning a total of 1095 LC-MS/MS experiments. These data represent a unique opportunity to evaluate the stability of proteomic differentiation by mass spectrometry over many months of time for individual instruments or across instruments running dissimilar workflows. We evaluated iTRAQ reporter ions, label-free spectral counts, and label-free extracted ion chromatograms as strategies for data interpretation (source code is available from http://homepages.uc.edu/~wang2x7/Research.htm ). From these assessments, we found that differential genes from a single replicate were confirmed by other replicates on the same instrument from 61 to 93% of the time. When comparing across different instruments and quantitative technologies, using multiple replicates, differential genes were reproduced by other data sets from 67 to 99% of the time. Projecting gene differences to biological pathways and networks increased the degree of similarity. These overlaps send an encouraging message about the maturity of technologies for proteomic differentiation.

Assuntos

Xenoenxertos/química , Proteômica/métodos , Proteômica/normas , Neoplasias da Mama/química , Neoplasias da Mama/metabolismo , Cromatografia Líquida , Interpretação Estatística de Dados , Feminino , Perfilação da Expressão Gênica/métodos , Humanos , Redes e Vias Metabólicas , Variações Dependentes do Observador , Proteoma , Proteômica/instrumentação , Controle de Qualidade , Reprodutibilidade dos Testes , Espectrometria de Massas em Tandem/normas

An Optimized Informatics Pipeline for Mass Spectrometry-Based Peptidomics.

Wu, Chaochao; Monroe, Matthew E; Xu, Zhe; Slysz, Gordon W; Payne, Samuel H; Rodland, Karin D; Liu, Tao; Smith, Richard D.

J Am Soc Mass Spectrom ; 26(12): 2002-8, 2015 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-26015166

RESUMO

The comprehensive MS analysis of the peptidome, the intracellular and intercellular products of protein degradation, has the potential to provide novel insights on endogenous proteolytic processing and its utility in disease diagnosis and prognosis. Along with the advances in MS instrumentation and related platforms, a plethora of proteomics data analysis tools have been applied for direct use in peptidomics; however, an evaluation of the currently available informatics pipelines for peptidomics data analysis has yet to be reported. In this study, we began by evaluating the results of several popular MS/MS database search engines, including MS-GF+, SEQUEST, and MS-Align+, for peptidomics data analysis, followed by identification and label-free quantification using the well-established accurate mass and time (AMT) tag and newly developed informed quantification (IQ) approaches, both based on direct LC-MS analysis. Our results demonstrated that MS-GF+ outperformed both SEQUEST and MS-Align+ in identifying peptidome peptides. Using a database established from MS-GF+ peptide identifications, both the AMT tag and IQ approaches provided significantly deeper peptidome coverage and less missing data for each individual data set than the MS/MS methods, while achieving robust label-free quantification. Besides having an excellent correlation with the AMT tag quantification results, IQ also provided slightly higher peptidome coverage. Taken together, we propose an optimized informatics pipeline combining MS-GF+ for initial database searching with IQ (or AMT tag) approaches for identification and label-free quantification for high-throughput, comprehensive, and quantitative peptidomics analysis. Graphical Abstract á.

Assuntos

Peptídeos/análise , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Cromatografia Líquida/economia , Cromatografia Líquida/métodos , Bases de Dados de Proteínas , Humanos , Neoplasias/química , Mapeamento de Peptídeos/economia , Mapeamento de Peptídeos/métodos , Proteômica/economia , Ferramenta de Busca , Software , Espectrometria de Massas em Tandem/economia , Fluxo de Trabalho

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA