RESUMO
The KRAS gene is one of the most frequently mutated oncogenes in human cancer and gives rise to two isoforms, KRAS4A and KRAS4B. KRAS post-translational modifications (PTMs) have the potential to influence downstream signaling. However, the relationship between KRAS PTMs and oncogenic mutations remains unclear, and the extent of isoform-specific modification is unknown. Here, we present the first top-down proteomics study evaluating both KRAS4A and KRAS4B, resulting in 39 completely characterized proteoforms across colorectal cancer cell lines and primary tumor samples. We determined which KRAS PTMs are present, along with their relative abundance, and that proteoforms of KRAS4A versus KRAS4B are differentially modified. Moreover, we identified a subset of KRAS4B proteoforms lacking the C185 residue and associated C-terminal PTMs. By confocal microscopy, we confirmed that this truncated GFP-KRAS4BC185∗ proteoform is unable to associate with the plasma membrane, resulting in a decrease in mitogen-activated protein kinase signaling pathway activation. Collectively, our study provides a reference set of functionally distinct KRAS proteoforms and the colorectal cancer contexts in which they are present.
Assuntos
Neoplasias Colorretais , Proteínas Quinases Ativadas por Mitógeno , Proteínas Proto-Oncogênicas p21(ras) , Transdução de Sinais , Humanos , Neoplasias Colorretais/genética , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Processamento de Proteína Pós-Traducional , Proteínas Proto-Oncogênicas p21(ras)/genética , Proteínas Proto-Oncogênicas p21(ras)/metabolismo , Linhagem Celular Tumoral , Proteômica , Proteínas Quinases Ativadas por Mitógeno/metabolismoRESUMO
The precisionFDA Truth Challenge V2 aimed to assess the state of the art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant-calling pipelines and submitted 64 variant call sets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome in a Bottle benchmark sets and genome stratifications. Challenge submissions included numerous innovative methods, with graph-based and machine learning methods scoring best for short-read and long-read datasets, respectively. With machine learning approaches, combining multiple sequencing technologies performed particularly well. Recent developments in sequencing and variant calling have enabled benchmarking variants in challenging genomic regions, paving the way for the identification of previously unknown clinically relevant variants.
RESUMO
Immunotherapies are revolutionizing cancer care, producing durable responses and potentially cures in a subset of patients. However, response rates are low for most tumors, grade 3/4 toxicities are not uncommon, and our current understanding of tumor immunobiology is incomplete. While hundreds of immunomodulatory proteins in the tumor microenvironment shape the anti-tumor response, few of them can be reliably quantified. To address this need, we developed a multiplex panel of targeted proteomic assays targeting 52 peptides representing 46 proteins using peptide immunoaffinity enrichment coupled to multiple reaction monitoring-mass spectrometry. We validated the assays in tissue and plasma matrices, where performance figures of merit showed over 3 orders of dynamic range and median inter-day CVs of 5.2% (tissue) and 21% (plasma). A feasibility study in clinical biospecimens showed detection of 48/52 peptides in frozen tissue and 38/52 peptides in plasma. The assays are publicly available as a resource for the research community.
Assuntos
Cromatografia Líquida/métodos , Espectrometria de Massas/métodos , Peptídeos/análise , Proteoma/análise , Proteômica/métodos , Manejo de Espécimes/métodos , Anticorpos/análise , Anticorpos/imunologia , Western Blotting , Linhagem Celular Tumoral , Células HeLa , Humanos , Células Jurkat , Células MCF-7 , Peptídeos/sangue , Peptídeos/imunologia , Proteoma/genética , Proteoma/imunologia , RNA-Seq/métodos , Reprodutibilidade dos TestesRESUMO
SUMMARY: A primary goal of the US National Cancer Institute's Ras initiative at the Frederick National Laboratory for Cancer Research is to develop methods to quantify RAS signaling to facilitate development of novel cancer therapeutics. We use targeted proteomics technologies to develop a community resource consisting of 256 validated multiple reaction monitoring (MRM)-based, multiplexed assays for quantifying protein expression and phosphorylation through the receptor tyrosine kinase, MAPK, and AKT signaling networks. As proof of concept, we quantify the response of melanoma (A375 and SK-MEL-2) and colorectal cancer (HCT-116 and HT-29) cell lines to BRAF inhibition by PLX-4720. These assays replace over 60 Western blots with quantitative mass spectrometry-based assays of high molecular specificity and quantitative precision, showing the value of these methods for pharmacodynamic measurements and mechanism of action studies. Methods, fit-for-purpose validation, and results are publicly available as a resource for the community at assays.cancer.gov. MOTIVATION: A lack of quantitative, multiplexable assays for phosphosignaling limits comprehensive investigation of aberrant signaling in cancer and evaluation of novel treatments. To alleviate this limitation, we sought to develop assays using targeted mass spectrometry for quantifying protein expression and phosphorylation through the receptor tyrosine kinase, MAPK, and AKT signaling networks. The resulting assays provide a resource for replacing over 60 Western blots in examining cancer signaling and tumor biology with high molecular specificity and quantitative rigor.
Assuntos
Melanoma , Proteínas Proto-Oncogênicas c-akt , Humanos , Proteínas Proto-Oncogênicas c-akt/metabolismo , Espectrometria de Massas/métodos , Receptores Proteína Tirosina Quinases , Quinases de Proteína Quinase Ativadas por Mitógeno , TirosinaRESUMO
Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with poor patient survival. Toward understanding the underlying molecular alterations that drive PDAC oncogenesis, we conducted comprehensive proteogenomic analysis of 140 pancreatic cancers, 67 normal adjacent tissues, and 9 normal pancreatic ductal tissues. Proteomic, phosphoproteomic, and glycoproteomic analyses were used to characterize proteins and their modifications. In addition, whole-genome sequencing, whole-exome sequencing, methylation, RNA sequencing (RNA-seq), and microRNA sequencing (miRNA-seq) were performed on the same tissues to facilitate an integrated proteogenomic analysis and determine the impact of genomic alterations on protein expression, signaling pathways, and post-translational modifications. To ensure robust downstream analyses, tumor neoplastic cellularity was assessed via multiple orthogonal strategies using molecular features and verified via pathological estimation of tumor cellularity based on histological review. This integrated proteogenomic characterization of PDAC will serve as a valuable resource for the community, paving the way for early detection and identification of novel therapeutic targets.
Assuntos
Adenocarcinoma/genética , Carcinoma Ductal Pancreático/genética , Neoplasias Pancreáticas/genética , Proteogenômica , Adenocarcinoma/diagnóstico , Adulto , Idoso , Idoso de 80 Anos ou mais , Algoritmos , Carcinoma Ductal Pancreático/diagnóstico , Estudos de Coortes , Células Endoteliais/metabolismo , Epigênese Genética , Feminino , Dosagem de Genes , Genoma Humano , Glicólise , Glicoproteínas/biossíntese , Humanos , Masculino , Pessoa de Meia-Idade , Terapia de Alvo Molecular , Neoplasias Pancreáticas/diagnóstico , Fenótipo , Fosfoproteínas/metabolismo , Fosforilação , Prognóstico , Proteínas Quinases/metabolismo , Proteoma/metabolismo , Especificidade por Substrato , Transcriptoma/genéticaRESUMO
Sample mislabeling or misannotation has been a long-standing problem in scientific research, particularly prevalent in large-scale, multi-omic studies due to the complexity of multi-omic workflows. There exists an urgent need for implementing quality controls to automatically screen for and correct sample mislabels or misannotations in multi-omic studies. Here, we describe a crowdsourced precisionFDA NCI-CPTAC Multi-omics Enabled Sample Mislabeling Correction Challenge, which provides a framework for systematic benchmarking and evaluation of mislabel identification and correction methods for integrative proteogenomic studies. The challenge received a large number of submissions from domestic and international data scientists, with highly variable performance observed across the submitted methods. Post-challenge collaboration between the top-performing teams and the challenge organizers has created an open-source software, COSMO, with demonstrated high accuracy and robustness in mislabeling identification and correction in simulated and real multi-omic datasets.
RESUMO
The integration of mass spectrometry-based proteomics with next-generation DNA and RNA sequencing profiles tumors more comprehensively. Here this "proteogenomics" approach was applied to 122 treatment-naive primary breast cancers accrued to preserve post-translational modifications, including protein phosphorylation and acetylation. Proteogenomics challenged standard breast cancer diagnoses, provided detailed analysis of the ERBB2 amplicon, defined tumor subsets that could benefit from immune checkpoint therapy, and allowed more accurate assessment of Rb status for prediction of CDK4/6 inhibitor responsiveness. Phosphoproteomics profiles uncovered novel associations between tumor suppressor loss and targetable kinases. Acetylproteome analysis highlighted acetylation on key nuclear proteins involved in the DNA damage response and revealed cross-talk between cytoplasmic and mitochondrial acetylation and metabolism. Our results underscore the potential of proteogenomics for clinical investigation of breast cancer through more accurate annotation of targetable pathways and biological features of this remarkably heterogeneous malignancy.
Assuntos
Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Carcinogênese/genética , Carcinogênese/patologia , Terapia de Alvo Molecular , Proteogenômica , Desaminases APOBEC/metabolismo , Adulto , Idoso , Idoso de 80 Anos ou mais , Neoplasias da Mama/imunologia , Neoplasias da Mama/terapia , Estudos de Coortes , Dano ao DNA , Reparo do DNA , Feminino , Humanos , Imunoterapia , Metabolômica , Pessoa de Meia-Idade , Mutagênese/genética , Fosforilação , Inibidores de Proteínas Quinases/farmacologia , Proteínas Quinases/metabolismo , Receptor ErbB-2/metabolismo , Proteína do Retinoblastoma/metabolismo , Microambiente Tumoral/imunologiaRESUMO
Many gene products exhibit great structural heterogeneity because of an array of modifications. These modifications are not directly encoded in the genomic template but often affect the functionality of proteins. Protein glycosylation plays a vital role in proper protein functions. However, the analysis of glycoproteins has been challenging compared with other protein modifications, such as phosphorylation. Here, we perform an integrated proteomic and glycoproteomic analysis of 83 prospectively collected high-grade serous ovarian carcinoma (HGSC) and 23 non-tumor tissues. Integration of the expression data from global proteomics and glycoproteomics reveals tumor-specific glycosylation, uncovers different glycosylation associated with three tumor clusters, and identifies glycosylation enzymes that were correlated with the altered glycosylation. In addition to providing a valuable resource, these results provide insights into the potential roles of glycosylation in the pathogenesis of HGSC, with the possibility of distinguishing pathological outcomes of ovarian tumors from non-tumors, as well as classifying tumor clusters.
Assuntos
Cistadenocarcinoma Seroso/metabolismo , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/metabolismo , Biomarcadores Tumorais/metabolismo , Cistadenocarcinoma Seroso/genética , Cistadenocarcinoma Seroso/patologia , Feminino , Glicoproteínas/metabolismo , Glicosilação , Humanos , Neoplasias Ovarianas/patologia , Proteômica/métodos , Bancos de TecidosRESUMO
[This corrects the article PMC7289043.].
RESUMO
To explore the biology of lung adenocarcinoma (LUAD) and identify new therapeutic opportunities, we performed comprehensive proteogenomic characterization of 110 tumors and 101 matched normal adjacent tissues (NATs) incorporating genomics, epigenomics, deep-scale proteomics, phosphoproteomics, and acetylproteomics. Multi-omics clustering revealed four subgroups defined by key driver mutations, country, and gender. Proteomic and phosphoproteomic data illuminated biology downstream of copy number aberrations, somatic mutations, and fusions and identified therapeutic vulnerabilities associated with driver events involving KRAS, EGFR, and ALK. Immune subtyping revealed a complex landscape, reinforced the association of STK11 with immune-cold behavior, and underscored a potential immunosuppressive role of neutrophil degranulation. Smoking-associated LUADs showed correlation with other environmental exposure signatures and a field effect in NATs. Matched NATs allowed identification of differentially expressed proteins with potential diagnostic and therapeutic utility. This proteogenomics dataset represents a unique public resource for researchers and clinicians seeking to better understand and treat lung adenocarcinomas.
Assuntos
Adenocarcinoma de Pulmão/tratamento farmacológico , Adenocarcinoma de Pulmão/genética , Neoplasias Pulmonares/tratamento farmacológico , Neoplasias Pulmonares/genética , Proteogenômica , Adenocarcinoma de Pulmão/imunologia , Adulto , Idoso , Idoso de 80 Anos ou mais , Biomarcadores Tumorais/metabolismo , Carcinogênese/genética , Carcinogênese/patologia , Variações do Número de Cópias de DNA/genética , Metilação de DNA/genética , Feminino , Humanos , Neoplasias Pulmonares/imunologia , Masculino , Pessoa de Meia-Idade , Mutação/genética , Proteínas de Fusão Oncogênica , Fenótipo , Fosfoproteínas/metabolismo , Proteoma/metabolismoRESUMO
Cancer is driven by genomic alterations, but the processes causing this disease are largely performed by proteins. However, proteins are harder and more expensive to measure than genes and transcripts. To catalyze developments of methods to infer protein levels from other omics measurements, we leveraged crowdsourcing via the NCI-CPTAC DREAM proteogenomic challenge. We asked for methods to predict protein and phosphorylation levels from genomic and transcriptomic data in cancer patients. The best performance was achieved by an ensemble of models, including as predictors transcript level of the corresponding genes, interaction between genes, conservation across tumor types, and phosphosite proximity for phosphorylation prediction. Proteins from metabolic pathways and complexes were the best and worst predicted, respectively. The performance of even the best-performing model was modest, suggesting that many proteins are strongly regulated through translational control and degradation. Our results set a reference for the limitations of computational inference in proteogenomics. A record of this paper's transparent peer review process is included in the Supplemental Information.
Assuntos
Crowdsourcing/métodos , Genômica/métodos , Aprendizado de Máquina/normas , Neoplasias/genética , Fosfoproteínas/metabolismo , Proteínas/genética , Proteômica/métodos , Transcriptoma/genética , Feminino , Humanos , MasculinoRESUMO
In the absence of a dominant driving mutation other than uniformly present TP53 mutations, deeper understanding of the biology driving ovarian high-grade serous cancer (HGSC) requires analysis at a functional level, including post-translational modifications. Comprehensive proteogenomic and phosphoproteomic characterization of 83 prospectively collected ovarian HGSC and appropriate normal precursor tissue samples (fallopian tube) under strict control of ischemia time reveals pathways that significantly differentiate between HGSC and relevant normal tissues in the context of homologous repair deficiency (HRD) status. In addition to confirming key features of HGSC from previous studies, including a potential survival-associated signature and histone acetylation as a marker of HRD, deep phosphoproteomics provides insights regarding the potential role of proliferation-induced replication stress in promoting the characteristic chromosomal instability of HGSC and suggests potential therapeutic targets for use in precision medicine trials.
Assuntos
Instabilidade Cromossômica/fisiologia , Cistadenocarcinoma Seroso , Replicação do DNA/genética , Neoplasias Ovarianas , Fosfotransferases/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Pontos de Checagem do Ciclo Celular/genética , Estudos de Coortes , Cistadenocarcinoma Seroso/genética , Cistadenocarcinoma Seroso/metabolismo , Cistadenocarcinoma Seroso/mortalidade , Dano ao DNA , Neoplasias das Tubas Uterinas/genética , Neoplasias das Tubas Uterinas/metabolismo , Neoplasias das Tubas Uterinas/mortalidade , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Pessoa de Meia-Idade , Mitose/genética , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/metabolismo , Neoplasias Ovarianas/mortalidade , Fosfotransferases/metabolismo , Proteogenômica , Transcriptoma , Proteína Supressora de Tumor p53/genéticaRESUMO
Methodologies that facilitate high-throughput proteomic analysis are a key step toward moving proteome investigations into clinical translation. Data independent acquisition (DIA) has potential as a high-throughput analytical method due to the reduced time needed for sample analysis, as well as its highly quantitative accuracy. However, a limiting feature of DIA methods is the sensitivity of detection of low abundant proteins and depth of coverage, which other mass spectrometry approaches address by two-dimensional fractionation (2D) to reduce sample complexity during data acquisition. In this study, we developed a 2D-DIA method intended for rapid- and deeper-proteome analysis compared to conventional 1D-DIA analysis. First, we characterized 96 individual fractions obtained from the protein standard, NCI-7, using a data-dependent approach (DDA), identifying a total of 151,366 unique peptides from 11,273 protein groups. We observed that the majority of the proteins can be identified from just a few selected fractions. By performing an optimization analysis, we identified six fractions with high peptide number and uniqueness that can account for 80% of the proteins identified in the entire experiment. These selected fractions were combined into a single sample which was then subjected to DIA (referred to as 2D-DIA) quantitative analysis. Furthermore, improved DIA quantification was achieved using a hybrid spectral library, obtained by combining peptides identified from DDA data with peptides identified directly from the DIA runs with the help of DIA-Umpire. The optimized 2D-DIA method allowed for improved identification and quantification of low abundant proteins compared to conventional unfractionated DIA analysis (1D-DIA). We then applied the 2D-DIA method to profile the proteomes of two breast cancer patient-derived xenograft (PDX) models, quantifying 6,217 and 6,167 unique proteins in basal- and luminal- tumors, respectively. Overall, this study demonstrates the potential of high-throughput quantitative proteomics using a novel 2D-DIA method.
Assuntos
Peptídeos/análise , Proteínas/análise , Proteômica , Humanos , Espectrometria de MassasRESUMO
We performed the first proteogenomic characterization of hepatitis B virus (HBV)-related hepatocellular carcinoma (HCC) using paired tumor and adjacent liver tissues from 159 patients. Integrated proteogenomic analyses revealed consistency and discordance among multi-omics, activation status of key signaling pathways, and liver-specific metabolic reprogramming in HBV-related HCC. Proteomic profiling identified three subgroups associated with clinical and molecular attributes including patient survival, tumor thrombus, genetic profile, and the liver-specific proteome. These proteomic subgroups have distinct features in metabolic reprogramming, microenvironment dysregulation, cell proliferation, and potential therapeutics. Two prognostic biomarkers, PYCR2 and ADH1A, related to proteomic subgrouping and involved in HCC metabolic reprogramming, were identified. CTNNB1 and TP53 mutation-associated signaling and metabolic profiles were revealed, among which mutated CTNNB1-associated ALDOA phosphorylation was validated to promote glycolysis and cell proliferation. Our study provides a valuable resource that significantly expands the knowledge of HBV-related HCC and may eventually benefit clinical practice.
Assuntos
Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/virologia , Frutose-Bifosfato Aldolase/genética , Vírus da Hepatite B , Hepatite B Crônica/complicações , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/virologia , Proteogenômica/métodos , beta Catenina/genética , Animais , Proliferação de Células , Estudos de Coortes , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Células Hep G2 , Humanos , Masculino , Camundongos , Camundongos Endogâmicos BALB C , Pessoa de Meia-Idade , Microambiente Tumoral/genéticaRESUMO
RAS genes are frequently mutated in cancer and have for decades eluded effective therapeutic attack. The National Cancer Institute's RAS Initiative has a focus on understanding pathways and discovering therapies for RAS-driven cancers. Part of these efforts is the generation of novel reagents to enable the quantification of RAS network proteins. Here we present a dataset describing the development, validation (following consensus principles developed by the broader research community), and distribution of 104 monoclonal antibodies (mAbs) enabling detection of 27 phosphopeptides and 69 unmodified peptides from 20 proteins in the RAS network. The dataset characterizes the utility of the antibodies in a variety of applications, including Western blotting, immunoprecipitation, protein array, immunohistochemistry, and targeted mass spectrometry. All antibodies and characterization data are publicly available through the CPTAC Antibody Portal, Panorama Public Repository, and/or PRIDE databases. These reagents will aid researchers in discerning pathways and measuring expression changes in the RAS signaling network.
Assuntos
Anticorpos Monoclonais/química , Genes ras , Transdução de Sinais , Linhagem Celular , Impressões Digitais de DNA , Humanos , Indicadores e Reagentes/química , Repetições de Microssatélites , Neoplasias/genéticaRESUMO
BACKGROUND: Mass spectrometry-based proteomics has become a powerful tool for the identification and quantification of proteins from a wide variety of biological specimens. To date, the majority of studies utilizing tissue samples have been carried out on prospectively collected fresh frozen or optimal cutting temperature (OCT) embedded specimens. However, such specimens are often difficult to obtain, in limited in supply, and clinical information and outcomes on patients are inherently delayed as compared to banked samples. Annotated formalin fixed, paraffin embedded (FFPE) tumor tissue specimens are available for research use from a variety of tissue banks, such as from the surveillance, epidemiology and end results (SEER) registries' residual tissue repositories. Given the wealth of outcomes information associated with such samples, the reuse of archived FFPE blocks for deep proteomic characterization with mass spectrometry technologies would provide a valuable resource for population-based cancer studies. Further, due to the widespread availability of FFPE specimens, validation of specimen integrity opens the possibility for thousands of studies that can be conducted worldwide. METHODS: To examine the suitability of the SEER repository tissues for proteomic and phosphoproteomic analysis, we analyzed 60 SEER patient samples, with time in storage ranging from 7 to 32 years; 60 samples with expression proteomics and 18 with phosphoproteomics, using isobaric labeling. Linear modeling and gene set enrichment analysis was used to evaluate the impacts of collection site and storage time. RESULTS: All samples, regardless of age, yielded suitable protein mass after extraction for expression analysis and 18 samples yielded sufficient mass for phosphopeptide analysis. Although peptide, protein, and phosphopeptide identifications were reduced by 50, 20 and 76% respectively, from comparable OCT specimens, we found no statistically significant differences in protein quantitation correlating with collection site or specimen age. GSEA analysis of GO-term level measurements of protein abundance differences between FFPE and OCT embedded specimens suggest that the formalin fixation process may alter representation of protein categories in the resulting dataset. CONCLUSIONS: These studies demonstrate that residual FFPE tissue specimens, of varying age and collection site, are a promising source of protein for proteomic investigations if paired with rigorously verified mass spectrometry workflows.
RESUMO
Mutations of the KRAS gene are found in human cancers with high frequency and result in the constitutive activation of its protein products. This leads to aberrant regulation of downstream pathways, promoting cell survival, proliferation, and tumorigenesis that drive cancer progression and negatively affect treatment outcomes. Here, we describe a workflow that can detect and quantify mutation-specific consequences of KRAS biochemistry, namely linked changes in posttranslational modifications (PTMs). We combined immunoaffinity enrichment with detection by top-down mass spectrometry to discover and quantify proteoforms with or without the Gly13Asp mutation (G13D) specifically in the KRAS4b isoform. The workflow was applied first to isogenic KRAS colorectal cancer (CRC) cell lines and then to patient CRC tumors with matching KRAS genotypes. In two cellular models, a direct link between the knockout of the mutant G13D allele and the complete nitrosylation of cysteine 118 of the remaining WT KRAS4b was observed. Analysis of tumor samples quantified the percentage of mutant KRAS4b actually present in cancer tissue and identified major differences in the levels of C-terminal carboxymethylation, a modification critical for membrane association. These data from CRC cells and human tumors suggest mechanisms of posttranslational regulation that are highly context-dependent and which lead to preferential production of specific KRAS4b proteoforms.