Búsqueda | Portal Regional de la BVS

1.

IDPpub: Illuminating the Dark Phosphoproteome Through PubMed Mining.

Savage, Sara R; Zhang, Yaoyun; Jaehnig, Eric J; Liao, Yuxing; Shi, Zhiao; Pham, Huy Anh; Xu, Hua; Zhang, Bing.

Mol Cell Proteomics ; 23(1): 100682, 2024 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-37993103

RESUMEN

Global phosphoproteomics experiments quantify tens of thousands of phosphorylation sites. However, data interpretation is hampered by our limited knowledge on functions, biological contexts, or precipitating enzymes of the phosphosites. This study establishes a repository of phosphosites with associated evidence in biomedical abstracts, using deep learning-based natural language processing techniques. Our model for illuminating the dark phosphoproteome through PubMed mining (IDPpub) was generated by fine-tuning BioBERT, a deep learning tool for biomedical text mining. Trained using sentences containing protein substrates and phosphorylation site positions from 3000 abstracts, the IDPpub model was then used to extract phosphorylation sites from all MEDLINE abstracts. The extracted proteins were normalized to gene symbols using the National Center for Biotechnology Information gene query, and sites were mapped to human UniProt sequences using ProtMapper and mouse UniProt sequences by direct match. Precision and recall were calculated using 150 curated abstracts, and utility was assessed by analyzing the CPTAC (Clinical Proteomics Tumor Analysis Consortium) pan-cancer phosphoproteomics datasets and the PhosphoSitePlus database. Using 10-fold cross validation, pairs of correct substrates and phosphosite positions were extracted with an average precision of 0.93 and recall of 0.94. After entity normalization and site mapping to human reference sequences, an independent validation achieved a precision of 0.91 and recall of 0.77. The IDPpub repository contains 18,458 unique human phosphorylation sites with evidence sentences from 58,227 abstracts and 5918 mouse sites in 14,610 abstracts. This included evidence sentences for 1803 sites identified in CPTAC studies that are not covered by manually curated functional information in PhosphoSitePlus. Evaluation results demonstrate the potential of IDPpub as an effective biomedical text mining tool for collecting phosphosites. Moreover, the repository (http://idppub.ptmax.org), which can be automatically updated, can serve as a powerful complement to existing resources.

Asunto(s)

Minería de Datos , Procesamiento de Lenguaje Natural , Humanos , Minería de Datos/métodos , Bases de Datos Factuales , PubMed

2.

A proteogenomics data-driven knowledge base of human cancer.

Liao, Yuxing; Savage, Sara R; Dou, Yongchao; Shi, Zhiao; Yi, Xinpei; Jiang, Wen; Lei, Jonathan T; Zhang, Bing.

Cell Syst ; 14(9): 777-787.e5, 2023 09 20.

Artículo en Inglés | MEDLINE | ID: mdl-37619559

RESUMEN

By combining mass-spectrometry-based proteomics and phosphoproteomics with genomics, epi-genomics, and transcriptomics, proteogenomics provides comprehensive molecular characterization of cancer. Using this approach, the Clinical Proteomic Tumor Analysis Consortium (CPTAC) has characterized over 1,000 primary tumors spanning 10 cancer types, many with matched normal tissues. Here, we present LinkedOmicsKB, a proteogenomics data-driven knowledge base that makes consistently processed and systematically precomputed CPTAC pan-cancer proteogenomics data available to the public through â¼40,000 gene-, protein-, mutation-, and phenotype-centric web pages. Visualization techniques facilitate efficient exploration and reasoning of complex, interconnected data. Using three case studies, we illustrate the practical utility of LinkedOmicsKB in providing new insights into genes, phosphorylation sites, somatic mutations, and cancer phenotypes. With precomputed results of 19,701 coding genes, 125,969 phosphosites, and 256 genotypes and phenotypes, LinkedOmicsKB provides a comprehensive resource to accelerate proteogenomics data-driven discoveries to improve our understanding and treatment of human cancer. A record of this paper's transparent peer review process is included in the supplemental information.

Asunto(s)

Neoplasias , Proteogenómica , Humanos , Proteómica , Proteogenómica/métodos , Genómica , Neoplasias/genética , Bases del Conocimiento

3.

Proteogenomic Markers of Chemotherapy Resistance and Response in Triple-Negative Breast Cancer.

Anurag, Meenakshi; Jaehnig, Eric J; Krug, Karsten; Lei, Jonathan T; Bergstrom, Erik J; Kim, Beom-Jun; Vashist, Tanmayi D; Huynh, Anh Minh Tran; Dou, Yongchao; Gou, Xuxu; Huang, Chen; Shi, Zhiao; Wen, Bo; Korchina, Viktoriya; Gibbs, Richard A; Muzny, Donna M; Doddapaneni, Harshavardhan; Dobrolecki, Lacey E; Rodriguez, Henry; Robles, Ana I; Hiltke, Tara; Lewis, Michael T; Nangia, Julie R; Nemati Shafaee, Maryam; Li, Shunqiang; Hagemann, Ian S; Hoog, Jeremy; Lim, Bora; Osborne, C Kent; Mani, D R; Gillette, Michael A; Zhang, Bing; Echeverria, Gloria V; Miles, George; Rimawi, Mothaffar F; Carr, Steven A; Ademuyiwa, Foluso O; Satpathy, Shankha; Ellis, Matthew J.

Cancer Discov ; 12(11): 2586-2605, 2022 11 02.

Artículo en Inglés | MEDLINE | ID: mdl-36001024

RESUMEN

Microscaled proteogenomics was deployed to probe the molecular basis for differential response to neoadjuvant carboplatin and docetaxel combination chemotherapy for triple-negative breast cancer (TNBC). Proteomic analyses of pretreatment patient biopsies uniquely revealed metabolic pathways, including oxidative phosphorylation, adipogenesis, and fatty acid metabolism, that were associated with resistance. Both proteomics and transcriptomics revealed that sensitivity was marked by elevation of DNA repair, E2F targets, G2-M checkpoint, interferon-gamma signaling, and immune-checkpoint components. Proteogenomic analyses of somatic copy-number aberrations identified a resistance-associated 19q13.31-33 deletion where LIG1, POLD1, and XRCC1 are located. In orthogonal datasets, LIG1 (DNA ligase I) gene deletion and/or low mRNA expression levels were associated with lack of pathologic complete response, higher chromosomal instability index (CIN), and poor prognosis in TNBC, as well as carboplatin-selective resistance in TNBC preclinical models. Hemizygous loss of LIG1 was also associated with higher CIN and poor prognosis in other cancer types, demonstrating broader clinical implications. SIGNIFICANCE: Proteogenomic analysis of triple-negative breast tumors revealed a complex landscape of chemotherapy response associations, including a 19q13.31-33 somatic deletion encoding genes serving lagging-strand DNA synthesis (LIG1, POLD1, and XRCC1), that correlate with lack of pathologic response, carboplatin-selective resistance, and, in pan-cancer studies, poor prognosis and CIN. This article is highlighted in the In This Issue feature, p. 2483.

Asunto(s)

Proteogenómica , Neoplasias de la Mama Triple Negativas , Humanos , Neoplasias de la Mama Triple Negativas/tratamiento farmacológico , Carboplatino , Proteómica , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapéutico , Terapia Neoadyuvante , Proteína 1 de Reparación por Escisión del Grupo de Complementación Cruzada de las Lesiones por Rayos X

4.

Proteogenomic characterization of pancreatic ductal adenocarcinoma.

Cao, Liwei; Huang, Chen; Cui Zhou, Daniel; Hu, Yingwei; Lih, T Mamie; Savage, Sara R; Krug, Karsten; Clark, David J; Schnaubelt, Michael; Chen, Lijun; da Veiga Leprevost, Felipe; Eguez, Rodrigo Vargas; Yang, Weiming; Pan, Jianbo; Wen, Bo; Dou, Yongchao; Jiang, Wen; Liao, Yuxing; Shi, Zhiao; Terekhanova, Nadezhda V; Cao, Song; Lu, Rita Jui-Hsien; Li, Yize; Liu, Ruiyang; Zhu, Houxiang; Ronning, Peter; Wu, Yige; Wyczalkowski, Matthew A; Easwaran, Hariharan; Danilova, Ludmila; Mer, Arvind Singh; Yoo, Seungyeul; Wang, Joshua M; Liu, Wenke; Haibe-Kains, Benjamin; Thiagarajan, Mathangi; Jewell, Scott D; Hostetter, Galen; Newton, Chelsea J; Li, Qing Kay; Roehrl, Michael H; Fenyö, David; Wang, Pei; Nesvizhskii, Alexey I; Mani, D R; Omenn, Gilbert S; Boja, Emily S; Mesri, Mehdi; Robles, Ana I; Rodriguez, Henry.

Cell ; 184(19): 5031-5052.e26, 2021 09 16.

Artículo en Inglés | MEDLINE | ID: mdl-34534465

RESUMEN

Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with poor patient survival. Toward understanding the underlying molecular alterations that drive PDAC oncogenesis, we conducted comprehensive proteogenomic analysis of 140 pancreatic cancers, 67 normal adjacent tissues, and 9 normal pancreatic ductal tissues. Proteomic, phosphoproteomic, and glycoproteomic analyses were used to characterize proteins and their modifications. In addition, whole-genome sequencing, whole-exome sequencing, methylation, RNA sequencing (RNA-seq), and microRNA sequencing (miRNA-seq) were performed on the same tissues to facilitate an integrated proteogenomic analysis and determine the impact of genomic alterations on protein expression, signaling pathways, and post-translational modifications. To ensure robust downstream analyses, tumor neoplastic cellularity was assessed via multiple orthogonal strategies using molecular features and verified via pathological estimation of tumor cellularity based on histological review. This integrated proteogenomic characterization of PDAC will serve as a valuable resource for the community, paving the way for early detection and identification of novel therapeutic targets.

Asunto(s)

Adenocarcinoma/genética , Carcinoma Ductal Pancreático/genética , Neoplasias Pancreáticas/genética , Proteogenómica , Adenocarcinoma/diagnóstico , Adulto , Anciano , Anciano de 80 o más Años , Algoritmos , Carcinoma Ductal Pancreático/diagnóstico , Estudios de Cohortes , Células Endoteliales/metabolismo , Epigénesis Genética , Femenino , Dosificación de Gen , Genoma Humano , Glucólisis , Glicoproteínas/biosíntesis , Humanos , Masculino , Persona de Mediana Edad , Terapia Molecular Dirigida , Neoplasias Pancreáticas/diagnóstico , Fenotipo , Fosfoproteínas/metabolismo , Fosforilación , Pronóstico , Proteínas Quinasas/metabolismo , Proteoma/metabolismo , Especificidad por Sustrato , Transcriptoma/genética

5.

A community effort to identify and correct mislabeled samples in proteogenomic studies.

Yoo, Seungyeul; Shi, Zhiao; Wen, Bo; Kho, SoonJye; Pan, Renke; Feng, Hanying; Chen, Hong; Carlsson, Anders; Edén, Patrik; Ma, Weiping; Raymer, Michael; Maier, Ezekiel J; Tezak, Zivana; Johanson, Elaine; Hinton, Denise; Rodriguez, Henry; Zhu, Jun; Boja, Emily; Wang, Pei; Zhang, Bing.

Patterns (N Y) ; 2(5): 100245, 2021 May 14.

Artículo en Inglés | MEDLINE | ID: mdl-34036290

RESUMEN

Sample mislabeling or misannotation has been a long-standing problem in scientific research, particularly prevalent in large-scale, multi-omic studies due to the complexity of multi-omic workflows. There exists an urgent need for implementing quality controls to automatically screen for and correct sample mislabels or misannotations in multi-omic studies. Here, we describe a crowdsourced precisionFDA NCI-CPTAC Multi-omics Enabled Sample Mislabeling Correction Challenge, which provides a framework for systematic benchmarking and evaluation of mislabel identification and correction methods for integrative proteogenomic studies. The challenge received a large number of submissions from domestic and international data scientists, with highly variable performance observed across the submitted methods. Post-challenge collaboration between the top-performing teams and the challenge organizers has created an open-source software, COSMO, with demonstrated high accuracy and robustness in mislabeling identification and correction in simulated and real multi-omic datasets.

6.

Feature Selection Methods for Protein Biomarker Discovery from Proteomics or Multiomics Data.

Shi, Zhiao; Wen, Bo; Gao, Qiang; Zhang, Bing.

Mol Cell Proteomics ; 20: 100083, 2021.

Artículo en Inglés | MEDLINE | ID: mdl-33887487

RESUMEN

Untargeted mass spectrometry (MS)-based proteomics provides a powerful platform for protein biomarker discovery, but clinical translation depends on the selection of a small number of proteins for downstream verification and validation. Due to the small sample size of typical discovery studies, protein markers identified from discovery data may not be generalizable to independent datasets. In addition, a good protein marker identified using a discovery platform may be difficult to implement in verification and validation platforms. Moreover, although multiomics characterization is being increasingly used in discovery cohort studies, there is no existing method for multiomics-facilitated protein biomarker selection. Here, we present ProMS, a computational algorithm for protein marker selection. The algorithm is based on the hypothesis that a phenotype is characterized by a few underlying biological functions, each manifested by a group of coexpressed proteins. A weighted k-medoids clustering algorithm is applied to all univariately informative proteins to identify both coexpressed protein clusters and a representative protein for each cluster as markers. In two clinically important classification problems, ProMS shows superior performance compared with existing feature selection methods. ProMS can be extended to the multiomics setting (ProMS_mo) through a constrained weighted k-medoids clustering algorithm, and the protein panels selected by ProMS_mo show improved performance on independent test data compared with ProMS. In addition to superior performance, ProMS and ProMS_mo also have two unique strengths. First, the feature clusters enable functional interpretation of the selected protein markers. Second, the feature clusters provide an opportunity to select replacement protein markers, facilitating a robust transition to the verification and validation platforms. In summary, this study provides a unified and effective computational framework for selecting protein biomarkers using proteomics or multiomics data. The software implementation is publicly available at https://github.com/bzhanglab/proms.

Asunto(s)

Algoritmos , Biomarcadores de Tumor/metabolismo , Carcinoma Hepatocelular/metabolismo , Neoplasias Colorrectales/metabolismo , Neoplasias Hepáticas/metabolismo , Proteínas de Neoplasias/metabolismo , Proteómica/métodos , Humanos , Espectrometría de Masas , Pronóstico , Programas Informáticos

7.

Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma.

Huang, Chen; Chen, Lijun; Savage, Sara R; Eguez, Rodrigo Vargas; Dou, Yongchao; Li, Yize; da Veiga Leprevost, Felipe; Jaehnig, Eric J; Lei, Jonathan T; Wen, Bo; Schnaubelt, Michael; Krug, Karsten; Song, Xiaoyu; Cieslik, Marcin; Chang, Hui-Yin; Wyczalkowski, Matthew A; Li, Kai; Colaprico, Antonio; Li, Qing Kay; Clark, David J; Hu, Yingwei; Cao, Liwei; Pan, Jianbo; Wang, Yuefan; Cho, Kyung-Cho; Shi, Zhiao; Liao, Yuxing; Jiang, Wen; Anurag, Meenakshi; Ji, Jiayi; Yoo, Seungyeul; Zhou, Daniel Cui; Liang, Wen-Wei; Wendl, Michael; Vats, Pankaj; Carr, Steven A; Mani, D R; Zhang, Zhen; Qian, Jiang; Chen, Xi S; Pico, Alexander R; Wang, Pei; Chinnaiyan, Arul M; Ketchum, Karen A; Kinsinger, Christopher R; Robles, Ana I; An, Eunkyung; Hiltke, Tara; Mesri, Mehdi; Thiagarajan, Mathangi.

Cancer Cell ; 39(3): 361-379.e16, 2021 03 08.

Artículo en Inglés | MEDLINE | ID: mdl-33417831

RESUMEN

We present a proteogenomic study of 108 human papilloma virus (HPV)-negative head and neck squamous cell carcinomas (HNSCCs). Proteomic analysis systematically catalogs HNSCC-associated proteins and phosphosites, prioritizes copy number drivers, and highlights an oncogenic role for RNA processing genes. Proteomic investigation of mutual exclusivity between FAT1 truncating mutations and 11q13.3 amplifications reveals dysregulated actin dynamics as a common functional consequence. Phosphoproteomics characterizes two modes of EGFR activation, suggesting a new strategy to stratify HNSCCs based on EGFR ligand abundance for effective treatment with inhibitory EGFR monoclonal antibodies. Widespread deletion of immune modulatory genes accounts for low immune infiltration in immune-cold tumors, whereas concordant upregulation of multiple immune checkpoint proteins may underlie resistance to anti-programmed cell death protein 1 monotherapy in immune-hot tumors. Multi-omic analysis identifies three molecular subtypes with high potential for treatment with CDK inhibitors, anti-EGFR antibody therapy, and immunotherapy, respectively. Altogether, proteogenomics provides a systematic framework to inform HNSCC biology and treatment.

Asunto(s)

Antineoplásicos Inmunológicos/uso terapéutico , Infecciones por Papillomavirus/genética , Carcinoma de Células Escamosas de Cabeza y Cuello/tratamiento farmacológico , Carcinoma de Células Escamosas de Cabeza y Cuello/genética , Adulto , Anciano , Anciano de 80 o más Años , Receptores ErbB/genética , Femenino , Humanos , Inmunoterapia/métodos , Masculino , Persona de Mediana Edad , Infecciones por Papillomavirus/tratamiento farmacológico , Infecciones por Papillomavirus/virología , Proteogenómica/métodos , Proteómica/métodos , Adulto Joven

8.

Proteogenomic Landscape of Breast Cancer Tumorigenesis and Targeted Therapy.

Krug, Karsten; Jaehnig, Eric J; Satpathy, Shankha; Blumenberg, Lili; Karpova, Alla; Anurag, Meenakshi; Miles, George; Mertins, Philipp; Geffen, Yifat; Tang, Lauren C; Heiman, David I; Cao, Song; Maruvka, Yosef E; Lei, Jonathan T; Huang, Chen; Kothadia, Ramani B; Colaprico, Antonio; Birger, Chet; Wang, Jarey; Dou, Yongchao; Wen, Bo; Shi, Zhiao; Liao, Yuxing; Wiznerowicz, Maciej; Wyczalkowski, Matthew A; Chen, Xi Steven; Kennedy, Jacob J; Paulovich, Amanda G; Thiagarajan, Mathangi; Kinsinger, Christopher R; Hiltke, Tara; Boja, Emily S; Mesri, Mehdi; Robles, Ana I; Rodriguez, Henry; Westbrook, Thomas F; Ding, Li; Getz, Gad; Clauser, Karl R; Fenyö, David; Ruggles, Kelly V; Zhang, Bing; Mani, D R; Carr, Steven A; Ellis, Matthew J; Gillette, Michael A.

Cell ; 183(5): 1436-1456.e31, 2020 11 25.

Artículo en Inglés | MEDLINE | ID: mdl-33212010

RESUMEN

The integration of mass spectrometry-based proteomics with next-generation DNA and RNA sequencing profiles tumors more comprehensively. Here this "proteogenomics" approach was applied to 122 treatment-naive primary breast cancers accrued to preserve post-translational modifications, including protein phosphorylation and acetylation. Proteogenomics challenged standard breast cancer diagnoses, provided detailed analysis of the ERBB2 amplicon, defined tumor subsets that could benefit from immune checkpoint therapy, and allowed more accurate assessment of Rb status for prediction of CDK4/6 inhibitor responsiveness. Phosphoproteomics profiles uncovered novel associations between tumor suppressor loss and targetable kinases. Acetylproteome analysis highlighted acetylation on key nuclear proteins involved in the DNA damage response and revealed cross-talk between cytoplasmic and mitochondrial acetylation and metabolism. Our results underscore the potential of proteogenomics for clinical investigation of breast cancer through more accurate annotation of targetable pathways and biological features of this remarkably heterogeneous malignancy.

Asunto(s)

Neoplasias de la Mama/genética , Neoplasias de la Mama/patología , Carcinogénesis/genética , Carcinogénesis/patología , Terapia Molecular Dirigida , Proteogenómica , Desaminasas APOBEC/metabolismo , Adulto , Anciano , Anciano de 80 o más Años , Neoplasias de la Mama/inmunología , Neoplasias de la Mama/terapia , Estudios de Cohortes , Daño del ADN , Reparación del ADN , Femenino , Humanos , Inmunoterapia , Metabolómica , Persona de Mediana Edad , Mutagénesis/genética , Fosforilación , Inhibidores de Proteínas Quinasas/farmacología , Proteínas Quinasas/metabolismo , Receptor ErbB-2/metabolismo , Proteína de Retinoblastoma/metabolismo , Microambiente Tumoral/inmunología

9.

Deep Learning in Proteomics.

Wen, Bo; Zeng, Wen-Feng; Liao, Yuxing; Shi, Zhiao; Savage, Sara R; Jiang, Wen; Zhang, Bing.

Proteomics ; 20(21-22): e1900335, 2020 11.

Artículo en Inglés | MEDLINE | ID: mdl-32939979

RESUMEN

Proteomics, the study of all the proteins in biological systems, is becoming a data-rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technology, protein expression and post-translational modifications (PTMs) can be studied in a variety of biological systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amount of data into novel biological insights. Deep learning automatically extracts data representations at high levels of abstraction from data, and it thrives in data-rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex-peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.

Asunto(s)

Aprendizaje Profundo , Proteómica , Algoritmos , Procesamiento Proteico-Postraduccional , Espectrometría de Masas en Tándem

10.

Correction: Graph Algorithms for Condensing and Consolidating Gene Set Analysis Results.

Savage, Sara R; Shi, Zhiao; Liao, Yuxing; Zhang, Bing.

Mol Cell Proteomics ; 19(2): 431, 2020 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-32007940

11.

Proteogenomic Characterization of Endometrial Carcinoma.

Dou, Yongchao; Kawaler, Emily A; Cui Zhou, Daniel; Gritsenko, Marina A; Huang, Chen; Blumenberg, Lili; Karpova, Alla; Petyuk, Vladislav A; Savage, Sara R; Satpathy, Shankha; Liu, Wenke; Wu, Yige; Tsai, Chia-Feng; Wen, Bo; Li, Zhi; Cao, Song; Moon, Jamie; Shi, Zhiao; Cornwell, MacIntosh; Wyczalkowski, Matthew A; Chu, Rosalie K; Vasaikar, Suhas; Zhou, Hua; Gao, Qingsong; Moore, Ronald J; Li, Kai; Sethuraman, Sunantha; Monroe, Matthew E; Zhao, Rui; Heiman, David; Krug, Karsten; Clauser, Karl; Kothadia, Ramani; Maruvka, Yosef; Pico, Alexander R; Oliphant, Amanda E; Hoskins, Emily L; Pugh, Samuel L; Beecroft, Sean J I; Adams, David W; Jarman, Jonathan C; Kong, Andy; Chang, Hui-Yin; Reva, Boris; Liao, Yuxing; Rykunov, Dmitry; Colaprico, Antonio; Chen, Xi Steven; Czekanski, Andrzej; Jedryka, Marcin.

Cell ; 180(4): 729-748.e26, 2020 02 20.

Artículo en Inglés | MEDLINE | ID: mdl-32059776

RESUMEN

We undertook a comprehensive proteogenomic characterization of 95 prospectively collected endometrial carcinomas, comprising 83 endometrioid and 12 serous tumors. This analysis revealed possible new consequences of perturbations to the p53 and Wnt/ß-catenin pathways, identified a potential role for circRNAs in the epithelial-mesenchymal transition, and provided new information about proteomic markers of clinical and genomic tumor subgroups, including relationships to known druggable pathways. An extensive genome-wide acetylation survey yielded insights into regulatory mechanisms linking Wnt signaling and histone acetylation. We also characterized aspects of the tumor immune landscape, including immunogenic alterations, neoantigens, common cancer/testis antigens, and the immune microenvironment, all of which can inform immunotherapy decisions. Collectively, our multi-omic analyses provide a valuable resource for researchers and clinicians, identify new molecular associations of potential mechanistic significance in the development of endometrial cancers, and suggest novel approaches for identifying potential therapeutic targets.

Asunto(s)

Carcinoma/genética , Neoplasias Endometriales/genética , Regulación Neoplásica de la Expresión Génica , Proteoma/genética , Transcriptoma , Acetilación , Animales , Antígenos de Neoplasias/genética , Carcinoma/inmunología , Carcinoma/patología , Neoplasias Endometriales/inmunología , Neoplasias Endometriales/patología , Transición Epitelial-Mesenquimal/genética , Retroalimentación Fisiológica , Femenino , Inestabilidad Genómica , Humanos , Ratones , MicroARNs/genética , MicroARNs/metabolismo , Repeticiones de Microsatélite , Fosforilación , Procesamiento Proteico-Postraduccional , Proteoma/metabolismo , Transducción de Señal

12.

Graph Algorithms for Condensing and Consolidating Gene Set Analysis Results.

Savage, Sara R; Shi, Zhiao; Liao, Yuxing; Zhang, Bing.

Mol Cell Proteomics ; 18(8 suppl 1): S141-S152, 2019 08 09.

Artículo en Inglés | MEDLINE | ID: mdl-31142576

RESUMEN

Gene set analysis plays a critical role in the functional interpretation of omics data. Although this is typically done for one omics experiment at a time, there is an increasing need to combine gene set analysis results from multiple experiments performed on the same or different omics platforms, such as in multi-omics studies. Integrating results from multiple experiments is challenging, and annotation redundancy between gene sets further obscures clear conclusions. We propose to use a weighted set cover algorithm to reduce redundancy of gene sets identified in a single experiment. Next, we use affinity propagation to consolidate similar gene sets identified from multiple experiments into clusters and to automatically determine the most representative gene set for each cluster. Using three examples from over representation analysis and gene set enrichment analysis, we showed that weighted set cover outperformed a previously published set cover method and reduced the number of gene sets by 52-77%. Focusing on overlapping genes between the list of input genes and the enriched gene sets in over-representation analysis and leading-edge genes in gene set enrichment analysis further reduced the number of gene sets. A use case combining enrichment analysis results from RNA-Seq and proteomics data comparing basal and luminal A breast cancer samples highlighted the known difference in proliferation and DNA damage response. Finally, we used these algorithms for a pan-cancer survival analysis. Our analysis clearly revealed prognosis-related pathways common to multiple cancer types or specific to individual cancer types, as well as pathways associated with prognosis in different directions in different cancer types. We implemented these two algorithms in an R package, Sumer, which generates tables and static and interactive plots for exploration and publication. Sumer is publicly available at https://github.com/bzhanglab/sumer.

Asunto(s)

Algoritmos , Genómica/métodos , Neoplasias de la Mama/genética , Neoplasias Colorrectales/genética , Femenino , Regulación Neoplásica de la Expresión Génica , Humanos , Proteínas de Neoplasias/genética , RNA-Seq

13.

WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs.

Liao, Yuxing; Wang, Jing; Jaehnig, Eric J; Shi, Zhiao; Zhang, Bing.

Nucleic Acids Res ; 47(W1): W199-W205, 2019 07 02.

Artículo en Inglés | MEDLINE | ID: mdl-31114916

RESUMEN

WebGestalt is a popular tool for the interpretation of gene lists derived from large scale -omics studies. In the 2019 update, WebGestalt supports 12 organisms, 342 gene identifiers and 155 175 functional categories, as well as user-uploaded functional databases. To address the growing and unique need for phosphoproteomics data interpretation, we have implemented phosphosite set analysis to identify important kinases from phosphoproteomics data. We have completely redesigned result visualizations and user interfaces to improve user-friendliness and to provide multiple types of interactive and publication-ready figures. To facilitate comprehension of the enrichment results, we have implemented two methods to reduce redundancy between enriched gene sets. We introduced a web API for other applications to get data programmatically from the WebGestalt server or pass data to WebGestalt for analysis. We also wrapped the core computation into an R package called WebGestaltR for users to perform analysis locally or in third party workflows. WebGestalt can be freely accessed at http://www.webgestalt.org.

Asunto(s)

Bases de Datos Genéticas , Programas Informáticos , Conjuntos de Datos como Asunto , Interfaz Usuario-Computador , Navegador Web

14.

Proteogenomic Analysis of Human Colon Cancer Reveals New Therapeutic Opportunities.

Vasaikar, Suhas; Huang, Chen; Wang, Xiaojing; Petyuk, Vladislav A; Savage, Sara R; Wen, Bo; Dou, Yongchao; Zhang, Yun; Shi, Zhiao; Arshad, Osama A; Gritsenko, Marina A; Zimmerman, Lisa J; McDermott, Jason E; Clauss, Therese R; Moore, Ronald J; Zhao, Rui; Monroe, Matthew E; Wang, Yi-Ting; Chambers, Matthew C; Slebos, Robbert J C; Lau, Ken S; Mo, Qianxing; Ding, Li; Ellis, Matthew; Thiagarajan, Mathangi; Kinsinger, Christopher R; Rodriguez, Henry; Smith, Richard D; Rodland, Karin D; Liebler, Daniel C; Liu, Tao; Zhang, Bing.

Cell ; 177(4): 1035-1049.e19, 2019 05 02.

Artículo en Inglés | MEDLINE | ID: mdl-31031003

RESUMEN

We performed the first proteogenomic study on a prospectively collected colon cancer cohort. Comparative proteomic and phosphoproteomic analysis of paired tumor and normal adjacent tissues produced a catalog of colon cancer-associated proteins and phosphosites, including known and putative new biomarkers, drug targets, and cancer/testis antigens. Proteogenomic integration not only prioritized genomically inferred targets, such as copy-number drivers and mutation-derived neoantigens, but also yielded novel findings. Phosphoproteomics data associated Rb phosphorylation with increased proliferation and decreased apoptosis in colon cancer, which explains why this classical tumor suppressor is amplified in colon tumors and suggests a rationale for targeting Rb phosphorylation in colon cancer. Proteomics identified an association between decreased CD8 T cell infiltration and increased glycolysis in microsatellite instability-high (MSI-H) tumors, suggesting glycolysis as a potential target to overcome the resistance of MSI-H tumors to immune checkpoint blockade. Proteogenomics presents new avenues for biological discoveries and therapeutic development.

Asunto(s)

Neoplasias del Colon/genética , Neoplasias del Colon/terapia , Proteogenómica/métodos , Apoptosis/genética , Biomarcadores de Tumor/genética , Biomarcadores de Tumor/metabolismo , Linfocitos T CD8-positivos , Proliferación Celular/genética , Neoplasias del Colon/metabolismo , Genómica/métodos , Glucólisis , Humanos , Inestabilidad de Microsatélites , Mutación , Fosforilación , Estudios Prospectivos , Proteómica/métodos , Proteína de Retinoblastoma/genética , Proteína de Retinoblastoma/metabolismo

15.

Colorectal Cancer Cell Line Proteomes Are Representative of Primary Tumors and Predict Drug Sensitivity.

Wang, Jing; Mouradov, Dmitri; Wang, Xiaojing; Jorissen, Robert N; Chambers, Matthew C; Zimmerman, Lisa J; Vasaikar, Suhas; Love, Christopher G; Li, Shan; Lowes, Kym; Leuchowius, Karl-Johan; Jousset, Helene; Weinstock, Janet; Yau, Christopher; Mariadason, John; Shi, Zhiao; Ban, Yuguang; Chen, Xi; Coffey, Robert J C; Slebos, Robbert J C; Burgess, Antony W; Liebler, Daniel C; Zhang, Bing; Sieber, Oliver M.

Gastroenterology ; 153(4): 1082-1095, 2017 10.

Artículo en Inglés | MEDLINE | ID: mdl-28625833

RESUMEN

BACKGROUND AND AIMS: Proteomics holds promise for individualizing cancer treatment. We analyzed to what extent the proteomic landscape of human colorectal cancer (CRC) is maintained in established CRC cell lines and the utility of proteomics for predicting therapeutic responses. METHODS: Proteomic and transcriptomic analyses were performed on 44 CRC cell lines, compared against primary CRCs (n=95) and normal tissues (n=60), and integrated with genomic and drug sensitivity data. RESULTS: Cell lines mirrored the proteomic aberrations of primary tumors, in particular for intrinsic programs. Tumor relationships of protein expression with DNA copy number aberrations and signatures of post-transcriptional regulation were recapitulated in cell lines. The 5 proteomic subtypes previously identified in tumors were represented among cell lines. Nonetheless, systematic differences between cell line and tumor proteomes were apparent, attributable to stroma, extrinsic signaling, and growth conditions. Contribution of tumor stroma obscured signatures of DNA mismatch repair identified in cell lines with a hypermutation phenotype. Global proteomic data showed improved utility for predicting both known drug-target relationships and overall drug sensitivity as compared with genomic or transcriptomic measurements. Inhibition of targetable proteins associated with drug responses further identified corresponding synergistic or antagonistic drug combinations. Our data provide evidence for CRC proteomic subtype-specific drug responses. CONCLUSIONS: Proteomes of established CRC cell line are representative of primary tumors. Proteomic data tend to exhibit improved prediction of drug sensitivity as compared with genomic and transcriptomic profiles. Our integrative proteogenomic analysis highlights the potential of proteome profiling to inform personalized cancer medicine.

Asunto(s)

Antineoplásicos/farmacología , Biomarcadores de Tumor/metabolismo , Neoplasias Colorrectales/tratamiento farmacológico , Neoplasias Colorrectales/metabolismo , Proteínas de Neoplasias/metabolismo , Medicina de Precisión , Proteoma , Biomarcadores de Tumor/genética , Línea Celular Tumoral , Cromatografía Liquida , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Bases de Datos de Proteínas , Relación Dosis-Respuesta a Droga , Ensayos de Selección de Medicamentos Antitumorales , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Humanos , Mutación , Proteínas de Neoplasias/genética , Selección de Paciente , Polimorfismo de Nucleótido Simple , Proteómica/métodos , Transducción de Señal , Células del Estroma/metabolismo , Espectrometría de Masas en Tándem , Transcriptoma , Microambiente Tumoral

16.

WebGestalt 2017: a more comprehensive, powerful, flexible and interactive gene set enrichment analysis toolkit.

Wang, Jing; Vasaikar, Suhas; Shi, Zhiao; Greer, Michael; Zhang, Bing.

Nucleic Acids Res ; 45(W1): W130-W137, 2017 07 03.

Artículo en Inglés | MEDLINE | ID: mdl-28472511

RESUMEN

Functional enrichment analysis has played a key role in the biological interpretation of high-throughput omics data. As a long-standing and widely used web application for functional enrichment analysis, WebGestalt has been constantly updated to satisfy the needs of biologists from different research areas. WebGestalt 2017 supports 12 organisms, 324 gene identifiers from various databases and technology platforms, and 150 937 functional categories from public databases and computational analyses. Omics data with gene identifiers not supported by WebGestalt and functional categories not included in the WebGestalt database can also be uploaded for enrichment analysis. In addition to the Over-Representation Analysis in the previous versions, Gene Set Enrichment Analysis and Network Topology-based Analysis have been added to WebGestalt 2017, providing complementary approaches to the interpretation of high-throughput omics data. The new user-friendly output interface and the GOView tool allow interactive and efficient exploration and comparison of enrichment results. Thus, WebGestalt 2017 enables more comprehensive, powerful, flexible and interactive functional enrichment analysis. It is freely available at http://www.webgestalt.org.

Asunto(s)

Genes , Programas Informáticos , Animales , Bovinos , Humanos , Internet , Ratones , Neoplasias/genética , Ratas , Interfaz Usuario-Computador

17.

Empowering biologists with multi-omics data: colorectal cancer as a paradigm.

Zhu, Jing; Shi, Zhiao; Wang, Jing; Zhang, Bing.

Bioinformatics ; 31(9): 1436-43, 2015 May 01.

Artículo en Inglés | MEDLINE | ID: mdl-25527095

RESUMEN

MOTIVATION: Recent completion of the global proteomic characterization of The Cancer Genome Atlas (TCGA) colorectal cancer (CRC) cohort resulted in the first tumor dataset with complete molecular measurements at DNA, RNA and protein levels. Using CRC as a paradigm, we describe the application of the NetGestalt framework to provide easy access and interpretation of multi-omics data. RESULTS: The NetGestalt CRC portal includes genomic, epigenomic, transcriptomic, proteomic and clinical data for the TCGA CRC cohort, data from other CRC tumor cohorts and cell lines, and existing knowledge on pathways and networks, giving a total of more than 17 million data points. The portal provides features for data query, upload, visualization and integration. These features can be flexibly combined to serve various needs of the users, maximizing the synergy among omics data, human visualization and quantitative analysis. Using three case studies, we demonstrate that the portal not only provides user-friendly data query and visualization but also enables efficient data integration within a single omics data type, across multiple omics data types, and over biological networks. AVAILABILITY AND IMPLEMENTATION: The NetGestalt CRC portal can be freely accessed at http://www.netgestalt.org. CONTACT: bing.zhang@vanderbilt.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Neoplasias Colorrectales/genética , Genómica , Proteómica , Programas Informáticos , Neoplasias Colorrectales/metabolismo , Epigénesis Genética , Redes Reguladoras de Genes , Silenciador del Gen , Humanos , Neoplasias , Transcriptoma

18.

Nuclear factor of activated T-cell activity is associated with metastatic capacity in colon cancer.

Tripathi, Manish K; Deane, Natasha G; Zhu, Jing; An, Hanbing; Mima, Shinji; Wang, Xiaojing; Padmanabhan, Sekhar; Shi, Zhiao; Prodduturi, Naresh; Ciombor, Kristen K; Chen, Xi; Washington, M Kay; Zhang, Bing; Beauchamp, R Daniel.

Cancer Res ; 74(23): 6947-57, 2014 Dec 01.

Artículo en Inglés | MEDLINE | ID: mdl-25320007

RESUMEN

Metastatic recurrence is the leading cause of cancer-related deaths in patients with colorectal carcinoma. To capture the molecular underpinnings for metastasis and tumor progression, we performed integrative network analysis on 11 independent human colorectal cancer gene expression datasets and applied expression data from an immunocompetent mouse model of metastasis as an additional filter for this biologic process. In silico analysis of one metastasis-related coexpression module predicted nuclear factor of activated T-cell (NFAT) transcription factors as potential regulators for the module. Cells selected for invasiveness and metastatic capability expressed higher levels of NFATc1 as compared with poorly metastatic and less invasive parental cells. We found that inhibition of NFATc1 in human and mouse colon cancer cells resulted in decreased invasiveness in culture and downregulation of metastasis-related network genes. Overexpression of NFATc1 significantly increased the metastatic potential of colon cancer cells, whereas inhibition of NFATc1 reduced metastasis growth in an immunocompetent mouse model. Finally, we found that an 8-gene signature comprising genes upregulated by NFATc1 significantly correlated with worse clinical outcomes in stage II and III colorectal cancer patients. Thus, NFATc1 regulates colon cancer cell behavior and its transcriptional targets constitute a novel, biologically anchored gene expression signature for the identification of colon cancers with high risk of metastatic recurrence.

Asunto(s)

Neoplasias del Colon/genética , Neoplasias del Colon/patología , Factores de Transcripción NFATC/genética , Animales , Línea Celular Tumoral , Regulación Neoplásica de la Expresión Génica , Células HCT116 , Células HT29 , Humanos , Ratones , Invasividad Neoplásica , Metástasis de la Neoplasia , Factores de Transcripción/genética

19.

Proteogenomic characterization of human colon and rectal cancer.

Zhang, Bing; Wang, Jing; Wang, Xiaojing; Zhu, Jing; Liu, Qi; Shi, Zhiao; Chambers, Matthew C; Zimmerman, Lisa J; Shaddox, Kent F; Kim, Sangtae; Davies, Sherri R; Wang, Sean; Wang, Pei; Kinsinger, Christopher R; Rivers, Robert C; Rodriguez, Henry; Townsend, R Reid; Ellis, Matthew J C; Carr, Steven A; Tabb, David L; Coffey, Robert J; Slebos, Robbert J C; Liebler, Daniel C.

Nature ; 513(7518): 382-7, 2014 Sep 18.

Artículo en Inglés | MEDLINE | ID: mdl-25043054

RESUMEN

Extensive genomic characterization of human cancers presents the problem of inference from genomic abnormalities to cancer phenotypes. To address this problem, we analysed proteomes of colon and rectal tumours characterized previously by The Cancer Genome Atlas (TCGA) and perform integrated proteogenomic analyses. Somatic variants displayed reduced protein abundance compared to germline variants. Messenger RNA transcript abundance did not reliably predict protein abundance differences between tumours. Proteomics identified five proteomic subtypes in the TCGA cohort, two of which overlapped with the TCGA 'microsatellite instability/CpG island methylation phenotype' transcriptomic subtype, but had distinct mutation, methylation and protein expression patterns associated with different clinical outcomes. Although copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q candidates, including HNF4A (hepatocyte nuclear factor 4, alpha), TOMM34 (translocase of outer mitochondrial membrane 34) and SRC (SRC proto-oncogene, non-receptor tyrosine kinase). Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.

Asunto(s)

Neoplasias del Colon/genética , Neoplasias del Colon/metabolismo , Genómica , Proteoma/metabolismo , Neoplasias del Recto/genética , Neoplasias del Recto/metabolismo , Transcriptoma/genética , Cromosomas Humanos Par 20/genética , Islas de CpG/genética , Variaciones en el Número de Copia de ADN/genética , Metilación de ADN , Factor Nuclear 4 del Hepatocito/genética , Humanos , Repeticiones de Microsatélite/genética , Proteínas de Transporte de Membrana Mitocondrial/genética , Proteínas del Complejo de Importación de Proteínas Precursoras Mitocondriales , Mutación Missense/genética , Proteínas de Neoplasias/análisis , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/metabolismo , Mutación Puntual/genética , Proteoma/análisis , Proteoma/genética , Proteómica , Proto-Oncogenes Mas , Proteínas Proto-Oncogénicas pp60(c-src)/genética , ARN Mensajero/análisis , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN Neoplásico/análisis , ARN Neoplásico/genética , ARN Neoplásico/metabolismo

20.

Deciphering genomic alterations in colorectal cancer through transcriptional subtype-based network analysis.

Zhu, Jing; Wang, Jing; Shi, Zhiao; Franklin, Jeffrey L; Deane, Natasha G; Coffey, Robert J; Beauchamp, R Daniel; Zhang, Bing.

PLoS One ; 8(11): e79282, 2013.

Artículo en Inglés | MEDLINE | ID: mdl-24260186

RESUMEN

Both transcriptional subtype and signaling network analyses have proved useful in cancer genomics research. However, these two approaches are usually applied in isolation in existing studies. We reason that deciphering genomic alterations based on cancer transcriptional subtypes may help reveal subtype-specific driver networks and provide insights for the development of personalized therapeutic strategies. In this study, we defined transcriptional subtypes for colorectal cancer (CRC) and identified driver networks/pathways for each subtype. Applying consensus clustering to a patient cohort with 1173 samples identified three transcriptional subtypes, which were validated in an independent cohort with 485 samples. The three subtypes were characterized by different transcriptional programs related to normal adult colon, early colon embryonic development, and epithelial mesenchymal transition, respectively. They also showed statistically different clinical outcomes. For each subtype, we mapped somatic mutation and copy number variation data onto an integrated signaling network and identified subtype-specific driver networks using a random walk-based strategy. We found that genomic alterations in the Wnt signaling pathway were common among all three subtypes; however, unique combinations of pathway alterations including Wnt, VEGF and Notch drove distinct molecular and clinical phenotypes in different CRC subtypes. Our results provide a coherent and integrated picture of human CRC that links genomic alterations to molecular and clinical consequences, and which provides insights for the development of personalized therapeutic strategies for different CRC subtypes.

Asunto(s)

Neoplasias Colorrectales , Regulación Neoplásica de la Expresión Génica/genética , Genoma Humano , Proteínas de Neoplasias , Redes Neurales de la Computación , Vía de Señalización Wnt/genética , Animales , Estudios de Cohortes , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/metabolismo , Femenino , Humanos , Masculino , Ratones , Proteínas de Neoplasias/biosíntesis , Proteínas de Neoplasias/genética

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA