RESUMEN
Methods that integrate molecular network information and tumor genome data could complement gene-based statistical tests to identify likely new cancer genes; but such approaches are challenging to validate at scale, and their predictive value remains unclear. We developed a robust statistic (NetSig) that integrates protein interaction networks with data from 4,742 tumor exomes. NetSig can accurately classify known driver genes in 60% of tested tumor types and predicts 62 new driver candidates. Using a quantitative experimental framework to determine in vivo tumorigenic potential in mice, we found that NetSig candidates induce tumors at rates that are comparable to those of known oncogenes and are ten-fold higher than those of random genes. By reanalyzing nine tumor-inducing NetSig candidates in 242 patients with oncogene-negative lung adenocarcinomas, we find that two (AKT2 and TFDP2) are significantly amplified. Our study presents a scalable integrated computational and experimental workflow to expand discovery from cancer genomes.
Asunto(s)
Carcinogénesis/genética , Biología Computacional/métodos , Regulación Neoplásica de la Expresión Génica , Redes Reguladoras de Genes , Proteínas de Neoplasias/genética , Neoplasias/genética , Humanos , MutaciónRESUMEN
UNLABELLED: Cancer genome characterization efforts now provide an initial view of the somatic alterations in primary tumors. However, most point mutations occur at low frequency, and the function of these alleles remains undefined. We have developed a scalable systematic approach to interrogate the function of cancer-associated gene variants. We subjected 474 mutant alleles curated from 5,338 tumors to pooled in vivo tumor formation assays and gene expression profiling. We identified 12 transforming alleles, including two in genes (PIK3CB, POT1) that have not been shown to be tumorigenic. One rare KRAS allele, D33E, displayed tumorigenicity and constitutive activation of known RAS effector pathways. By comparing gene expression changes induced upon expression of wild-type and mutant alleles, we inferred the activity of specific alleles. Because alleles found to be mutated only once in 5,338 tumors rendered cells tumorigenic, these observations underscore the value of integrating genomic information with functional studies. SIGNIFICANCE: Experimentally inferring the functional status of cancer-associated mutations facilitates the interpretation of genomic information in cancer. Pooled in vivo screen and gene expression profiling identified functional variants and demonstrated that expression of rare variants induced tumorigenesis. Variant phenotyping through functional studies will facilitate defining key somatic events in cancer. Cancer Discov; 6(7); 714-26. ©2016 AACR.See related commentary by Cho and Collisson, p. 694This article is highlighted in the In This Issue feature, p. 681.
Asunto(s)
Alelos , Transformación Celular Neoplásica/genética , Variación Genética , Neoplasias/genética , Oncogenes , Animales , Línea Celular Tumoral , Modelos Animales de Enfermedad , Perfilación de la Expresión Génica/métodos , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Xenoinjertos , Ensayos Analíticos de Alto Rendimiento , Humanos , Masculino , Ratones , Neoplasias/diagnóstico , Reproducibilidad de los ResultadosRESUMEN
Cellular signal transduction generally involves cascades of post-translational protein modifications that rapidly catalyze changes in protein-DNA interactions and gene expression. High-throughput measurements are improving our ability to study each of these stages individually, but do not capture the connections between them. Here we present an approach for building a network of physical links among these data that can be used to prioritize targets for pharmacological intervention. Our method recovers the critical missing links between proteomic and transcriptional data by relating changes in chromatin accessibility to changes in expression and then uses these links to connect proteomic and transcriptome data. We applied our approach to integrate epigenomic, phosphoproteomic and transcriptome changes induced by the variant III mutation of the epidermal growth factor receptor (EGFRvIII) in a cell line model of glioblastoma multiforme (GBM). To test the relevance of the network, we used small molecules to target highly connected nodes implicated by the network model that were not detected by the experimental data in isolation and we found that a large fraction of these agents alter cell viability. Among these are two compounds, ICG-001, targeting CREB binding protein (CREBBP), and PKF118-310, targeting ß-catenin (CTNNB1), which have not been tested previously for effectiveness against GBM. At the level of transcriptional regulation, we used chromatin immunoprecipitation sequencing (ChIP-Seq) to experimentally determine the genome-wide binding locations of p300, a transcriptional co-regulator highly connected in the network. Analysis of p300 target genes suggested its role in tumorigenesis. We propose that this general method, in which experimental measurements are used as constraints for building regulatory networks from the interactome while taking into account noise and missing data, should be applicable to a wide range of high-throughput datasets.
Asunto(s)
Biología Computacional/métodos , Perfilación de la Expresión Génica/métodos , Oncogenes , Mapas de Interacción de Proteínas , Transducción de Señal , Línea Celular Tumoral , Supervivencia Celular/efectos de los fármacos , Descubrimiento de Drogas , Glioblastoma/genética , Glioblastoma/metabolismo , Humanos , Reproducibilidad de los Resultados , TranscriptomaRESUMEN
Polycomb repressive complexes (PRCs) play key roles in developmental epigenetic regulation. Yet the mechanisms that target PRCs to specific loci in mammalian cells remain incompletely understood. In this study we show that Bmi1, a core component of Polycomb Repressive Complex 1 (PRC1), binds directly to the Runx1/CBFß transcription factor complex. Genome-wide studies in megakaryocytic cells demonstrate significant chromatin occupancy overlap between the PRC1 core component Ring1b and Runx1/CBFß and functional regulation of a considerable fraction of commonly bound genes. Bmi1/Ring1b and Runx1/CBFß deficiencies generate partial phenocopies of one another in vivo. We also show that Ring1b occupies key Runx1 binding sites in primary murine thymocytes and that this occurs via PRC2-independent mechanisms. Genetic depletion of Runx1 results in reduced Ring1b binding at these sites in vivo. These findings provide evidence for site-specific PRC1 chromatin recruitment by core binding transcription factors in mammalian cells.
Asunto(s)
Cromatina/metabolismo , Subunidad alfa 2 del Factor de Unión al Sitio Principal/metabolismo , Subunidad beta del Factor de Unión al Sitio Principal/metabolismo , Proteínas Represoras/metabolismo , Animales , Línea Celular , Cromatografía de Afinidad , Análisis por Conglomerados , Subunidad alfa 2 del Factor de Unión al Sitio Principal/genética , Subunidad beta del Factor de Unión al Sitio Principal/genética , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Técnicas de Silenciamiento del Gen , Células Madre Hematopoyéticas/fisiología , Megacariocitos/metabolismo , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Complejo Represivo Polycomb 1 , Proteínas del Grupo Polycomb , Unión Proteica , Multimerización de Proteína , Proteínas Proto-Oncogénicas/genética , Proteínas Proto-Oncogénicas/metabolismo , Proteínas Represoras/genética , Proteínas Represoras/aislamiento & purificación , Linfocitos T/metabolismo , Timocitos/metabolismo , Ubiquitina-Proteína Ligasas/genética , Ubiquitina-Proteína Ligasas/metabolismo , Pez Cebra/embriología , Pez Cebra/genéticaRESUMEN
The mitogen-activated protein kinase (MAPK) extracellular signal-regulated kinase 2 (ERK2) is ubiquitously expressed in mammalian tissues and is involved in a wide range of biological processes. Although MAPKs have been intensely studied, identification of their substrates remains challenging. We have optimized a chemical genetic system using analog-sensitive ERK2, a form of ERK2 engineered to use an analog of adenosine 5'-triphosphate (ATP), to tag and isolate ERK2 substrates in vitro. This approach identified 80 proteins phosphorylated by ERK2, 13 of which are known ERK2 substrates. The 80 substrates are associated with diverse cellular processes, including regulation of transcription and translation, mRNA processing, and regulation of the activity of the Rho family guanosine triphosphatases. We found that one of the newly identified substrates, ETV3 (a member of the E twenty-six family of transcriptional regulators), was extensively phosphorylated on sites within canonical and noncanonical ERK motifs. Phosphorylation of ETV3 regulated transcription by preventing its binding to DNA at promoters for several thousand genes, including some involved in negative feedback regulation of itself and of upstream signals.