RESUMEN
Long non-coding RNAs (lncRNAs) are increasingly recognized as functional units in cancer and powerful biomarkers; however, most remain uncharacterized. Here, we analyze 5,592 prognostic lncRNAs in 9,446 cancers of 30 types using machine learning. We identify 166 lncRNAs whose expression correlates with survival and improves the accuracy of common clinical variables, molecular features, and cancer subtypes. Prognostic lncRNAs are often characterized by switch-like expression patterns. In low-grade gliomas, HOXA10-AS activation is a robust marker of poor prognosis that complements IDH1/2 mutations, as validated in another retrospective cohort, and correlates with developmental pathways in tumor transcriptomes. Loss- and gain-of-function studies in patient-derived glioma cells, organoids, and xenograft models identify HOXA10-AS as a potent onco-lncRNA that regulates cell proliferation, contact inhibition, invasion, Hippo signaling, and mitotic and neuro-developmental pathways. Our study underscores the pan-cancer potential of the non-coding transcriptome for identifying biomarkers and regulators of cancer progression.
Asunto(s)
Biomarcadores de Tumor/metabolismo , Neoplasias Encefálicas/metabolismo , Perfilación de la Expresión Génica , Glioma/metabolismo , ARN Largo no Codificante/metabolismo , Transcriptoma , Animales , Biomarcadores de Tumor/genética , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patología , Línea Celular Tumoral , Movimiento Celular , Proliferación Celular , Bases de Datos Genéticas , Regulación Neoplásica de la Expresión Génica , Glioma/genética , Glioma/patología , Humanos , Isocitrato Deshidrogenasa/genética , Isocitrato Deshidrogenasa/metabolismo , Aprendizaje Automático , Ratones Endogámicos NOD , Ratones SCID , Mutación , Invasividad Neoplásica , Valor Predictivo de las Pruebas , Pronóstico , ARN Largo no Codificante/genética , Reproducibilidad de los Resultados , Transducción de SeñalRESUMEN
The most common events in breast cancer (BC) involve chromosome arm losses and gains. Here we describe identification of 1089 gene-centric common insertion sites (gCIS) from transposon-based screens in 8 mouse models of BC. Some gCIS are driver-specific, others driver non-specific, and still others associated with tumor histology. Processes affected by driver-specific and histology-specific mutations include well-known cancer pathways. Driver non-specific gCIS target the Mediator complex, Ca++ signaling, Cyclin D turnover, RNA-metabolism among other processes. Most gCIS show single allele disruption and many map to genomic regions showing high-frequency hemizygous loss in human BC. Two gCIS, Nf1 and Trps1, show synthetic haploinsufficient tumor suppressor activity. Many gCIS act on the same pathway responsible for tumor initiation, thereby selecting and sculpting just enough and just right signaling. These data highlight ~1000 genes with predicted conditional haploinsufficient tumor suppressor function and the potential to promote chromosome arm loss in BC.
Asunto(s)
Neoplasias de la Mama/genética , Pérdida de Heterocigocidad/genética , Animales , Neoplasias de la Mama/patología , Transformación Celular Neoplásica , Elementos Transponibles de ADN/genética , Femenino , Genes Supresores de Tumor , Humanos , Ratones , Mutagénesis Insercional , Neoplasias Experimentales , Transducción de SeñalRESUMEN
BACKGROUND: Cancer genomes are shaped by mutational processes with complex spatial variation at multiple scales. Entire classes of regulatory elements are affected by local variations in mutation frequency. However, the underlying mechanisms with functional and genetic determinants remain poorly understood. RESULTS: We characterise the mutational landscape of 1.3 million gene-regulatory and chromatin architectural elements in 2419 whole cancer genomes with transcriptional and pathway activity, functional conservation and recurrent driver events. We develop RM2, a statistical model that quantifies mutational enrichment or depletion in classes of genomic elements through genetic, trinucleotide and megabase-scale effects. We report a map of localised mutational processes affecting CTCF binding sites, transcription start sites (TSS) and tissue-specific open-chromatin regions. Increased mutation frequency in TSSs associates with mRNA abundance in most cancer types, while open-chromatin regions are generally enriched in mutations. We identify ~ 10,000 CTCF binding sites with core DNA motifs and constitutive binding in 66 cell types that represent focal points of mutagenesis. We detect site-specific mutational signature enrichments, such as SBS40 in open-chromatin regions in prostate cancer and SBS17b in CTCF binding sites in gastrointestinal cancers. Candidate drivers of localised mutagenesis are also apparent: BRAF mutations associate with mutational enrichments at CTCF binding sites in melanoma, and ARID1A mutations with TSS-specific mutagenesis in pancreatic cancer. CONCLUSIONS: Our method and catalogue of localised mutational processes provide novel perspectives to cancer genome evolution, mutagenesis, DNA repair and driver gene discovery. The functional and genetic correlates of mutational processes suggest mechanistic hypotheses for future studies.