RESUMEN
Colorectal malignancies are a leading cause of cancer-related death1 and have undergone extensive genomic study2,3. However, DNA mutations alone do not fully explain malignant transformation4-7. Here we investigate the co-evolution of the genome and epigenome of colorectal tumours at single-clone resolution using spatial multi-omic profiling of individual glands. We collected 1,370 samples from 30 primary cancers and 8 concomitant adenomas and generated 1,207 chromatin accessibility profiles, 527 whole genomes and 297 whole transcriptomes. We found positive selection for DNA mutations in chromatin modifier genes and recurrent somatic chromatin accessibility alterations, including in regulatory regions of cancer driver genes that were otherwise devoid of genetic mutations. Genome-wide alterations in accessibility for transcription factor binding involved CTCF, downregulation of interferon and increased accessibility for SOX and HOX transcription factor families, suggesting the involvement of developmental genes during tumourigenesis. Somatic chromatin accessibility alterations were heritable and distinguished adenomas from cancers. Mutational signature analysis showed that the epigenome in turn influences the accumulation of DNA mutations. This study provides a map of genetic and epigenetic tumour heterogeneity, with fundamental implications for understanding colorectal cancer biology.
Asunto(s)
Neoplasias Colorrectales , Epigenoma , Genoma Humano , Mutación , Humanos , Adenoma/genética , Adenoma/patología , Transformación Celular Neoplásica/genética , Transformación Celular Neoplásica/metabolismo , Transformación Celular Neoplásica/patología , Cromatina/genética , Cromatina/metabolismo , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Epigenoma/genética , Oncogenes/genética , Factores de Transcripción/metabolismo , Genoma Humano/genética , InterferonesRESUMEN
Genetic and epigenetic variation, together with transcriptional plasticity, contribute to intratumour heterogeneity1. The interplay of these biological processes and their respective contributions to tumour evolution remain unknown. Here we show that intratumour genetic ancestry only infrequently affects gene expression traits and subclonal evolution in colorectal cancer (CRC). Using spatially resolved paired whole-genome and transcriptome sequencing, we find that the majority of intratumour variation in gene expression is not strongly heritable but rather 'plastic'. Somatic expression quantitative trait loci analysis identified a number of putative genetic controls of expression by cis-acting coding and non-coding mutations, the majority of which were clonal within a tumour, alongside frequent structural alterations. Consistently, computational inference on the spatial patterning of tumour phylogenies finds that a considerable proportion of CRCs did not show evidence of subclonal selection, with only a subset of putative genetic drivers associated with subclone expansions. Spatial intermixing of clones is common, with some tumours growing exponentially and others only at the periphery. Together, our data suggest that most genetic intratumour variation in CRC has no major phenotypic consequence and that transcriptional plasticity is, instead, widespread within a tumour.
Asunto(s)
Adaptación Fisiológica , Neoplasias Colorrectales , Regulación Neoplásica de la Expresión Génica , Fenotipo , Humanos , Adaptación Fisiológica/genética , Células Clonales/metabolismo , Células Clonales/patología , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Mutación , Secuenciación del Exoma , Transcripción GenéticaRESUMEN
ABSTRACT: SETBP1 mutations are found in various clonal myeloid disorders. However, it is unclear whether they can initiate leukemia, because SETBP1 mutations typically appear as later events during oncogenesis. To answer this question, we generated a mouse model expressing mutated SETBP1 in hematopoietic tissue: this model showed profound alterations in the differentiation program of hematopoietic progenitors and developed a myeloid neoplasm with megakaryocytic dysplasia, splenomegaly, and bone marrow fibrosis, prompting us to investigate SETBP1 mutations in a cohort of 36 triple-negative primary myelofibrosis (TN-PMF) cases. We identified 2 distinct subgroups, one carrying SETBP1 mutations and the other completely devoid of somatic variants. Clinically, a striking difference in disease aggressiveness was noted, with patients with SETBP1 mutation showing a much worse clinical course. In contrast to myelodysplastic/myeloproliferative neoplasms, in which SETBP1 mutations are mostly found as a late clonal event, single-cell clonal hierarchy reconstruction in 3 patients with TN-PMF from our cohort revealed SETBP1 to be a very early event, suggesting that the phenotype of the different SETBP1+ disorders may be shaped by the opposite hierarchy of the same clonal SETBP1 variants.
Asunto(s)
Sistema Hematopoyético , Enfermedades Mielodisplásicas-Mieloproliferativas , Trastornos Mieloproliferativos , Mielofibrosis Primaria , Animales , Ratones , Humanos , Mielofibrosis Primaria/genética , Trastornos Mieloproliferativos/genética , Mutación , Proteínas Portadoras/genética , Proteínas Nucleares/genéticaRESUMEN
Recent investigations have improved our understanding of the molecular aberrations supporting Waldenström macroglobulinemia (WM) biology; however, whether the immune microenvironment contributes to WM pathogenesis remains unanswered. First, we showed how a transgenic murine model of human-like lymphoplasmacytic lymphoma/WM exhibits an increased number of regulatory T cells (Tregs) relative to control mice. These findings were translated into the WM clinical setting, in which the transcriptomic profiling of Tregs derived from patients with WM unveiled a peculiar WM-devoted messenger RNA signature, with significant enrichment for genes related to nuclear factor κB-mediated tumor necrosis factor α signaling, MAPK, and PI3K/AKT, which was paralleled by a different Treg functional phenotype. We demonstrated significantly higher Treg induction, expansion, and proliferation triggered by WM cells, compared with their normal cellular counterpart; with a more profound effect within the context of CXCR4C1013G-mutated WM cells. By investigating the B-cell-to-T-cell cross talk at single-cell level, we identified the CD40/CD40-ligand as a potentially relevant axis that supports WM cell-Tregs interaction. Our findings demonstrate the existence of a Treg-mediated immunosuppressive phenotype in WM, which can be therapeutically reversed by blocking the CD40L/CD40 axis to inhibit WM cell growth.
Asunto(s)
Linfoma de Células B , Macroglobulinemia de Waldenström , Humanos , Animales , Ratones , Macroglobulinemia de Waldenström/patología , Ligando de CD40/genética , Fosfatidilinositol 3-Quinasas , Ligandos , Transducción de Señal , Linfoma de Células B/complicaciones , Microambiente TumoralRESUMEN
BACKGROUND: Copy number alterations (CNAs) are genetic changes commonly found in cancer that involve different regions of the genome and impact cancer progression by affecting gene expression and genomic stability. Computational techniques can analyze copy number data obtained from high-throughput sequencing platforms, and various tools visualize and analyze CNAs in cancer genomes, providing insights into genetic mechanisms driving cancer development and progression. However, tools for visualizing copy number data in cancer research have some limitations. In fact, they can be complex to use and require expertise in bioinformatics or computational biology. While copy number data analysis and visualization provide insights into cancer biology, interpreting results can be challenging, and there may be multiple explanations for observed patterns of copy number alterations. RESULTS: We created Control-FREEC Viewer, a tool that facilitates effective visualization and exploration of copy number data. With Control-FREEC Viewer, experimental data can be easily loaded by the user. After choosing the reference genome, copy number data are displayed in whole genome or single chromosome view. Gain or loss on a specific gene can be found and visualized on each chromosome. Analysis parameters for subsequent sessions can be stored and images can be exported in raster and vector formats. CONCLUSIONS: Control-FREEC Viewer enables users to import and visualize data analyzed by the Control-FREEC tool, as well as by other tools sharing a similar tabular output, providing a comprehensive and intuitive graphical user interface for data visualization.
Asunto(s)
Neoplasias , Programas Informáticos , Humanos , Variaciones en el Número de Copia de ADN , Genoma , Biología Computacional/métodos , Neoplasias/genéticaRESUMEN
In a first-of-its-kind study, we assessed the capabilities of large language models (LLMs) in making complex decisions in haematopoietic stem cell transplantation. The evaluation was conducted not only for Generative Pre-trained Transformer 4 (GPT-4) but also conducted on other artificial intelligence models: PaLm 2 and Llama-2. Using detailed haematological histories that include both clinical, molecular and donor data, we conducted a triple-blind survey to compare LLMs to haematology residents. We found that residents significantly outperformed LLMs (p = 0.02), particularly in transplant eligibility assessment (p = 0.01). Our triple-blind methodology aimed to mitigate potential biases in evaluating LLMs and revealed both their promise and limitations in deciphering complex haematological clinical scenarios.
Asunto(s)
Inteligencia Artificial , Trasplante de Células Madre Hematopoyéticas , Humanos , Lenguaje , Donantes de TejidosRESUMEN
BACKGROUND: Longitudinal single-cell sequencing experiments of patient-derived models are increasingly employed to investigate cancer evolution. In this context, robust computational methods are needed to properly exploit the mutational profiles of single cells generated via variant calling, in order to reconstruct the evolutionary history of a tumor and characterize the impact of therapeutic strategies, such as the administration of drugs. To this end, we have recently developed the LACE framework for the Longitudinal Analysis of Cancer Evolution. RESULTS: The LACE 2.0 release aimed at inferring longitudinal clonal trees enhances the original framework with new key functionalities: an improved data management for preprocessing of standard variant calling data, a reworked inference engine, and direct connection to public databases. CONCLUSIONS: All of this is accessible through a new and interactive Shiny R graphical interface offering the possibility to apply filters helpful in discriminating relevant or potential driver mutations, set up inferential parameters, and visualize the results. The software is available at: github.com/BIMIB-DISCo/LACE.
Asunto(s)
Neoplasias , Programas Informáticos , Humanos , Neoplasias/genética , Células ClonalesRESUMEN
Mantle-cell lymphoma (MCL) is a B-cell non-Hodgkin Lymphoma (NHL) with a poor prognosis, at high risk of relapse after conventional treatment. MCL-associated tumour microenvironment (TME) is characterized by M2-like tumour-associated macrophages (TAMs), able to interact with cancer cells, providing tumour survival and resistance to immuno-chemotherapy. Likewise, monocyte-derived nurse-like cells (NLCs) present M2-like profile and provide proliferation signals to chronic lymphocytic leukaemia (CLL), a B-cell malignancy sharing with MCL some biological and phenotypic features. Antibodies against TAMs targeted CD47, a 'don't eat me' signal (DEMs) able to quench phagocytosis by TAMs within TME, with clinical effectiveness when combined with Rituximab in pretreated NHL. Recently, CD24 was found as valid DEMs in solid cancer. Since CD24 is expressed during B-cell differentiation, we investigated and identified consistent CD24 in MCL, CLL and primary human samples. Phagocytosis increased when M2-like macrophages were co-cultured with cancer cells, particularly in the case of paired DEMs blockade (i.e. anti-CD24 + anti-CD47) combined with Rituximab. Similarly, unstimulated CLL patients-derived NLCs provided increased phagocytosis when DEMs blockade occurred. Since high levels of CD24 were associated with worse survival in both MCL and CLL, anti-CD24-induced phagocytosis could be considered for future clinical use, particularly in association with other agents such as Rituximab.
Asunto(s)
Leucemia Linfocítica Crónica de Células B , Linfoma de Células del Manto , Adulto , Humanos , Rituximab/farmacología , Rituximab/uso terapéutico , Leucemia Linfocítica Crónica de Células B/tratamiento farmacológico , Leucemia Linfocítica Crónica de Células B/patología , Linfoma de Células del Manto/tratamiento farmacológico , Antígeno CD47 , Recurrencia Local de Neoplasia , Fagocitosis , Microambiente Tumoral , Antígeno CD24RESUMEN
MOTIVATION: Driver (epi)genomic alterations underlie the positive selection of cancer subpopulations, which promotes drug resistance and relapse. Even though substantial heterogeneity is witnessed in most cancer types, mutation accumulation patterns can be regularly found and can be exploited to reconstruct predictive models of cancer evolution. Yet, available methods can not infer logical formulas connecting events to represent alternative evolutionary routes or convergent evolution. RESULTS: We introduce PMCE, an expressive framework that leverages mutational profiles from cross-sectional sequencing data to infer probabilistic graphical models of cancer evolution including arbitrary logical formulas, and which outperforms the state-of-the-art in terms of accuracy and robustness to noise, on simulations. The application of PMCE to 7866 samples from the TCGA database allows us to identify a highly significant correlation between the predicted evolutionary paths and the overall survival in 7 tumor types, proving that our approach can effectively stratify cancer patients in reliable risk groups. AVAILABILITY AND IMPLEMENTATION: PMCE is freely available at https://github.com/BIMIB-DISCo/PMCE, in addition to the code to replicate all the analyses presented in the manuscript. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Algoritmos , Neoplasias , Humanos , Pronóstico , Estudios Transversales , Neoplasias/genética , GenómicaRESUMEN
Cancer is the result of mutagenic processes that can be inferred from tumor genomes by analyzing rate spectra of point mutations, or "mutational signatures". Here we present SparseSignatures, a novel framework to extract signatures from somatic point mutation data. Our approach incorporates a user-specified background signature, employs regularization to reduce noise in non-background signatures, uses cross-validation to identify the number of signatures, and is scalable to large datasets. We show that SparseSignatures outperforms current state-of-the-art methods on simulated data using a variety of standard metrics. We then apply SparseSignatures to whole genome sequences of pancreatic and breast tumors, discovering well-differentiated signatures that are linked to known mutagenic mechanisms and are strongly associated with patient clinical features.
Asunto(s)
Análisis Mutacional de ADN/estadística & datos numéricos , Neoplasias/genética , Mutación Puntual , Algoritmos , Biomarcadores de Tumor/genética , Neoplasias de la Mama/clasificación , Neoplasias de la Mama/genética , Biología Computacional , Simulación por Computador , Bases de Datos Genéticas/estadística & datos numéricos , Femenino , Genes BRCA1 , Genes BRCA2 , Genoma Humano , Humanos , Neoplasias Pancreáticas/clasificación , Neoplasias Pancreáticas/genética , Programas InformáticosRESUMEN
Recurrent successions of genomic changes, both within and between patients, reflect repeated evolutionary processes that are valuable for the anticipation of cancer progression. Multi-region sequencing allows the temporal order of some genomic changes in a tumor to be inferred, but the robust identification of repeated evolution across patients remains a challenge. We developed a machine-learning method based on transfer learning that allowed us to overcome the stochastic effects of cancer evolution and noise in data and identified hidden evolutionary patterns in cancer cohorts. When applied to multi-region sequencing datasets from lung, breast, renal, and colorectal cancer (768 samples from 178 patients), our method detected repeated evolutionary trajectories in subgroups of patients, which were reproduced in single-sample cohorts (n = 2,935). Our method provides a means of classifying patients on the basis of how their tumor evolved, with implications for the anticipation of disease progression.
Asunto(s)
Evolución Molecular , Neoplasias/clasificación , Neoplasias/patología , Línea Celular Tumoral , Estudios de Cohortes , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Aprendizaje Automático , Neoplasias/genética , Reproducibilidad de los Resultados , Procesos EstocásticosRESUMEN
We present single-cell interpretation via multikernel learning (SIMLR), an analytic framework and software which learns a similarity measure from single-cell RNA-seq data in order to perform dimension reduction, clustering and visualization. On seven published data sets, we benchmark SIMLR against state-of-the-art methods. We show that SIMLR is scalable and greatly enhances clustering performance while improving the visualization and interpretability of single-cell sequencing data.
Asunto(s)
Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Humanos , Neutrófilos/citología , Neutrófilos/fisiologíaRESUMEN
BACKGROUND: A large number of algorithms is being developed to reconstruct evolutionary models of individual tumours from genome sequencing data. Most methods can analyze multiple samples collected either through bulk multi-region sequencing experiments or the sequencing of individual cancer cells. However, rarely the same method can support both data types. RESULTS: We introduce TRaIT, a computational framework to infer mutational graphs that model the accumulation of multiple types of somatic alterations driving tumour evolution. Compared to other tools, TRaIT supports multi-region and single-cell sequencing data within the same statistical framework, and delivers expressive models that capture many complex evolutionary phenomena. TRaIT improves accuracy, robustness to data-specific errors and computational complexity compared to competing methods. CONCLUSIONS: We show that the application of TRaIT to single-cell and multi-region cancer datasets can produce accurate and reliable models of single-tumour evolution, quantify the extent of intra-tumour heterogeneity and generate new testable experimental hypotheses.
Asunto(s)
Algoritmos , Neoplasias/patología , Biología Computacional/métodos , Evolución Molecular , Humanos , Mutación , Neoplasias/clasificación , Neoplasias/genética , Análisis de Secuencia de ADN , Análisis de la Célula IndividualRESUMEN
The genomic evolution inherent to cancer relates directly to a renewed focus on the voluminous next-generation sequencing data and machine learning for the inference of explanatory models of how the (epi)genomic events are choreographed in cancer initiation and development. However, despite the increasing availability of multiple additional -omics data, this quest has been frustrated by various theoretical and technical hurdles, mostly stemming from the dramatic heterogeneity of the disease. In this paper, we build on our recent work on the "selective advantage" relation among driver mutations in cancer progression and investigate its applicability to the modeling problem at the population level. Here, we introduce PiCnIc (Pipeline for Cancer Inference), a versatile, modular, and customizable pipeline to extract ensemble-level progression models from cross-sectional sequenced cancer genomes. The pipeline has many translational implications because it combines state-of-the-art techniques for sample stratification, driver selection, identification of fitness-equivalent exclusive alterations, and progression model inference. We demonstrate PiCnIc's ability to reproduce much of the current knowledge on colorectal cancer progression as well as to suggest novel experimentally verifiable hypotheses.
Asunto(s)
Evolución Biológica , Neoplasias Colorrectales/genética , Modelos Genéticos , Algoritmos , Humanos , Aprendizaje Automático , Repeticiones de MicrosatéliteRESUMEN
SIMLR (Single-cell Interpretation via Multi-kernel LeaRning), an open-source tool that implements a novel framework to learn a sample-to-sample similarity measure from expression data observed for heterogenous samples, is presented here. SIMLR can be effectively used to perform tasks such as dimension reduction, clustering, and visualization of heterogeneous populations of samples. SIMLR was benchmarked against state-of-the-art methods for these three tasks on several public datasets, showing it to be scalable and capable of greatly improving clustering performance, as well as providing valuable insights by making the data more interpretable via better a visualization. SIMLR is available on https://github.com/BatzoglouLabSU/SIMLRGitHub in both R and MATLAB implementations. Furthermore, it is also available as an R package on http://bioconductor.org.
Asunto(s)
Genómica/métodos , Aprendizaje Automático , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Programas Informáticos , Algoritmos , HumanosRESUMEN
MOTIVATION: We introduce TRanslational ONCOlogy (TRONCO), an open-source R package that implements the state-of-the-art algorithms for the inference of cancer progression models from (epi)genomic mutational profiles. TRONCO can be used to extract population-level models describing the trends of accumulation of alterations in a cohort of cross-sectional samples, e.g. retrieved from publicly available databases, and individual-level models that reveal the clonal evolutionary history in single cancer patients, when multiple samples, e.g. multiple biopsies or single-cell sequencing data, are available. The resulting models can provide key hints for uncovering the evolutionary trajectories of cancer, especially for precision medicine or personalized therapy. AVAILABILITY AND IMPLEMENTATION: TRONCO is released under the GPL license, is hosted at http://bimib.disco.unimib.it/ (Software section) and archived also at bioconductor.org. CONTACT: tronco@disco.unimib.it SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Modelos Teóricos , Neoplasias/genética , Programas Informáticos , Algoritmos , Progresión de la Enfermedad , Epigénesis Genética , Genómica , Humanos , Interfaz Usuario-ComputadorRESUMEN
UNLABELLED: We devise a novel inference algorithm to effectively solve the cancer progression model reconstruction problem. Our empirical analysis of the accuracy and convergence rate of our algorithm, CAncer PRogression Inference (CAPRI), shows that it outperforms the state-of-the-art algorithms addressing similar problems. MOTIVATION: Several cancer-related genomic data have become available (e.g. The Cancer Genome Atlas, TCGA) typically involving hundreds of patients. At present, most of these data are aggregated in a cross-sectional fashion providing all measurements at the time of diagnosis. Our goal is to infer cancer 'progression' models from such data. These models are represented as directed acyclic graphs (DAGs) of collections of 'selectivity' relations, where a mutation in a gene A 'selects' for a later mutation in a gene B. Gaining insight into the structure of such progressions has the potential to improve both the stratification of patients and personalized therapy choices. RESULTS: The CAPRI algorithm relies on a scoring method based on a probabilistic theory developed by Suppes, coupled with bootstrap and maximum likelihood inference. The resulting algorithm is efficient, achieves high accuracy and has good complexity, also, in terms of convergence properties. CAPRI performs especially well in the presence of noise in the data, and with limited sample sizes. Moreover CAPRI, in contrast to other approaches, robustly reconstructs different types of confluent trajectories despite irregularities in the data. We also report on an ongoing investigation using CAPRI to study atypical Chronic Myeloid Leukemia, in which we uncovered non trivial selectivity relations and exclusivity patterns among key genomic events. AVAILABILITY AND IMPLEMENTATION: CAPRI is part of the TRanslational ONCOlogy R package and is freely available on the web at: http://bimib.disco.unimib.it/index.php/Tronco CONTACT: daniele.ramazzotti@disco.unimib.it SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Algoritmos , Biología Computacional/métodos , Leucemia Mielógena Crónica BCR-ABL Positiva/genética , Leucemia Mielógena Crónica BCR-ABL Positiva/patología , Modelos Teóricos , Estudios Transversales , Bases de Datos Genéticas , Progresión de la Enfermedad , Humanos , Mutación/genética , Probabilidad , Transducción de SeñalRESUMEN
A group of 27 patients diagnosed with metastatic triple-negative breast cancer (mTNBC) was randomly distributed into two groups and underwent different lines of metronomic treatment (mCHT). The former group (N 14) received first-line mCHT and showed a higher overall survival rate than the second group (N 13), which underwent second-line mCHT. Analysis of one patient still alive from the first group, diagnosed with mTNBC in 2019, showed a complete metabolic response (CMR) after a composite approach implicating first-line mCHT followed by second-line epirubicin and third-line nab-paclitaxel, and was chosen for subsequent molecular characterization. We found altered expression in the cancer stemness-associated gene NOTCH-1 and its corresponding protein. Additionally, we found changes in the expression of oncogenes, such as MYC and AKT, along with their respective proteins. Overall, our data suggest that a first-line treatment with mCHT followed by MTD might be effective by negatively regulating stemness traits usually associated with the emergence of drug resistance.
RESUMEN
The patterns by which primary tumors spread to metastatic sites remain poorly understood. Here, we define patterns of metastatic seeding in prostate cancer using a novel injection-based mouse model-EvoCaP (Evolution in Cancer of the Prostate), featuring aggressive metastatic cancer to bone, liver, lungs, and lymph nodes. To define migration histories between primary and metastatic sites, we used our EvoTraceR pipeline to track distinct tumor clones containing recordable barcodes. We detected widespread intratumoral heterogeneity from the primary tumor in metastatic seeding, with few clonal populations instigating most migration. Metastasis-to-metastasis seeding was uncommon, as most cells remained confined within the tissue. Migration patterns in our model were congruent with human prostate cancer seeding topologies. Our findings support the view of metastatic prostate cancer as a systemic disease driven by waves of aggressive clones expanding their niche, infrequently overcoming constraints that otherwise keep them confined in the primary or metastatic site. Significance: Defining the kinetics of prostate cancer metastasis is critical for developing novel therapeutic strategies. This study uses CRISPR/Cas9-based barcoding technology to accurately define tumor clonal patterns and routes of migration in a novel somatically engineered mouse model (EvoCaP) that recapitulates human prostate cancer using an in-house developed analytical pipeline (EvoTraceR).
Asunto(s)
Metástasis de la Neoplasia , Neoplasias de la Próstata , Masculino , Neoplasias de la Próstata/patología , Neoplasias de la Próstata/genética , Animales , Ratones , Humanos , Movimiento Celular , Modelos Animales de EnfermedadRESUMEN
Cancer evolution lays the groundwork for predictive oncology. Testing evolutionary metrics requires quantitative measurements in controlled clinical trials. We mapped genomic intratumor heterogeneity in locally advanced prostate cancer using 642 samples from 114 individuals enrolled in clinical trials with a 12-year median follow-up. We concomitantly assessed morphological heterogeneity using deep learning in 1,923 histological sections from 250 individuals. Genetic and morphological (Gleason) diversity were independent predictors of recurrence (hazard ratio (HR) = 3.12 and 95% confidence interval (95% CI) = 1.34-7.3; HR = 2.24 and 95% CI = 1.28-3.92). Combined, they identified a group with half the median time to recurrence. Spatial segregation of clones was also an independent marker of recurrence (HR = 2.3 and 95% CI = 1.11-4.8). We identified copy number changes associated with Gleason grade and found that chromosome 6p loss correlated with reduced immune infiltration. Matched profiling of relapse, decades after diagnosis, confirmed that genomic instability is a driving force in prostate cancer progression. This study shows that combining genomics with artificial intelligence-aided histopathology leads to the identification of clinical biomarkers of evolution.