ABSTRACT
Pancreatic cancer, a highly aggressive tumour type with uniformly poor prognosis, exemplifies the classically held view of stepwise cancer development. The current model of tumorigenesis, based on analyses of precursor lesions, termed pancreatic intraepithelial neoplasm (PanINs) lesions, makes two predictions: first, that pancreatic cancer develops through a particular sequence of genetic alterations (KRAS, followed by CDKN2A, then TP53 and SMAD4); and second, that the evolutionary trajectory of pancreatic cancer progression is gradual because each alteration is acquired independently. A shortcoming of this model is that clonally expanded precursor lesions do not always belong to the tumour lineage, indicating that the evolutionary trajectory of the tumour lineage and precursor lesions can be divergent. This prevailing model of tumorigenesis has contributed to the clinical notion that pancreatic cancer evolves slowly and presents at a late stage. However, the propensity for this disease to rapidly metastasize and the inability to improve patient outcomes, despite efforts aimed at early detection, suggest that pancreatic cancer progression is not gradual. Here, using newly developed informatics tools, we tracked changes in DNA copy number and their associated rearrangements in tumour-enriched genomes and found that pancreatic cancer tumorigenesis is neither gradual nor follows the accepted mutation order. Two-thirds of tumours harbour complex rearrangement patterns associated with mitotic errors, consistent with punctuated equilibrium as the principal evolutionary trajectory. In a subset of cases, the consequence of such errors is the simultaneous, rather than sequential, knockout of canonical preneoplastic genetic drivers that are likely to set-off invasive cancer growth. These findings challenge the current progression model of pancreatic cancer and provide insights into the mutational processes that give rise to these aggressive tumours.
Subject(s)
Carcinogenesis/genetics , Carcinogenesis/pathology , Gene Rearrangement/genetics , Genome, Human/genetics , Models, Biological , Mutagenesis/genetics , Pancreatic Neoplasms/genetics , Pancreatic Neoplasms/pathology , Carcinoma in Situ/genetics , Chromothripsis , DNA Copy Number Variations/genetics , Disease Progression , Evolution, Molecular , Female , Genes, Neoplasm/genetics , Humans , Male , Mitosis/genetics , Mutation/genetics , Neoplasm Invasiveness/genetics , Neoplasm Invasiveness/pathology , Neoplasm Metastasis/genetics , Neoplasm Metastasis/pathology , Polyploidy , Precancerous Conditions/geneticsABSTRACT
INTRODUCTION: Transcriptional analyses have identified several distinct molecular subtypes in pancreatic ductal adenocarcinoma (PDAC) that have prognostic and potential therapeutic significance. However, to date, an indepth, clinicomorphological correlation of these molecular subtypes has not been performed. We sought to identify specific morphological patterns to compare with known molecular subtypes, interrogate their biological significance, and furthermore reappraise the current grading system in PDAC. DESIGN: We first assessed 86 primary, chemotherapy-naive PDAC resection specimens with matched RNA-Seq data for specific, reproducible morphological patterns. Differential expression was applied to the gene expression data using the morphological features. We next compared the differentially expressed gene signatures with previously published molecular subtypes. Overall survival (OS) was correlated with the morphological and molecular subtypes. RESULTS: We identified four morphological patterns that segregated into two components ('gland forming' and 'non-gland forming') based on the presence/absence of well-formed glands. A morphological cut-off (≥40% 'non-gland forming') was established using RNA-Seq data, which identified two groups (A and B) with gene signatures that correlated with known molecular subtypes. There was a significant difference in OS between the groups. The morphological groups remained significantly prognostic within cancers that were moderately differentiated and classified as 'classical' using RNA-Seq. CONCLUSION: Our study has demonstrated that PDACs can be morphologically classified into distinct and biologically relevant categories which predict known molecular subtypes. These results provide the basis for an improved taxonomy of PDAC, which may lend itself to future treatment strategies and the development of deep learning models.
Subject(s)
Carcinoma, Pancreatic Ductal/pathology , Pancreatic Neoplasms/pathology , Adult , Aged , Carcinoma, Pancreatic Ductal/genetics , Deep Learning , Female , Gene Expression Regulation, Neoplastic , Humans , Kaplan-Meier Estimate , Male , Middle Aged , Observer Variation , Pancreatic Neoplasms/genetics , Prognosis , Reproducibility of Results , Transcriptome , Pancreatic NeoplasmsABSTRACT
RNA-seq is a key technology for understanding the biology of the cell because of its ability to profile transcriptional and post-transcriptional regulation at single nucleotide resolutions. Compared to DNA sequencing alignment algorithms, RNA-seq alignment algorithms have a diminished ability to accurately detect and map base pair substitutions, gaps, discordant pairs and repetitive regions. These shortcomings adversely affect experiments that require a high degree of accuracy, notably the ability to detect RNA editing. We have developed RNASequel, a software package that runs as a post-processing step in conjunction with an RNA-seq aligner and systematically corrects common alignment artifacts. Its key innovations are a two-pass splice junction alignment system that includes de novo splice junctions and the use of an empirically determined estimate of the fragment size distribution when resolving read pairs. We demonstrate that RNASequel produces improved alignments when used in conjunction with STAR or Tophat2 using two simulated datasets. We then show that RNASequel improves the identification of adenosine to inosine RNA editing sites on biological datasets. This software will be useful in applications requiring the accurate identification of variants in RNA sequencing data, the discovery of RNA editing sites and the analysis of alternative splicing.
Subject(s)
Sequence Alignment/methods , Sequence Analysis, RNA/methods , Software , RNA Editing , RNA Splice SitesABSTRACT
The ongoing advancements in CRISPR-Cas technologies can significantly accelerate the preclinical development of both in vivo and ex vivo organ genome-editing therapeutics. One of the promising applications is to genetically modify donor organs prior to implantation. The implantation of optimized donor organs with long-lasting immunomodulatory capacity holds promise for reducing the need for lifelong potent whole-body immunosuppression in recipients. However, assessing genome-targeting interventions in a clinically relevant manner prior to clinical trials remains a major challenge owing to the limited modalities available. This study introduces a novel platform for testing genome editing in human lungs ex vivo, effectively simulating preimplantation genetic engineering of donor organs. We identified gene regulatory elements whose disruption via Cas nucleases led to the upregulation of the immunomodulatory gene interleukin 10 (IL-10). We combined this approach with adenoviral vector-mediated IL-10 delivery to create favorable kinetics for early (immediate postimplantation) graft immunomodulation. Using ex vivo organ machine perfusion and precision-cut tissue slice technology, we demonstrated the feasibility of evaluating CRISPR genome editing in human lungs. To overcome the assessment limitations in ex vivo perfused human organs, we conducted an in vivo rodent study and demonstrated both early gene induction and sustained editing of the lung. Collectively, our findings lay the groundwork for a first-in-human-organ study to overcome the current translational barriers of genome-targeting therapeutics.
Subject(s)
CRISPR-Cas Systems , Gene Editing , Lung , Gene Editing/methods , Humans , Lung/metabolism , Interleukin-10/genetics , Interleukin-10/metabolism , Animals , Genetic Vectors/genetics , Genetic Vectors/administration & dosageABSTRACT
Background: Single-cell RNA-sequencing (scRNA-seq) technology has revealed novel cell populations in organs, uncovered regulatory relationships between genes, and allowed for tracking of cell lineage trajectory during development. It demonstrates promise as a method to better understand transplant biology; however, fundamental bioinformatic tools for its use in the context of transplantation have not been developed. One major need has been a robust method to identify cells as being either donor or recipient genotype origin, and ideally without the need to separately sequence the donor and recipient. Methods: We implemented a novel two-stage genotype discovery method (scTx) optimized for transplant samples by being robust to disparities in cell number and cell type. Using both in silico and real-world scRNA-seq transplant data, we benchmarked our method against existing demultiplexing methods to profile their limitations in terms of sequencing depth, donor and recipient cell imbalance, and single nucleotide variant input selection. Results: Using in silico data, scTx could more accurately separate donor from recipient cells and at much lower genotype ratios than existing methods. This was further validated using solid-organ scRNA-seq data where scTx could more reliably identify when a second genotype was present and at lower numbers of cells from a second genotype. Conclusion: scTx introduces the capability to accurately segregate donor and recipient gene expression at the single-cell level from scRNA-seq data without the need to separately genotype the donor and recipient. This will facilitate the use of scRNA-seq in the context of transplantation.
ABSTRACT
BACKGROUND: Inflammatory injury in the donor lung remains a persistent challenge in lung transplantation that limits donor organ usage and post-transplant outcomes. Inducing immunomodulatory capacity in donor organs could address this unsolved clinical problem. We sought to apply clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas) technologies to the donor lung to fine-tune immunomodulatory gene expression, exploring for the first time the therapeutic use of CRISPR-mediated transcriptional activation in the whole donor lung. METHODS: We explored the feasibility of CRISPR-mediated transcriptional upregulation of interleukin 10 (IL-10), a key immunomodulatory cytokine, in vitro and in vivo. We first evaluated the potency, titratability, and multiplexibility of the gene activation in rat and human cell lines. Next, in vivo CRISPR-mediated IL-10 activation was characterized in rat lungs. Finally, the IL-10-activated donor lungs were transplanted into recipient rats to assess the feasibility in a transplant setting. RESULTS: The targeted transcriptional activation induced robust and titrable IL-10 upregulation in vitro. The combination of guide RNAs also facilitated multiplex gene modulation, that is, simultaneous activation of IL-10 and IL1 receptor antagonist. In vivo profiling demonstrated that adenoviral delivery of Cas9-based activators to the lung was feasible with the use of immunosuppression, which is routinely applied to organ transplant recipients. The transcriptionally modulated donor lungs retained IL-10 upregulation in isogeneic and allogeneic recipients. CONCLUSIONS: Our findings highlight the potential of CRISPR epigenome editing to improve lung transplant outcomes by creating a more favorable immunomodulatory environment in the donor organ, a paradigm that may be extendable to other organ transplants.
Subject(s)
Gene Editing , Interleukin-10 , Humans , Animals , Rats , Interleukin-10/genetics , Cell Line , Lung , ImmunomodulationABSTRACT
Homing endonucleases are site-specific DNA endonucleases that typically function as mobile genetic elements by introducing a double-strand break (DSB) in genomes that lack the endonuclease, resulting in a unidirectional gene conversion event that mobilizes the homing endonuclease gene and flanking DNA. Here, we characterize phage T4-encoded mobE, a predicted free-standing HNH family homing endonuclease. We show that mobE is promoterless and dependent on upstream transcription for expression, and that an internal intrinsic terminator regulates mobE transcript levels. Crucially, in vivo mapping experiments revealed a MobE-dependent, strand-specific nick in the non-coding strand of the nrdB gene of phage T2. An internal deletion of the predicted HNH catalytic motif of MobE abolishes nicking, and reduces high-frequency inheritance of mobE. Sequence polymorphisms of progeny phage that inherit mobE are consistent with DSB repair pathways. Significantly, we found that mobility of the neighboring I-TevIII, a defunct homing endonuclease encoded within a group I intron interrupting the nrdB gene of phage T4, was dependent on an intact mobE gene. Thus, our data indicate that the stagnant nrdB intron and I-TevIII are mobilized in trans as a consequence of a MobE-dependent gene conversion event, facilitating persistence of genetic elements that have no inherent means of promoting their own mobility.
Subject(s)
Bacteriophage T4/enzymology , Bacteriophage T4/genetics , DNA Transposable Elements , Endodeoxyribonucleases/genetics , Chromosome Mapping , Endodeoxyribonucleases/biosynthesis , Endodeoxyribonucleases/metabolism , Introns , Recombination, Genetic , Regulatory Elements, Transcriptional , Ribonucleotide Reductases/biosynthesis , Ribonucleotide Reductases/genetics , Transcription, GeneticABSTRACT
Identifying single nucleotide variants has become common practice for droplet-based single-cell RNA-seq experiments; however, presently, a pipeline does not exist to maximize variant calling accuracy. Furthermore, molecular duplicates generated in these experiments have not been utilized to optimally detect variant co-expression. Herein, we introduce scSNV designed from the ground up to "collapse" molecular duplicates and accurately identify variants and their co-expression. We demonstrate that scSNV is fast, with a reduced false-positive variant call rate, and enables the co-detection of genetic variants and A>G RNA edits across twenty-two samples.
Subject(s)
Polymorphism, Single Nucleotide/genetics , Software , Algorithms , Alleles , Computer Simulation , Humans , Polymerase Chain Reaction , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA-Seq , Single-Cell Analysis , Whole Genome SequencingABSTRACT
Esophageal adenocarcinoma has few known recurrent mutations and therefore robust, reliable and reproducible patient-specific models are needed for personalized treatment. Patient-derived organoid culture is a strategy that may allow for the personalized study of esophageal adenocarcinoma and the development of personalized induction therapy. We therefore developed a protocol to establish EAC organoids from endoscopic biopsies of naïve esophageal adenocarcinomas. Histologic characterization and molecular characterization of organoids by whole exome sequencing demonstrated recapitulation of the tumors' histology and genomic (~ 60% SNV overlap) characteristics. Drug testing using clinically appropriate chemotherapeutics and targeted therapeutics showed an overlap between the patient's tumor response and the corresponding organoids' response. Furthermore, we identified Barrett's esophagus epithelium as a potential source of organoid culture contamination. In conclusion, organoids can be robustly cultured from endoscopic biopsies of esophageal adenocarcinoma and recapitulate the originating tumor. This model demonstrates promise as a tool to better personalize therapy for esophageal adenocarcinoma patients.
Subject(s)
Adenocarcinoma/drug therapy , Esophageal Neoplasms/drug therapy , Induction Chemotherapy/methods , Precision Medicine/methods , Adult , Aged , Aged, 80 and over , Barrett Esophagus/drug therapy , Female , Humans , Male , Middle Aged , Organoids/cytologyABSTRACT
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
ABSTRACT
Pancreatic adenocarcinoma presents as a spectrum of a highly aggressive disease in patients. The basis of this disease heterogeneity has proved difficult to resolve due to poor tumor cellularity and extensive genomic instability. To address this, a dataset of whole genomes and transcriptomes was generated from purified epithelium of primary and metastatic tumors. Transcriptome analysis demonstrated that molecular subtypes are a product of a gene expression continuum driven by a mixture of intratumoral subpopulations, which was confirmed by single-cell analysis. Integrated whole-genome analysis uncovered that molecular subtypes are linked to specific copy number aberrations in genes such as mutant KRAS and GATA6. By mapping tumor genetic histories, tetraploidization emerged as a key mutational process behind these events. Taken together, these data support the premise that the constellation of genomic aberrations in the tumor gives rise to the molecular subtype, and that disease heterogeneity is due to ongoing genomic instability during progression.
Subject(s)
Carcinoma, Pancreatic Ductal/genetics , Carcinoma, Pancreatic Ductal/pathology , Pancreatic Neoplasms/genetics , Pancreatic Neoplasms/pathology , Adult , Aged , Aged, 80 and over , Carcinoma, Pancreatic Ductal/drug therapy , Carcinoma, Pancreatic Ductal/mortality , Cohort Studies , Female , GATA6 Transcription Factor/genetics , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Genomic Instability , Humans , Male , Middle Aged , Pancreatic Neoplasms/drug therapy , Pancreatic Neoplasms/mortality , Phenotype , Proto-Oncogene Proteins p21(ras)/genetics , Smad4 Protein/geneticsABSTRACT
IMPORTANCE: Outcomes for patients with pancreatic ductal adenocarcinoma (PDAC) remain poor. Advances in next-generation sequencing provide a route to therapeutic approaches, and integrating DNA and RNA analysis with clinicopathologic data may be a crucial step toward personalized treatment strategies for this disease. OBJECTIVE: To classify PDAC according to distinct mutational processes, and explore their clinical significance. DESIGN, SETTING, AND PARTICIPANTS: We performed a retrospective cohort study of resected PDAC, using cases collected between 2008 and 2015 as part of the International Cancer Genome Consortium. The discovery cohort comprised 160 PDAC cases from 154 patients (148 primary; 12 metastases) that underwent tumor enrichment prior to whole-genome and RNA sequencing. The replication cohort comprised 95 primary PDAC cases that underwent whole-genome sequencing and expression microarray on bulk biospecimens. MAIN OUTCOMES AND MEASURES: Somatic mutations accumulate from sequence-specific processes creating signatures detectable by DNA sequencing. Using nonnegative matrix factorization, we measured the contribution of each signature to carcinogenesis, and used hierarchical clustering to subtype each cohort. We examined expression of antitumor immunity genes across subtypes to uncover biomarkers predictive of response to systemic therapies. RESULTS: The discovery cohort was 53% male (n = 79) and had a median age of 67 (interquartile range, 58-74) years. The replication cohort was 50% male (n = 48) and had a median age of 68 (interquartile range, 60-75) years. Five predominant mutational subtypes were identified that clustered PDAC into 4 major subtypes: age related, double-strand break repair, mismatch repair, and 1 with unknown etiology (signature 8). These were replicated and validated. Signatures were faithfully propagated from primaries to matched metastases, implying their stability during carcinogenesis. Twelve of 27 (45%) double-strand break repair cases lacked germline or somatic events in canonical homologous recombination genes-BRCA1, BRCA2, or PALB2. Double-strand break repair and mismatch repair subtypes were associated with increased expression of antitumor immunity, including activation of CD8-positive T lymphocytes (GZMA and PRF1) and overexpression of regulatory molecules (cytotoxic T-lymphocyte antigen 4, programmed cell death 1, and indolamine 2,3-dioxygenase 1), corresponding to higher frequency of somatic mutations and tumor-specific neoantigens. CONCLUSIONS AND RELEVANCE: Signature-based subtyping may guide personalized therapy of PDAC in the context of biomarker-driven prospective trials.
Subject(s)
Carcinoma, Pancreatic Ductal/genetics , Mutation , Pancreatic Neoplasms/genetics , Aged , CD8-Positive T-Lymphocytes/immunology , CTLA-4 Antigen/metabolism , Carcinoma, Pancreatic Ductal/immunology , DNA Breaks, Double-Stranded/drug effects , DNA Mismatch Repair/genetics , Fanconi Anemia Complementation Group N Protein , Female , Genes, BRCA1/physiology , Genes, BRCA2/physiology , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Humans , Male , Middle Aged , Nuclear Proteins/genetics , Pancreatic Neoplasms/immunology , Prognosis , Programmed Cell Death 1 Receptor/metabolism , Retrospective Studies , Tumor Suppressor Proteins/genetics , Pancreatic NeoplasmsABSTRACT
BACKGROUND: Long non-coding RNAs (lncRNAs) are emerging as molecules that significantly impact many cellular processes and have been associated with almost every human cancer. Compared to protein-coding genes, lncRNA genes are often associated with transposable elements, particularly with endogenous retroviral elements (ERVs). ERVs can have potentially deleterious effects on genome structure and function, so these elements are typically silenced in normal somatic tissues, albeit with varying efficiency. The aberrant regulation of ERVs associated with lncRNAs (ERV-lncRNAs), coupled with the diverse range of lncRNA functions, creates significant potential for ERV-lncRNAs to impact cancer biology. METHODS: We used RNA-seq analysis to identify and profile the expression of a novel lncRNA in six large cohorts, including over 7,500 samples from The Cancer Genome Atlas (TCGA). RESULTS: We identified the tumor-specific expression of a novel lncRNA that we have named Endogenous retroViral-associated ADenocarcinoma RNA or 'EVADR', by analyzing RNA-seq data derived from colorectal tumors and matched normal control tissues. Subsequent analysis of TCGA RNA-seq data revealed the striking association of EVADR with adenocarcinomas, which are tumors of glandular origin. Moderate to high levels of EVADR were detected in 25 to 53% of colon, rectal, lung, pancreas and stomach adenocarcinomas (mean = 30 to 144 FPKM), and EVADR expression correlated with decreased patient survival (Cox regression; hazard ratio = 1.47, 95% confidence interval = 1.06 to 2.04, P = 0.02). In tumor sites of non-glandular origin, EVADR expression was detectable at only very low levels and in less than 10% of patients. For EVADR, a MER48 ERV element provides an active promoter to drive its transcription. Genome-wide, MER48 insertions are associated with nine lncRNAs, but none of the MER48-associated lncRNAs other than EVADR were consistently expressed in adenocarcinomas, demonstrating the specific activation of EVADR. The sequence and structure of the EVADR locus is highly conserved among Old World monkeys and apes but not New World monkeys or prosimians, where the MER48 insertion is absent. Conservation of the EVADR locus suggests a functional role for this novel lncRNA in humans and our closest primate relatives. CONCLUSIONS: Our results describe the specific activation of a highly conserved ERV-lncRNA in numerous cancers of glandular origin, a finding with diagnostic, prognostic and therapeutic implications.