RESUMEN
To gain insight into how ERG translocations cause prostate cancer, we performed single cell transcriptional profiling of an autochthonous mouse model at an early stage of disease initiation. Despite broad expression of ERG in all prostate epithelial cells, proliferation was enriched in a small, stem-like population with mixed-luminal basal identity (called intermediate cells). Through a series of lineage tracing and primary prostate tissue transplantation experiments, we find that tumor initiating activity resides in a subpopulation of basal cells that co-express the luminal genes Tmprss2 and Nkx3.1 (called BasalLum) but not in the larger population of classical Krt8+ luminal cells. Upon ERG activation, BasalLum cells give rise to the highly proliferative intermediate state, which subsequently transitions to the larger population of Krt8+ luminal cells characteristic of ERG-positive human cancers. Furthermore, this proliferative population is characterized by an ERG-specific chromatin state enriched for NFkB, AP-1, STAT and NFAT binding, with implications for TF cooperativity. The fact that the proliferative potential of ERG is enriched in a small stem-like population implicates the chromatin context of these cells as a critical variable for unmasking its oncogenic activity.
RESUMEN
Tumors develop by invoking a supportive environment characterized by aberrant angiogenesis and infiltration of tumor-associated macrophages (TAMs). In a transgenic model of breast cancer, we found that TAMs localized to the tumor parenchyma and were smaller than mammary tissue macrophages. TAMs had low activity of the metabolic regulator mammalian/mechanistic target of rapamycin complex 1 (mTORC1), and depletion of negative regulator of mTORC1 signaling, tuberous sclerosis complex 1 (TSC1), in TAMs inhibited tumor growth in a manner independent of adaptive lymphocytes. Whereas wild-type TAMs exhibited inflammatory and angiogenic gene expression profiles, TSC1-deficient TAMs had a pro-resolving phenotype. TSC1-deficient TAMs relocated to a perivascular niche, depleted protein C receptor (PROCR)-expressing endovascular endothelial progenitor cells, and rectified the hyperpermeable blood vasculature, causing tumor tissue hypoxia and cancer cell death. TSC1-deficient TAMs were metabolically active and effectively eliminated PROCR-expressing endothelial cells in cell competition experiments. Thus, TAMs exhibit a TSC1-dependent mTORC1-low state, and increasing mTORC1 signaling promotes a pro-resolving state that suppresses tumor growth, defining an innate immune tumor suppression pathway that may be exploited for cancer immunotherapy.
Asunto(s)
Células Progenitoras Endoteliales , Proteínas Supresoras de Tumor , Animales , Humanos , Serina-Treonina Quinasas TOR/metabolismo , Proteína 1 del Complejo de la Esclerosis Tuberosa/genética , Macrófagos Asociados a Tumores/metabolismo , Células Progenitoras Endoteliales/metabolismo , Receptor de Proteína C Endotelial , Diana Mecanicista del Complejo 1 de la Rapamicina , Neovascularización Patológica , MamíferosRESUMEN
The phosphoinositide 3-kinase (PI3K) pathway regulates proliferation, survival, and metabolism and is frequently activated across human cancers. A comprehensive elucidation of how this signaling pathway controls transcriptional and cotranscriptional processes could provide new insights into the key functions of PI3K signaling in cancer. Here, we undertook a transcriptomic approach to investigate genome-wide gene expression and transcription factor activity changes, as well as splicing and isoform usage dynamics, downstream of PI3K. These analyses uncovered widespread alternatively spliced isoforms linked to proliferation, metabolism, and splicing in PIK3CA-mutant cells, which were reversed by inhibition of PI3Kα. Analysis of paired tumor biopsies from patients with PIK3CA-mutated breast cancer undergoing treatment with PI3Kα inhibitors identified widespread splicing alterations that affect specific isoforms in common with the preclinical models, and these alterations, namely PTK2/FRNK and AFMID isoforms, were validated as functional drivers of cancer cell growth or migration. Mechanistically, isoform-specific splicing factors mediated PI3K-dependent RNA splicing. Treatment with splicing inhibitors rendered breast cancer cells more sensitive to the PI3Kα inhibitor alpelisib, resulting in greater growth inhibition than alpelisib alone. This study provides the first comprehensive analysis of widespread splicing alterations driven by oncogenic PI3K in breast cancer. The atlas of PI3K-mediated splicing programs establishes a key role for the PI3K pathway in regulating splicing, opening new avenues for exploiting PI3K signaling as a therapeutic vulnerability in breast cancer. SIGNIFICANCE: Transcriptomic analysis reveals a key role for the PI3K pathway in regulating RNA splicing, uncovering new mechanisms by which PI3K regulates proliferation and metabolism in breast cancer. See related commentary by Claridge and Hopkins, p. 2216.
Asunto(s)
Neoplasias de la Mama , Fosfatidilinositol 3-Quinasas , Neoplasias de la Mama/patología , Carcinogénesis/genética , Línea Celular Tumoral , Fosfatidilinositol 3-Quinasa Clase I/genética , Femenino , Humanos , Fosfatidilinositol 3-Quinasa/genética , Fosfatidilinositol 3-Quinasa/metabolismo , Fosfatidilinositol 3-Quinasas/genética , Fosfatidilinositol 3-Quinasas/metabolismo , Empalme del ARN/genética , TranscriptomaRESUMEN
Fifty percent of diffuse large B cell lymphoma (DLBCL) cases lack cell-surface expression of the class I major histocompatibility complex (MHC-I), thus escaping recognition by cytotoxic T cells. Here we show that, across B cell lymphomas, loss of MHC-I, but not MHC-II, is preferentially restricted to DLBCL. To identify the involved mechanisms, we performed whole exome and targeted HLA deep-sequencing in 74 DLBCL samples, and found somatic inactivation of B2M and the HLA-I loci in 80% (34 of 42) of MHC-INEG tumors. Furthermore, 70% (22 of 32) of MHC-IPOS DLBCLs harbored monoallelic HLA-I genetic alterations (MHC-IPOS/mono), indicating allele-specific inactivation. MHC-INEG and MHC-IPOS/mono cases harbored significantly higher mutational burden and inferred neoantigen load, suggesting potential coselection of HLA-I loss and sustained neoantigen production. Notably, the analysis of >500,000 individuals across different cancer types revealed common germline HLA-I homozygosity, preferentially in DLBCL. In mice, germinal-center B cells lacking HLA-I expression did not progress to lymphoma and were counterselected in the context of oncogene-driven lymphomagenesis, suggesting that additional events are needed to license immune evasion. These results suggest a multistep process of HLA-I loss in DLBCL development including both germline and somatic events, and have direct implications for the pathogenesis and immunotherapeutic targeting of this disease.
Asunto(s)
Transformación Celular Neoplásica/genética , Antígenos de Histocompatibilidad Clase I/genética , Linfoma de Células B Grandes Difuso/genética , Línea Celular Tumoral , Citidina Desaminasa , Silenciador del Gen , Humanos , Linfoma de Células B Grandes Difuso/inmunología , Proteínas Proto-Oncogénicas c-bcl-6/genética , Microglobulina beta-2/genéticaRESUMEN
PI3Kα inhibitors have shown clinical activity in PIK3CA-mutated estrogen receptor-positive (ER+) patients with breast cancer. Using whole genome CRISPR/Cas9 sgRNA knockout screens, we identified and validated several negative regulators of mTORC1 whose loss confers resistance to PI3Kα inhibition. Among the top candidates were TSC1, TSC2, TBC1D7, AKT1S1, STK11, MARK2, PDE7A, DEPDC5, NPRL2, NPRL3, C12orf66, SZT2, and ITFG2. Loss of these genes invariably results in sustained mTOR signaling under pharmacologic inhibition of the PI3K-AKT pathway. Moreover, resistance could be prevented or overcome by mTOR inhibition, confirming the causative role of sustained mTOR activity in limiting the sensitivity to PI3Kα inhibition. Cumulatively, genomic alterations affecting these genes are identified in about 15% of PIK3CA-mutated breast tumors and appear to be mutually exclusive. This study improves our understanding of the role of mTOR signaling restoration in leading to resistance to PI3Kα inhibition and proposes therapeutic strategies to prevent or revert this resistance. SIGNIFICANCE: These findings show that genetic lesions of multiple negative regulators of mTORC1 could limit the efficacy of PI3Kα inhibitors in breast cancer, which may guide patient selection strategies for future clinical trials.
Asunto(s)
Neoplasias de la Mama/tratamiento farmacológico , Neoplasias de la Mama/genética , Fosfatidilinositol 3-Quinasa Clase I/genética , Resistencia a Antineoplásicos/efectos de los fármacos , Mutación con Pérdida de Función , Diana Mecanicista del Complejo 1 de la Rapamicina/metabolismo , Inhibidores de las Quinasa Fosfoinosítidos-3/administración & dosificación , Transducción de Señal/genética , Tiazoles/administración & dosificación , Animales , Neoplasias de la Mama/metabolismo , Neoplasias de la Mama/patología , Supervivencia Celular/efectos de los fármacos , Fosfatidilinositol 3-Quinasa Clase I/antagonistas & inhibidores , Estudios de Cohortes , Femenino , Células HEK293 , Humanos , Células MCF-7 , Ratones , Serina-Treonina Quinasas TOR/metabolismo , Transducción Genética , Carga Tumoral/efectos de los fármacos , Carga Tumoral/genética , Ensayos Antitumor por Modelo de XenoinjertoRESUMEN
Mutations in the pioneer transcription factor FOXA1 are a hallmark of estrogen receptor-positive (ER+) breast cancers. Examining FOXA1 in â¼5,000 breast cancer patients identifies several hotspot mutations in the Wing2 region and a breast cancer-specific mutation SY242CS, located in the third ß strand. Using a clinico-genomically curated cohort, together with breast cancer models, we find that FOXA1 mutations associate with a lower response to aromatase inhibitors. Mechanistically, Wing2 mutations display increased chromatin binding at ER loci upon estrogen stimulation, and an enhanced ER-mediated transcription without changes in chromatin accessibility. In contrast, SY242CS shows neomorphic properties that include the ability to open distinct chromatin regions and activate an alternative cistrome and transcriptome. Structural modeling predicts that SY242CS confers a conformational change that mediates stable binding to a non-canonical DNA motif. Taken together, our results provide insights into how FOXA1 mutations perturb its function to dictate cancer progression and therapeutic response.
Asunto(s)
Inhibidores de la Aromatasa/uso terapéutico , Neoplasias de la Mama/tratamiento farmacológico , Cromatina/genética , Factor Nuclear 3-alfa del Hepatocito/genética , Mutación Missense , Animales , Neoplasias de la Mama/genética , Neoplasias de la Mama/metabolismo , Línea Celular Tumoral , Cromatina/metabolismo , Receptor alfa de Estrógeno/genética , Receptor alfa de Estrógeno/metabolismo , Femenino , Perfilación de la Expresión Génica/métodos , Regulación Neoplásica de la Expresión Génica , Factor Nuclear 3-alfa del Hepatocito/química , Factor Nuclear 3-alfa del Hepatocito/metabolismo , Humanos , Células MCF-7 , Ratones Desnudos , Modelos Moleculares , Dominios Proteicos , Ensayos Antitumor por Modelo de Xenoinjerto/métodosRESUMEN
Mutations in ARID1A, a subunit of the SWI/SNF chromatin remodeling complex, are the most common alterations of the SWI/SNF complex in estrogen-receptor-positive (ER+) breast cancer. We identify that ARID1A inactivating mutations are present at a high frequency in advanced endocrine-resistant ER+ breast cancer. An epigenome CRISPR-CAS9 knockout (KO) screen identifies ARID1A as the top candidate whose loss determines resistance to the ER degrader fulvestrant. ARID1A inactivation in cells and in patients leads to resistance to ER degraders by facilitating a switch from ER-dependent luminal cells to ER-independent basal-like cells. Cellular plasticity is mediated by loss of ARID1A-dependent SWI/SNF complex targeting to genomic sites of the luminal lineage-determining transcription factors including ER, forkhead box protein A1 (FOXA1) and GATA-binding factor 3 (GATA3). ARID1A also regulates genome-wide ER-FOXA1 chromatin interactions and ER-dependent transcription. Altogether, we uncover a critical role for ARID1A in maintaining luminal cell identity and endocrine therapeutic response in ER+ breast cancer.
Asunto(s)
Neoplasias de la Mama/tratamiento farmacológico , Neoplasias de la Mama/genética , Proteínas de Unión al ADN/genética , Resistencia a Antineoplásicos/genética , Receptores de Estrógenos/metabolismo , Factores de Transcripción/genética , Animales , Neoplasias de la Mama/mortalidad , Neoplasias de la Mama/patología , Estudios de Casos y Controles , Línea Celular Tumoral , Cromatina/genética , Cromatina/metabolismo , Proteínas de Unión al ADN/metabolismo , Femenino , Factor de Transcripción GATA3/genética , Factor de Transcripción GATA3/metabolismo , Factor Nuclear 3-alfa del Hepatocito/genética , Factor Nuclear 3-alfa del Hepatocito/metabolismo , Humanos , Ratones , Mutación , Receptores de Estrógenos/genética , Factores de Transcripción/metabolismo , Ensayos Antitumor por Modelo de XenoinjertoRESUMEN
Activating mutations in PIK3CA are frequent in human breast cancer, and phosphoinositide 3-kinase alpha (PI3Kα) inhibitors have been approved for therapy. To characterize determinants of sensitivity to these agents, we analyzed PIK3CA-mutant cancer genomes and observed the presence of multiple PIK3CA mutations in 12 to 15% of breast cancers and other tumor types, most of which (95%) are double mutations. Double PIK3CA mutations are in cis on the same allele and result in increased PI3K activity, enhanced downstream signaling, increased cell proliferation, and tumor growth. The biochemical mechanisms of dual mutations include increased disruption of p110α binding to the inhibitory subunit p85α, which relieves its catalytic inhibition, and increased p110α membrane lipid binding. Double PIK3CA mutations predict increased sensitivity to PI3Kα inhibitors compared with single-hotspot mutations.
Asunto(s)
Carcinogénesis/genética , Fosfatidilinositol 3-Quinasa Clase I/genética , Fosfatidilinositol 3-Quinasa Clase I/metabolismo , Resistencia a Antineoplásicos/genética , Neoplasias/genética , Inhibidores de las Quinasa Fosfoinosítidos-3/farmacología , Neoplasias de la Mama/tratamiento farmacológico , Neoplasias de la Mama/genética , Neoplasias de la Mama/patología , Línea Celular Tumoral , Fosfatidilinositol 3-Quinasa Clase I/química , Fosfatidilinositol 3-Quinasa Clase Ia/química , Fosfatidilinositol 3-Quinasa Clase Ia/metabolismo , Femenino , Humanos , Mutación , Neoplasias/tratamiento farmacológico , Neoplasias/patología , Inhibidores de las Quinasa Fosfoinosítidos-3/uso terapéutico , Unión Proteica , Dominios Proteicos , Tiazoles/farmacologíaRESUMEN
Background: Despite extensive efforts on the genomic characterization of gliomas, very few studies have reported the genetic alterations of cerebellar glioblastoma (C-GBM), a rare and lethal disease. Here, we provide a systematic study of C-GBM to better understand its specific genomic features. Methods: We collected a cohort of C-GBM patients and compared patient demographics and tumor pathologies with supratentorial glioblastoma (S-GBM). To uncover the molecular characteristics, we performed DNA and mRNA sequencing and DNA methylation arrays on 19, 6, and 4 C-GBM cases, respectively. Moreover, chemical drug screening was conducted to identify potential therapeutic options for C-GBMs. Results: Despite differing anatomical origins of C-GBM and S-GBM, neither histological, cytological, nor patient demographics appeared significantly different between the 2 types. However, we observed striking differences in mutational patterns, including frequent alterations of ATRX, PDGFRA, NF1, and RAS and absence of EGFR alterations in C-GBM. These results show a distinct evolutionary path in C-GBM, suggesting specific therapeutic targeted options. Targeted-drug screening revealed that C-GBMs were more responsive to mitogen-activated protein kinase kinase (MEK) inhibitor and resistant to epidermal growth factor receptor inhibitors than S-GBMs. Also, differential expression analysis indicated that C-GBMs may have originated from oligodendrocyte progenitor cells, suggesting that different types of cells can undergo malignant transformation according to their location in brain. Master regulator analysis with differentially expressed genes between C-GBM and proneural S-GBM revealed NR4A1 as a potential therapeutic target. Conclusions: Our results imply that unique gliomagenesis mechanisms occur in adult cerebellum and new treatment strategies are needed to provide greater therapeutic benefits for C-GBM patients. Key Points: 1. Distinct genomic profiles of 19 adult cerebellar GBMs were characterized. 2. MEK inhibitor was highly sensitive to cerebellar GBM compared with supratentorial GBM. 3. Master regulator analysis revealed NR4A1 as a potential therapeutic target in cerebellar GBM.
Asunto(s)
Biomarcadores de Tumor/genética , Neoplasias Cerebelosas/genética , Regulación Neoplásica de la Expresión Génica/efectos de los fármacos , Genómica/métodos , Glioblastoma/genética , Terapia Molecular Dirigida , Inhibidores de Proteínas Quinasas/uso terapéutico , Transcriptoma/efectos de los fármacos , Adulto , Anciano , Anciano de 80 o más Años , Neoplasias Cerebelosas/tratamiento farmacológico , Neoplasias Cerebelosas/patología , Metilación de ADN , Femenino , Estudios de Seguimiento , Fusión Génica , Glioblastoma/tratamiento farmacológico , Glioblastoma/patología , Humanos , Masculino , Persona de Mediana Edad , Pronóstico , Estudios Retrospectivos , Tasa de Supervivencia , Adulto JovenRESUMEN
Dissecting the pathogenesis of classical Hodgkin lymphoma (cHL), a common cancer in young adults, remains challenging because of the rarity of tumor cells in involved tissues (usually <5%). Here, we analyzed the coding genome of cHL by microdissecting tumor and normal cells from 34 patient biopsies for a total of â¼50 000 singly isolated lymphoma cells. We uncovered several recurrently mutated genes, namely, STAT6 (32% of cases), GNA13 (24%), XPO1 (18%), and ITPKB (16%), and document the functional role of mutant STAT6 in sustaining tumor cell viability. Mutations of STAT6 genetically and functionally cooperated with disruption of SOCS1, a JAK-STAT pathway inhibitor, to promote cHL growth. Overall, 87% of cases showed dysregulation of the JAK-STAT pathway by genetic alterations in multiple genes (also including STAT3, STAT5B, JAK1, JAK2, and PTPN1), attesting to the pivotal role of this pathway in cHL pathogenesis and highlighting its potential as a new therapeutic target in this disease.
Asunto(s)
Regulación Neoplásica de la Expresión Génica , Enfermedad de Hodgkin/genética , Quinasas Janus/genética , Mutación , Factores de Transcripción STAT/genética , Línea Celular Tumoral , Análisis Mutacional de ADN , Enfermedad de Hodgkin/metabolismo , Enfermedad de Hodgkin/patología , Humanos , Quinasas Janus/metabolismo , Factores de Transcripción STAT/metabolismo , Transducción de SeñalRESUMEN
The distribution of sense and antisense strand DNA mutations on transcribed duplex DNA contributes to the development of immune and neural systems along with the progression of cancer. Because developmentally matured B cells undergo biologically programmed strand-specific DNA mutagenesis at focal DNA/RNA hybrid structures, they make a convenient system to investigate strand-specific mutagenesis mechanisms. We demonstrate that the sense and antisense strand DNA mutagenesis at the immunoglobulin heavy chain locus and some other regions of the B cell genome depends upon localized RNA processing protein complex formation in the nucleus. Both the physical proximity and coupled activities of RNA helicase Mtr4 (and senataxin) with the noncoding RNA processing function of RNA exosome determine the strand-specific distribution of DNA mutations. Our study suggests that strand-specific DNA mutagenesis-associated mechanisms will play major roles in other undiscovered aspects of organismic development.
Asunto(s)
Linfocitos B/metabolismo , Complejo Multienzimático de Ribonucleasas del Exosoma/metabolismo , Mutación , Proteínas Nucleares/metabolismo , Proteínas de Unión al ARN/metabolismo , Animales , Núcleo Celular/metabolismo , ADN Helicasas/metabolismo , Exorribonucleasas/genética , Inestabilidad Genómica , Cadenas Pesadas de Inmunoglobulina/genética , Ratones , Enzimas Multifuncionales , Proteínas Nucleares/genética , ARN Helicasas , Procesamiento Postranscripcional del ARN , Proteínas de Unión al ARN/genéticaRESUMEN
Precision medicine in cancer proposes that genomic characterization of tumors can inform personalized targeted therapies. However, this proposition is complicated by spatial and temporal heterogeneity. Here we study genomic and expression profiles across 127 multisector or longitudinal specimens from 52 individuals with glioblastoma (GBM). Using bulk and single-cell data, we find that samples from the same tumor mass share genomic and expression signatures, whereas geographically separated, multifocal tumors and/or long-term recurrent tumors are seeded from different clones. Chemical screening of patient-derived glioma cells (PDCs) shows that therapeutic response is associated with genetic similarity, and multifocal tumors that are enriched with PIK3CA mutations have a heterogeneous drug-response pattern. We show that targeting truncal events is more efficacious than targeting private events in reducing the tumor burden. In summary, this work demonstrates that evolutionary inference from integrated genomic analysis in multisector biopsies can inform targeted therapeutic interventions for patients with GBM.
Asunto(s)
Neoplasias Encefálicas/genética , Glioblastoma/genética , Genómica/métodos , Humanos , Mutación/genética , Recurrencia Local de Neoplasia/genética , Fosfatidilinositol 3-Quinasas/genética , Medicina de Precisión/métodosRESUMEN
Glioblastoma (GBM) is the most common and aggressive primary brain tumor. To better understand how GBM evolves, we analyzed longitudinal genomic and transcriptomic data from 114 patients. The analysis shows a highly branched evolutionary pattern in which 63% of patients experience expression-based subtype changes. The branching pattern, together with estimates of evolutionary rate, suggests that relapse-associated clones typically existed years before diagnosis. Fifteen percent of tumors present hypermutation at relapse in highly expressed genes, with a clear mutational signature. We find that 11% of recurrence tumors harbor mutations in LTBP4, which encodes a protein binding to TGF-ß. Silencing LTBP4 in GBM cells leads to suppression of TGF-ß activity and decreased cell proliferation. In recurrent GBM with wild-type IDH1, high LTBP4 expression is associated with worse prognosis, highlighting the TGF-ß pathway as a potential therapeutic target in GBM.
Asunto(s)
Biomarcadores de Tumor/genética , Neoplasias Encefálicas/patología , Evolución Clonal/genética , Dacarbazina/análogos & derivados , Glioblastoma/patología , Mutación/genética , Recurrencia Local de Neoplasia/patología , Antineoplásicos Alquilantes/uso terapéutico , Neoplasias Encefálicas/tratamiento farmacológico , Neoplasias Encefálicas/genética , Proliferación Celular , Metilasas de Modificación del ADN/genética , Enzimas Reparadoras del ADN/genética , Dacarbazina/uso terapéutico , Regulación Neoplásica de la Expresión Génica , Genómica , Glioblastoma/tratamiento farmacológico , Glioblastoma/genética , Humanos , Isocitrato Deshidrogenasa/genética , Proteínas de Unión a TGF-beta Latente/genética , Estudios Longitudinales , Clasificación del Tumor , Recurrencia Local de Neoplasia/tratamiento farmacológico , Recurrencia Local de Neoplasia/genética , Tasa de Supervivencia , Temozolomida , Transcriptoma , Factor de Crecimiento Transformador beta/genética , Proteínas Supresoras de Tumor/genéticaRESUMEN
Mirtrons are microRNA (miRNA) substrates that utilize the splicing machinery to bypass the necessity of Drosha cleavage for their biogenesis. Expanding our recent efforts for mammalian mirtron annotation, we use meta-analysis of aggregate datasets to identify ~500 novel mouse and human introns that confidently generate diced small RNA duplexes. These comprise nearly 1000 total loci distributed in four splicing-mediated biogenesis subclasses, with 5'-tailed mirtrons as, by far, the dominant subtype. Thus, mirtrons surprisingly comprise a substantial fraction of endogenous Dicer substrates in mammalian genomes. Although mirtron-derived small RNAs exhibit overall expression correlation with their host mRNAs, we observe a subset with substantial differences that suggest regulated processing or accumulation. We identify characteristic sequence, length, and structural features of mirtron loci that distinguish them from bulk introns, and find that mirtrons preferentially emerge from genes with larger numbers of introns. While mirtrons generate miRNA-class regulatory RNAs, we also find that mirtrons exhibit many features that distinguish them from canonical miRNAs. We observe that conventional mirtron hairpins are substantially longer than Drosha-generated pre-miRNAs, indicating that the characteristic length of canonical pre-miRNAs is not a general feature of Dicer substrate hairpins. In addition, mammalian mirtrons exhibit unique patterns of ordered 5' and 3' heterogeneity, which reveal hidden complexity in miRNA processing pathways. These include broad 3'-uridylation of mirtron hairpins, atypically heterogeneous 5' termini that may result from exonucleolytic processing, and occasionally robust decapitation of the 5' guanine (G) of mirtron-5p species defined by splicing. Altogether, this study reveals that this extensive class of non-canonical miRNA bears a multitude of characteristic properties, many of which raise general mechanistic questions regarding the processing of endogenous hairpin transcripts.
Asunto(s)
Biología Computacional/métodos , MicroARNs/química , MicroARNs/genética , MicroARNs/metabolismo , Ribonucleasa III/metabolismo , Animales , Biblioteca de Genes , Humanos , Ratones , Modelos Genéticos , Análisis de Secuencia de ARNRESUMEN
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.
Asunto(s)
Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Perfilación de la Expresión Génica , Transcriptoma/genética , Animales , Caenorhabditis elegans/embriología , Caenorhabditis elegans/crecimiento & desarrollo , Cromatina/genética , Análisis por Conglomerados , Drosophila melanogaster/crecimiento & desarrollo , Regulación del Desarrollo de la Expresión Génica/genética , Histonas/metabolismo , Humanos , Larva/genética , Larva/crecimiento & desarrollo , Modelos Genéticos , Anotación de Secuencia Molecular , Regiones Promotoras Genéticas/genética , Pupa/genética , Pupa/crecimiento & desarrollo , ARN no Traducido/genética , Análisis de Secuencia de ARNRESUMEN
We expanded the knowledge base for Drosophila cell line transcriptomes by deeply sequencing their small RNAs. In total, we analyzed more than 1 billion raw reads from 53 libraries across 25 cell lines. We verify reproducibility of biological replicate data sets, determine common and distinct aspects of miRNA expression across cell lines, and infer the global impact of miRNAs on cell line transcriptomes. We next characterize their commonalities and differences in endo-siRNA populations. Interestingly, most cell lines exhibit enhanced TE-siRNA production relative to tissues, suggesting this as a common aspect of cell immortalization. We also broadly extend annotations of cis-NAT-siRNA loci, identifying ones with common expression across diverse cells and tissues, as well as cell-restricted loci. Finally, we characterize small RNAs in a set of ovary-derived cell lines, including somatic cells (OSS and OSC) and a mixed germline/somatic cell population (fGS/OSS) that exhibits ping-pong piRNA signatures. Collectively, the ovary data reveal new genic piRNA loci, including unusual configurations of piRNA-generating regions. Together with the companion analysis of mRNAs described in a previous study, these small RNA data provide comprehensive information on the transcriptional landscape of diverse Drosophila cell lines. These data should encourage broader usage of fly cell lines, beyond the few that are presently in common usage.
Asunto(s)
Drosophila/genética , Variación Genética , MicroARNs/genética , ARN Interferente Pequeño/genética , Animales , Secuencia de Bases , Línea Celular , Biología Computacional/métodos , Expresión Génica , Sitios Genéticos , Células Germinativas , Secuenciación de Nucleótidos de Alto Rendimiento , MicroARNs/química , Anotación de Secuencia Molecular , Datos de Secuencia Molecular , Conformación de Ácido Nucleico , ARN Interferente Pequeño/química , Alineación de SecuenciaRESUMEN
In the canonical animal microRNA (miRNA) pathway, Drosha generates â¼60- to 70-nucleotide pre-miRNA hairpins that are cleaved by Dicer into small RNA duplexes that load into Argonaute proteins, which retain a single mature strand in the active complex. The terminal loops of some miRNA hairpins regulate processing efficiency, but once liberated by Dicer, they are generally considered nonfunctional by-products. Here, we show that specific miRNA loops accumulate in effector Argonaute complexes in Drosophila and mediate miRNA-type repression. This was unexpected, since endogenous loading of Argonaute proteins was believed to occur exclusively via small RNA duplexes. Using in vitro assays, which recapitulate Argonaute-specific loop loading from synthetic pre-miRNAs and even single-stranded oligoribonucleotides corresponding to miRNA loops, we reveal that the loop-loading mechanism is distinct from duplex loading. We also show that miRNA loops loaded into the miRNA effector AGO1 are subject to 3' resection, and structure-function analyses indicate selectivity of loop loading. Finally, we demonstrate that select miRNA loops in mammals are similarly loaded into Argonaute complexes and direct target repression. Altogether, we reveal a conserved mechanism that yields functional RNAs from miRNA loop regions, broadening the repertoire of Argonaute-dependent regulatory RNAs and providing evidence for functionality of endogenous ssRNA species.
Asunto(s)
Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , MicroARNs/genética , MicroARNs/metabolismo , Conformación de Ácido Nucleico , Animales , Proteínas Argonautas/metabolismo , Secuencia Conservada , Proteínas de Drosophila/metabolismo , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Mamíferos , Ratones , MicroARNs/químicaRESUMEN
Atypical miRNA substrates do not fit criteria often used to annotate canonical miRNAs, and can escape the notice of miRNA genefinders. Recent analyses expanded the catalogs of invertebrate splicing-derived miRNAs ("mirtrons"), but only a few tens of mammalian mirtrons have been recognized to date. We performed meta-analysis of 737 mouse and human small RNA data sets comprising 2.83 billion raw reads. Using strict and conservative criteria, we provide confident annotation for 237 mouse and 240 human splicing-derived miRNAs, the vast majority of which are novel genes. These comprise three classes of splicing-derived miRNAs in mammals: conventional mirtrons, 5'-tailed mirtrons, and 3'-tailed mirtrons. In addition, we segregated several hundred additional human and mouse loci with candidate (and often compelling) evidence. Most of these loci arose relatively recently in their respective lineages. Nevertheless, some members in each of the three mirtron classes are conserved, indicating their incorporation into beneficial regulatory networks. We also provide the first Northern validation for mammalian mirtrons, and demonstrate Dicer-dependent association of mature miRNAs from all three classes of mirtrons with Ago2. The recognition of hundreds of mammalian mirtrons provides a new foundation for understanding the scope and evolutionary dynamics of Dicer substrates in mammals.
Asunto(s)
MicroARNs/genética , MicroARNs/metabolismo , Animales , Proteínas Argonautas/metabolismo , Secuencia de Bases , Secuencia Conservada , Regulación de la Expresión Génica , Sitios Genéticos , Humanos , Mamíferos/genética , Ratones , Anotación de Secuencia Molecular , Datos de Secuencia Molecular , ARN Pequeño no Traducido/genética , ARN Pequeño no Traducido/metabolismo , Alineación de SecuenciaRESUMEN
Nucleotide modifications to microRNAs or their precursors can influence their processing and/or activity. A challenge to their analysis is the lack of independent references for the termini generated by primary processing; typically, these are empirically assigned as the most abundant mapped reads. Mirtrons offer such an independent measure since these microRNA hairpins are defined by splicing. Consequently, mirtron-derived reads that deviate from splice sites reflect modification following primary processing. Analysis in Drosophila revealed multiple modification patterns, including select alterations of 5' termini, many 3' resection events, and unexpectedly abundant 3' untemplated monouridylation. Resections occur on mature AGO1-loaded species, whereas uridylation occurs on pre-miRNAs but is compatible with dicing and AGO1 loading. Strikingly, we found many mirtrons whose modified reads are more abundant than those produced by primary processing. In some cases, these abundant modified reads matched the genome owing to fortuitous uridines in downstream flanking exons, thus highlighting the value of an independent reference for the primary-processed sequence. We could further extend the principle of abundant and preferred uridylation of mirtrons, relative to canonical pre-miRNAs, to Caenorhabditis elegans, mouse, and human. Finally, we found that 3' resection occurs broadly across AGO1-loaded canonical miRNA and star species. Altogether, these findings substantially broaden the complexity of terminal modification pathways acting upon small regulatory RNAs.