ABSTRACT
Alternative splicing (AS) is a critical regulatory layer; yet, factors controlling functionally coordinated splicing programs during developmental transitions are poorly understood. Here, we employ a screening strategy to identify factors controlling dynamic splicing events important for mammalian neurogenesis. Among previously unknown regulators, Rbm38 acts widely to negatively control neural AS, in part through interactions mediated by the established repressor of splicing, Ptbp1. Puf60, a ubiquitous factor, is surprisingly found to promote neural splicing patterns. This activity requires a conserved, neural-differential exon that remodels Puf60 co-factor interactions. Ablation of this exon rewires distinct AS networks in embryonic stem cells and at different stages of mouse neurogenesis. Single-cell transcriptome analyses further reveal distinct roles for Rbm38 and Puf60 isoforms in establishing neuronal identity. Our results describe important roles for previously unknown regulators of neurogenesis and establish how an alternative exon in a widely expressed splicing factor orchestrates temporal control over cell differentiation.
Subject(s)
Neurogenesis , RNA Splicing , Alternative Splicing , Animals , Exons/genetics , Mammals , Mice , Neurogenesis/genetics , Neurons , RNA-Binding Proteins/genetics
ABSTRACT
DNA strand asymmetries can have a major effect on several biological functions, including replication, transcription and transcription factor binding. As such, DNA strand asymmetries and mutational strand bias can provide information about biological function. However, a versatile tool to explore this does not exist. Here, we present Asymmetron, a user-friendly computational tool that performs statistical analysis and visualizations for the evaluation of strand asymmetries. Asymmetron takes as input DNA features provided with strand annotation and outputs strand asymmetries for consecutive occurrences of a single DNA feature or between pairs of features. We illustrate the use of Asymmetron by identifying transcriptional and replicative strand asymmetries of germline structural variant breakpoints. We also show that the orientation of the binding sites of 45% of the human transcription factors analyzed has a significant DNA strand bias in transcribed regions, which is also corroborated in ChIP-seq analyses and is likely associated with transcription. In summary, we provide a novel tool to assess DNA strand asymmetries and show how it can be used to derive new insights across a variety of biological disciplines.
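The idea of a strand asymmetry between consecutive occurrences of one feature can be illustrated with a minimal sketch. This is not Asymmetron's code or interface: the function names, the example strand list, and the choice of an exact binomial test against a 50/50 expectation are all assumptions made for illustration.

```python
from math import comb

def consecutive_strand_counts(strands):
    """Count same-strand and opposite-strand pairs among consecutive features.

    `strands` is a list of '+' / '-' labels, ordered by genomic position
    (hypothetical input format, not the tool's own).
    """
    same = sum(1 for a, b in zip(strands, strands[1:]) if a == b)
    opposite = len(strands) - 1 - same
    return same, opposite

def binomial_two_sided_p(k, n, p=0.5):
    """Exact two-sided binomial p-value: total probability of all outcomes
    that are no more likely than the observed count k."""
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = pmf[k]
    return min(1.0, sum(x for x in pmf if x <= observed * (1 + 1e-9)))

# Hypothetical strand annotations for ten consecutive features.
strands = ["+", "+", "+", "-", "-", "+", "+", "+", "+", "-"]
same, opposite = consecutive_strand_counts(strands)
p_value = binomial_two_sided_p(same, same + opposite)
print(same, opposite, round(p_value, 4))
```

With real data, one would typically also restrict pairs by distance and correct for multiple testing; those steps are omitted here.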
Subject(s)
Computational Biology/methods , DNA Replication/genetics , DNA/genetics , Mutation , Transcription, Genetic/genetics , A549 Cells , Algorithms , Cell Line, Transformed , DNA/chemistry , DNA/metabolism , Hep G2 Cells , Humans , K562 Cells , MCF-7 Cells , Models, Genetic , Protein Binding , Transcription Factors/genetics , Transcription Factors/metabolism
ABSTRACT
We uncovered the diversity of non-canonical splice sites in the human transcriptome using deep transcriptome profiling. We mapped a total of 3.7 billion human RNA-seq reads and developed a set of stringent filters to avoid false non-canonical splice site detections. We identified 184 splice sites with non-canonical dinucleotides and U2/U12-like consensus sequences. We selected 10 of these U2/U12-like non-canonical splice site events and successfully validated 9 of them via reverse transcriptase-polymerase chain reaction and Sanger sequencing. Analyses of the 184 U2/U12-like non-canonical splice sites indicate that 51% of them are not annotated in GENCODE. In addition, 28% of them are conserved in mouse and 76% are involved in alternative splicing events, some of them with tissue-specific alternative splicing patterns. Interestingly, our analysis identified some U2/U12-like non-canonical splice sites that are converted into canonical splice sites by RNA A-to-I editing. Moreover, the U2/U12-like non-canonical splice sites have a differential distribution of splicing regulatory sequences, which may contribute to their recognition and regulation. Our analysis provides a high-confidence group of U2/U12-like non-canonical splice sites, which exhibit distinctive features among the total human splice sites.
Subject(s)
RNA Splice Sites , Transcriptome , Alternative Splicing , Animals , Artifacts , Base Sequence , Cell Line , Consensus Sequence , Evolution, Molecular , Gene Expression Profiling , Humans , Introns , Mice , Molecular Sequence Annotation , RNA Editing , Regulatory Sequences, Ribonucleic Acid , Sequence Analysis, RNA , Spliceosomes/metabolism
ABSTRACT
Single-cell RNA-seq (scRNA-seq) is widely used for transcriptome profiling, but most analyses focus on gene-level events, with less attention devoted to alternative splicing. Here, we present scASfind, a novel computational method to allow for quantitative analysis of cell type-specific splicing events using full-length scRNA-seq data. ScASfind utilizes an efficient data structure to store the percent spliced-in value for each splicing event. This makes it possible to exhaustively search for patterns among all differential splicing events, allowing us to identify marker events, mutually exclusive events, and events involving large blocks of exons that are specific to one or more cell types.
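The percent spliced-in (PSI) value that the abstract refers to is conventionally the fraction of reads supporting inclusion of an event out of all reads informative for it. The sketch below shows that calculation and a per-cell-type average. It is not scASfind's data structure or API: the function names, the `min_reads` threshold, and the example counts are hypothetical.

```python
from collections import defaultdict
from statistics import mean

def psi(inclusion, exclusion, min_reads=5):
    """Percent spliced-in as a fraction in [0, 1].

    Returns None when coverage is below `min_reads` (an assumed threshold),
    because PSI from very few reads is unreliable.
    """
    total = inclusion + exclusion
    return inclusion / total if total >= min_reads else None

def mean_psi_by_cell_type(cells):
    """Average PSI per cell type for a single splicing event.

    `cells` is an iterable of (cell_type, inclusion_reads, exclusion_reads).
    """
    grouped = defaultdict(list)
    for cell_type, inc, exc in cells:
        value = psi(inc, exc)
        if value is not None:
            grouped[cell_type].append(value)
    return {ct: mean(vals) for ct, vals in grouped.items()}

# Hypothetical read counts for one cassette exon across five cells.
cells = [
    ("neuron", 18, 2),
    ("neuron", 8, 2),
    ("glia", 1, 9),
    ("glia", 3, 7),
    ("glia", 1, 1),  # skipped: only two informative reads
]
means = mean_psi_by_cell_type(cells)
print(means)
```

A large difference in mean PSI between cell types is the kind of signal a marker-event search would look for.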
Subject(s)
Alternative Splicing , RNA-Seq , Single-Cell Gene Expression Analysis , Animals , Humans , Computational Biology/methods , Data Mining , Exons , Gene Expression Profiling/methods , RNA-Seq/methods , Software
ABSTRACT
The analysis of RNA-seq data has greatly improved the characterization and understanding of the transcriptome. In particular, RNA-seq experiments have extended catalogs of alternative splicing events. However, the analysis of RNA-seq data for the detection and quantification of microexons, extremely short exons of up to 30 nt, requires specialized computational workflows. Here, we describe MicroExonator, a reproducible computational workflow for microexon splicing analysis using bulk or single-cell RNA-seq data.
Subject(s)
Alternative Splicing , RNA Splicing , Exons/genetics , RNA-Seq , Sequence Analysis, RNA , Transcriptome
ABSTRACT
Even though the functional role of mRNA molecules is primarily decided by the nucleotide sequence, several properties are determined by secondary structure conformations. Examples of secondary structures include long range interactions, hairpins, R-loops and G-quadruplexes and they are formed through interactions of non-adjacent nucleotides. Here, we discuss advances in our understanding of how secondary structures can impact RNA synthesis, splicing, translation and mRNA half-life. During RNA synthesis, secondary structures determine RNA polymerase II (RNAPII) speed, thereby influencing splicing. Splicing is also determined by RNA binding proteins and their binding rates are modulated by secondary structures. For the initiation of translation, secondary structures can control the choice of translation start site. Here, we highlight the mechanisms by which secondary structures modulate these processes, discuss advances in technologies to detect and study them systematically, and consider the roles of RNA secondary structures in disease.
ABSTRACT
Alternative splicing is central to metazoan gene regulation, but the regulatory mechanisms are incompletely understood. Here, we show that G-quadruplex (G4) motifs are enriched ~3-fold near splice junctions. The importance of G4s in RNA is emphasised by a higher enrichment for the non-template strand. RNA-seq data from mouse and human neurons reveals an enrichment of G4s at exons that were skipped following depolarisation induced by potassium chloride. We validate the formation of stable RNA G4s for three candidate splice sites by circular dichroism spectroscopy, UV-melting and fluorescence measurements. Moreover, we find that sQTLs are enriched at G4s, and a minigene experiment provides further support for their role in promoting exon inclusion. Analysis of >1,800 high-throughput experiments reveals multiple RNA binding proteins associated with G4s. Finally, exploration of G4 motifs across eleven species shows strong enrichment at splice sites in mammals and birds, suggesting an evolutionary conserved splice regulatory mechanism.
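Scanning a sequence for G4 motifs on both strands can be sketched as follows. The abstract does not state which motif definition was used, so the pattern below (four runs of at least three guanines separated by loops of 1 to 7 nucleotides) is an assumption based on a commonly used definition, and the function names and example sequence are hypothetical.

```python
import re

# Assumed motif definition: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+
G4 = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]

def find_g4_motifs(seq):
    """Return (start, end, strand) for non-overlapping G4 motif matches.

    Coordinates are 0-based, end-exclusive, and always given on the
    forward strand, including for matches found on the reverse strand.
    """
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), m.end(), "+") for m in G4.finditer(seq)]
    rc = reverse_complement(seq)
    hits += [(n - m.end(), n - m.start(), "-") for m in G4.finditer(rc)]
    return sorted(hits)

# Hypothetical sequence containing one telomere-like G4 motif.
example = "AT" + "GGGTTAGGGTTAGGGTTAGGG" + "ACGT"
print(find_g4_motifs(example))
print(find_g4_motifs(reverse_complement(example)))
```

Reporting the strand matters here because the abstract distinguishes enrichment on the non-template strand, which is the strand whose sequence appears in the RNA.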
Subject(s)
Alternative Splicing , G-Quadruplexes , Animals , Exons/genetics , Mammals/genetics , Mice , RNA/metabolism , RNA-Binding Proteins/metabolism
ABSTRACT
Alternative DNA conformations, termed non-B DNA structures, can affect transcription, but the underlying mechanisms and their functional impact have not been systematically characterized. Here, we used computational genomic analyses coupled with massively parallel reporter assays (MPRAs) to show that certain non-B DNA structures have a substantial effect on gene expression. Genomic analyses found that non-B DNA structures at promoters harbor an excess of germline variants. Analysis of multiple MPRAs, including a promoter library specifically designed to perturb non-B DNA structures, functionally validated that Z-DNA can significantly affect promoter activity. We also observed that biophysical properties of non-B DNA motifs, such as the length of Z-DNA motifs and the orientation of G-quadruplex structures relative to transcriptional direction, have a significant effect on promoter activity. Combined, their higher mutation rate and functional effect on transcription implicate a subset of non-B DNA motifs as major drivers of human gene-expression-associated phenotypes.
ABSTRACT
Most methods for single-cell transcriptome sequencing amplify the termini of polyadenylated transcripts, capturing only a small fraction of the total cellular transcriptome. This precludes the detection of many long non-coding, short non-coding and non-polyadenylated protein-coding transcripts and hinders alternative splicing analysis. We, therefore, developed VASA-seq to detect the total transcriptome in single cells, which is enabled by fragmenting and tailing all RNA molecules subsequent to cell lysis. The method is compatible with both plate-based formats and droplet microfluidics. We applied VASA-seq to more than 30,000 single cells in the developing mouse embryo during gastrulation and early organogenesis. Analyzing the dynamics of the total single-cell transcriptome, we discovered cell type markers, many based on non-coding RNA, and performed in vivo cell cycle analysis via detection of non-polyadenylated histone genes. RNA velocity characterization was improved, accurately retracing blood maturation trajectories. Moreover, our VASA-seq data provide a comprehensive analysis of alternative splicing during mammalian development, which highlighted substantial rearrangements during blood development and heart morphogenesis.
Subject(s)
High-Throughput Nucleotide Sequencing , Transcriptome , Mice , Animals , Sequence Analysis, RNA/methods , High-Throughput Nucleotide Sequencing/methods , Alternative Splicing/genetics , RNA/metabolism , Gene Expression Profiling/methods , Mammals/genetics
ABSTRACT
Stochastic gene expression leads to inherent variability in expression outcomes even in isogenic single-celled organisms grown in the same environment. The Drop-Seq technology facilitates transcriptomic studies of individual mammalian cells, and it has had transformative effects on the characterization of cell identity and function based on single-cell transcript counts. However, application of this technology to organisms with different cell size and morphology characteristics has been challenging. Here we present yeastDrop-Seq, a yeast-optimized platform for quantifying the number of distinct mRNA molecules in a cell-specific manner in individual yeast cells. Using yeastDrop-Seq, we measured the transcriptomic impact of the lifespan-extending compound mycophenolic acid and its epistatic agent guanine. Each treatment condition had a distinct transcriptomic footprint on isogenic yeast cells as indicated by distinct clustering with clear separations among the different groups. The yeastDrop-Seq platform facilitates transcriptomic profiling of yeast cells for basic science and biotechnology applications.
Subject(s)
Gene Expression Profiling/methods , Gene Expression Regulation, Fungal/genetics , RNA, Messenger/genetics , Saccharomyces cerevisiae/genetics , Single-Cell Analysis/methods , Transcriptome/genetics , Cluster Analysis , Gene Expression Regulation, Fungal/drug effects , Gene Ontology , Guanine/metabolism , Guanine/pharmacology , Mycophenolic Acid/metabolism , Mycophenolic Acid/pharmacology , RNA, Messenger/metabolism , Saccharomyces cerevisiae/cytology , Sequence Analysis, RNA/methods , Transcriptome/drug effects
ABSTRACT
Single traumatic events that elicit an exaggerated stress response can lead to the development of neuropsychiatric conditions. Rodent studies suggested germline RNA as a mediator of effects of chronic environmental exposures to the progeny. The effects of an acute paternal stress exposure on the germline and their potential consequences on offspring remain to be seen. We find that acute administration of an agonist for the stress-sensitive Glucocorticoid receptor, using the common corticosteroid dexamethasone, affects the RNA payload of mature sperm as soon as 3 hr after exposure. It further impacts early embryonic transcriptional trajectories, as determined by single-embryo sequencing, and metabolism in the offspring. We show persistent regulation of tRNA fragments in sperm and descendant 2-cell embryos, suggesting transmission from sperm to embryo. Lastly, we unravel environmentally induced alterations in sperm circRNAs and their targets in the early embryo, highlighting this class as an additional candidate in RNA-mediated inheritance of disease risk.
ABSTRACT
BACKGROUND: Microexons, exons that are ≤ 30 nucleotides, are a highly conserved and dynamically regulated set of cassette exons. They have key roles in nervous system development and function, as evidenced by recent results demonstrating the impact of microexons on behaviour and cognition. However, microexons are often overlooked due to the difficulty of detecting them using standard RNA-seq aligners. RESULTS: Here, we present MicroExonator, a novel pipeline for reproducible de novo discovery and quantification of microexons. We process 289 RNA-seq datasets from eighteen mouse tissues corresponding to nine embryonic and postnatal stages, providing the most comprehensive survey of microexons available for mice. We detect 2984 microexons, 332 of which are differentially spliced throughout mouse embryonic brain development, including 29 that are not present in mouse transcript annotation databases. Unsupervised clustering of microexons based on their inclusion patterns segregates brain tissues by developmental time, and further analysis suggests a key function for microexons in axon growth and synapse formation. Finally, we analyse single-cell RNA-seq data from the mouse visual cortex, and for the first time, we report differential inclusion between neuronal subpopulations, suggesting that some microexons could be cell type-specific. CONCLUSIONS: MicroExonator facilitates the investigation of microexons in transcriptome studies, particularly when analysing large volumes of data. As a proof of principle, we use MicroExonator to analyse a large collection of both mouse bulk and single-cell RNA-seq datasets. The analyses enabled the discovery of previously uncharacterized microexons, and our study provides a comprehensive microexon inclusion catalogue during mouse development.
Subject(s)
Embryonic Development/genetics , Exons , Neurons/metabolism , Animals , Base Sequence , Brain/growth & development , Brain/metabolism , Mice , Neurogenesis/genetics , Neurulation/genetics , Neurulation/physiology , RNA Splicing , Sequence Analysis, RNA , Single-Cell Analysis , Software , Transcriptome , Visual Cortex , Zebrafish
ABSTRACT
Small RNAs play a crucial role in genome defense against transposable elements and guide Argonaute proteins to nascent RNA transcripts to induce co-transcriptional gene silencing. However, the molecular basis of this process remains unknown. Here, we identify the conserved RNA helicase Aquarius/EMB-4 as a direct and essential link between small RNA pathways and the transcriptional machinery in Caenorhabditis elegans. Aquarius physically interacts with the germline Argonaute HRDE-1. Aquarius is required to initiate small-RNA-induced heritable gene silencing. HRDE-1 and Aquarius silence overlapping sets of genes and transposable elements. Surprisingly, removal of introns from a target gene abolishes the requirement for Aquarius, but not HRDE-1, for small RNA-dependent gene silencing. We conclude that Aquarius allows small RNA pathways to compete for access to nascent transcripts undergoing co-transcriptional splicing in order to detect and silence transposable elements. Thus, Aquarius and HRDE-1 act as gatekeepers coordinating gene expression and genome defense.
Subject(s)
Argonaute Proteins/genetics , Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Nuclear Proteins/genetics , RNA Interference , Animals , Argonaute Proteins/metabolism , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/metabolism , DNA Transposable Elements , Introns , Nuclear Proteins/metabolism , Protein Binding
ABSTRACT
Aging is the main risk factor for Alzheimer's disease (AD) and is associated with conspicuous changes in microglial activation. Aged microglia exhibit increased expression of cytokines, exacerbated reactivity to various stimuli, oxidative stress, and reduced phagocytosis of β-amyloid (Aβ). Whereas normal inflammation is protective, it becomes dysregulated in the presence of a persistent stimulus, or in the context of an inflammatory environment, as observed in aging. Thus, neuroinflammation can be a self-perpetuating deleterious response, becoming a source of additional injury to host cells in neurodegenerative diseases. In aged individuals, although transforming growth factor β (TGFβ) is upregulated, its canonical Smad3 signaling is greatly reduced and neuroinflammation persists. This age-related Smad3 impairment reduces protective activation while facilitating cytotoxic activation of microglia through several cellular mechanisms, potentiating microglia-mediated neurodegeneration. Here, we critically discuss the role of TGFβ-Smad signaling in the cytotoxic activation of microglia and its relevance in the pathogenesis of AD. Other protective functions, such as phagocytosis, although observed in aged animals, are not further induced by inflammatory stimuli and TGFβ1. In silico analysis revealed that the increased expression of scavenger receptor (SR)-A, which is involved in Aβ uptake and cell activation, by microglia exposed to TGFβ through a Smad3-dependent mechanism could be mediated by the transcriptional co-factors Smad2/3 acting on the MSR1 gene. We discuss how changes in TGFβ-mediated regulation could at least partially mediate age-associated microglial changes and, together with other changes in the inflammatory response, could reduce protective activation and potentiate microglial cytotoxicity, thereby promoting neurodegenerative diseases.
ABSTRACT
The transcriptional corepressor CoREST (CoREST1, rcor1), together with the histone demethylase LSD1 (KDM1A) and the histone deacetylases HDAC1/2, forms LSD1-CoREST-HDAC (LCH) transcriptional complexes that regulate gene expression. CoREST1 belongs to a family that also comprises CoREST2 (rcor2) and CoREST3 (rcor3). CoREST1 represses the expression of neuronal genes during neuronal differentiation. However, the role of the paralogs CoREST2 and CoREST3 in this process is just starting to emerge. Here, we report the expression of all CoRESTs and their partners LSD1 and HDAC1/2 in two models of neuronal differentiation: the Nerve-Growth-Factor (NGF)-induced neuronal phenotype of PC12 cells, and in vitro maturation of embryonic rat cortical neurons. In both models, a concomitant and gradual decrease of LSD1, HDAC1, HDAC2, CoREST1, and CoREST2, but not CoREST3, was observed. As required by the study, the full-length rat rcor1 gene was identified using in silico analysis of the available rat genome. The work was also complemented by the analysis of rat RNA-seq databases. The analysis showed that all CoRESTs, including the four identified splicing variants of rat CoREST3, display wide expression in adult tissues. Moreover, the analysis of RNA-seq databases showed that CoREST2 displays higher expression than CoREST1 and CoREST3 in the mature brain. Immunofluorescence assays and immunoblots of adult rat brain showed that all CoRESTs are present in both glia and neurons. Regarding functional partnership, CoREST2 and CoREST3 interact with all LSD1 splicing variants. In conclusion, neuronal differentiation is accompanied by decreased expression of all core components of LCH complexes, but not CoREST3. The combination of the differential transcriptional repressor capacity of LCH complexes and the variable protein levels of their different components should result in finely tuned gene expression during neuronal differentiation and in the adult brain.
Subject(s)
Co-Repressor Proteins/genetics , Histone Deacetylase 1/genetics , Histone Deacetylase 2/genetics , Histone Demethylases/genetics , Neurogenesis/genetics , Neurons/cytology , Animals , Down-Regulation/genetics , Embryo, Mammalian , Female , Gene Expression Regulation, Developmental , Gene Expression Regulation, Enzymologic , HEK293 Cells , Humans , Male , Neurons/metabolism , PC12 Cells , Protein Isoforms/genetics , Rats , Rats, Sprague-Dawley
ABSTRACT
Causes of lower induction of Hsp70 in neurons during heat shock are still a matter of debate. To further inquire into the mechanisms regulating Hsp70 expression in neurons, we studied the activity of Heat Shock Factor 1 (HSF1) and histone posttranslational modifications (PTMs) at the hsp70 promoter in rat cortical neurons. Heat shock induced a transient and efficient translocation of HSF1 to neuronal nuclei. However, no binding of HSF1 at the hsp70 promoter was detected while it bound to the hsp25 promoter in cortical neurons during heat shock. Histone PTMs analysis showed that the hsp70 promoter harbors lower levels of histone H3 and H4 acetylation in cortical neurons compared to PC12 cells under basal conditions. Transcriptomic profiling data analysis showed a predominant usage of cryptic transcriptional start sites at hsp70 gene in the rat cerebral cortex, compared with the whole brain. These data support a weaker activation of hsp70 canonical promoter. Heat shock increased H3Ac at the hsp70 promoter in PC12 cells, which correlated with increased Hsp70 expression while no modifications occurred at the hsp70 promoter in cortical neurons. Increased histone H3 acetylation by Trichostatin A led to hsp70 mRNA and protein induction in cortical neurons. In conclusion, we found that two independent mechanisms maintain a lower induction of Hsp70 in cortical neurons. First, HSF1 fails to bind specifically to the hsp70 promoter in cortical neurons during heat shock and, second, the hsp70 promoter is less accessible in neurons compared to non-neuronal cells due to histone deacetylases repression.