RESUMEN
Ribonucleoprotein complexes are dynamic assemblies of RNA with RNA-binding proteins (RBPs), which can modulate the fate of the RNA molecules from transcription to degradation. Vice versa, RNA can regulate the interactions and functions of the associated proteins. Dysregulation of RBPs is linked to diseases such as cancer and neurological disorders. RNA and RBPs are present in mitotic structures like the centrosomes and spindle microtubules, but their influence on mitotic spindle integrity remains unknown. Thus, we applied the R-DeeP strategy for the proteome-wide identification of RNA-dependent proteins and complexes to cells synchronized in mitosis versus interphase. The resulting atlas of RNA-dependent proteins in cell division can be accessed through the R-DeeP 3.0 database (R-DeeP3.dkfz.de). It revealed key mitotic factors as RNA-dependent such as AURKA, KIFC1 and TPX2 that were linked to RNA despite their lack of canonical RNA-binding domains. KIFC1 was identified as a new interaction partner and phosphorylation substrate of AURKA at S349 and T359. In addition, KIFC1 interacted with both, AURKA and TPX2, in an RNA-dependent manner. Our data suggest a riboregulation of mitotic protein-protein interactions during spindle assembly, offering new perspectives on the control of cell division processes by RNA-protein complexes.
RESUMEN
CircRNAs are an important class of RNAs with diverse cellular functions in human physiology and disease. A thorough knowledge of circRNAs including their biogenesis and subcellular distribution is important to understand their roles in a wide variety of processes. However, the analysis of circRNAs from total RNA sequencing data remains challenging. Therefore, we developed Calcifer, a versatile workflow for circRNA annotation. Using Calcifer, we analysed APEX-Seq data to compare circRNA occurrence between whole cells, nucleus and subnuclear compartments. We generally find that circRNAs show higher abundance in whole cells compared to nuclear samples, consistent with their accumulation in the cytoplasm. The notable exception is the single-exon circRNA circCANX(9), which is unexpectedly enriched in the nucleus. In addition, we observe that circFIRRE prevails over the linear lncRNA FIRRE in both the cytoplasm and the nucleus. Zooming in on the subnuclear compartments, we show that circRNAs are strongly depleted from nuclear speckles, indicating that excess splicing factors in this compartment counteract back-splicing. Our results thereby provide valuable insights into the subnuclear distribution of circRNAs. Regarding circRNA function, we surprisingly find that the majority of all detected circRNAs possess complete open reading frames with potential for cap-independent translation. Overall, we show that Calcifer is an easy-to-use, versatile and sustainable workflow for the annotation of circRNAs which expands the repertoire of circRNA tools and allows to gain new insights into circRNA distribution and function.
Asunto(s)
Núcleo Celular , ARN Circular , ARN Circular/genética , ARN Circular/metabolismo , Humanos , Núcleo Celular/metabolismo , Núcleo Celular/genética , Citoplasma/metabolismo , Citoplasma/genética , Sistemas de Lectura Abierta , Anotación de Secuencia Molecular , ARN Largo no Codificante/genética , ARN Largo no Codificante/metabolismo , Empalme del ARN , Biología Computacional/métodos , Análisis de Secuencia de ARNRESUMEN
Mutations in splicing factor 3B subunit 1 (SF3B1) frequently occur in patients with chronic lymphocytic leukemia (CLL) and myelodysplastic syndromes (MDS). These mutations have different effects on the disease prognosis with beneficial effect in MDS and worse prognosis in CLL patients. A full-length transcriptome approach can expand our knowledge on SF3B1 mutation effects on RNA splicing and its contribution to patient survival and treatment options. We applied long-read transcriptome sequencing (LRTS) to 44 MDS and CLL patients, as well as two pairs of isogenic cell lines with and without SF3B1 mutations, and found >60% of novel isoforms. Splicing alterations were largely shared between cancer types and specifically affected the usage of introns and 3' splice sites. Our data highlighted a constrained window at canonical 3' splice sites in which dynamic splice site switches occurred in SF3B1-mutated patients. Using transcriptome-wide RNA binding maps and molecular dynamics simulations, we showed multimodal SF3B1 binding at 3' splice sites and predicted reduced RNA binding at the second binding pocket of SF3B1K700E Our work presents the hitherto most complete LRTS study of the SF3B1 mutation in CLL and MDS and provides a resource to study aberrant splicing in cancer. Moreover, we showed that different disease prognosis most likely results from the different cell types expanded during carcinogenesis rather than different mechanisms of action of the mutated SF3B1 These results have important implications for understanding the role of SF3B1 mutations in hematological malignancies and other related diseases.
RESUMEN
Global warming poses a threat for crops, therefore, the identification of thermotolerance mechanisms is a priority. In plants, the core factors that regulate transcription under heat stress (HS) are well described and include several HS transcription factors (HSFs). Despite the relevance of alternative splicing in HS response and thermotolerance, the core regulators of HS-sensitive alternative splicing have not been identified. In tomato, alternative splicing of HSFA2 is important for acclimation to HS. Here, we show that several members of the serine/arginine-rich family of splicing factors (SRSFs) suppress HSFA2 intron splicing. Individual-nucleotide resolution UV cross-linking and immunoprecipitation (iCLIP) combined with RNA-Seq revealed that RS2Z35 and RS2Z36, which make up a plant-specific clade of SR proteins, not only regulate HSFA2 but approximately 50% of RNAs that undergo HS-sensitive alternative splicing, with preferential binding to purine-rich RNA motifs. Single and double CRISPR rs2z mutant lines show a dysregulation of splicing and exhibit lower basal and acquired thermotolerance compared to wild type plants. Our results suggest that RS2Z35 and RS2Z36 have a central role in mitigation of the negative effects of HS on RNA splicing homeostasis, and their emergence might have contributed to the increased capacity of plants to acclimate to high temperatures.
RESUMEN
Motivation: A vast variety of biological questions connected to RNA-binding proteins can be tackled with UV crosslinking and immunoprecipitation (CLIP) experiments. However, the processing and analysis of CLIP data are rather complex. Moreover, different types of CLIP experiments like iCLIP or eCLIP are often processed in different ways, reducing comparability between multiple experiments. Therefore, we aimed to build an easy-to-use computational tool for the processing of CLIP data that can be used for both iCLIP and eCLIP data, as well as data from other truncation-based CLIP methods. Results: Here, we introduce racoon_clip, a sustainable and fully automated pipeline for the complete processing of iCLIP and eCLIP data to extract RNA binding signal at single-nucleotide resolution. racoon_clip is easy to install and execute, with multiple pre-settings and fully customizable parameters, and outputs a conclusive summary report with visualizations and statistics for all analysis steps. Availability and implementation: racoon_clip is implemented as a Snakemake-powered command line tool (Snakemake version ≥7.22, Python version ≥3.9). The latest release can be downloaded from GitHub (https://github.com/ZarnackGroup/racoon_clip/tree/main) and installed via pip. A detailed documentation, including installation, usage, and customization, can be found at https://racoon-clip.readthedocs.io/en/latest/. The example datasets can be downloaded from the Short Read Archive (SRA; iCLIP: SRR5646576, SRR5646577, SRR5646578) or the ENCODE Project (eCLIP: ENCSR202BFN).
RESUMEN
AT-rich interacting domain (ARID)-containing proteins, Arids, are a heterogeneous DNA-binding protein family involved in transcription regulation and chromatin processing. For the member Arid5a, no exact DNA-binding preference has been experimentally defined so far. Additionally, the protein binds to mRNA motifs for transcript stabilization, supposedly through the DNA-binding ARID domain. To date, however, no unbiased RNA motif definition and clear dissection of nucleic acid-binding through the ARID domain have been undertaken. Using NMR-centered biochemistry, we here define the Arid5a DNA preference. Further, high-throughput in vitro binding reveals a consensus RNA-binding motif engaged by the core ARID domain. Finally, transcriptome-wide binding (iCLIP2) reveals that Arid5a has a weak preference for (A)U-rich regions in pre-mRNA transcripts of factors related to RNA processing. We find that the intrinsically disordered regions flanking the ARID domain modulate the specificity and affinity of DNA binding, while they appear crucial for RNA interactions. Ultimately, our data suggest that Arid5a uses its extended ARID domain for bifunctional gene regulation and that the involvement of IDR extensions is a more general feature of Arids in interacting with different nucleic acids at the chromatin-mRNA interface.
Asunto(s)
Proteínas de Unión al ADN , ADN , Factores de Transcripción , Humanos , Proteínas de Unión al ADN/metabolismo , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/química , ADN/metabolismo , ADN/química , ADN/genética , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , Factores de Transcripción/química , Dominios Proteicos , Regulación de la Expresión Génica , Unión Proteica , ARN Mensajero/metabolismo , ARN Mensajero/genética , ARN/metabolismo , ARN/química , ARN/genéticaRESUMEN
MicroRNAs (miRNAs) are critical post-transcriptional regulators in many biological processes. They act by guiding RNA-induced silencing complexes to miRNA response elements (MREs) in target mRNAs, inducing translational inhibition and/or mRNA degradation. Functional MREs are expected to predominantly occur in the 3' untranslated region and involve perfect base-pairing of the miRNA seed. Here, we generate a high-resolution map of miR-181a/b-1 (miR-181) MREs to define the targeting rules of miR-181 in developing murine T cells. By combining a multi-omics approach with computational high-resolution analyses, we uncover novel miR-181 targets and demonstrate that miR-181 acts predominantly through RNA destabilization. Importantly, we discover an alternative seed match and identify a distinct set of targets with repeat elements in the coding sequence which are targeted by miR-181 and mediate translational inhibition. In conclusion, deep profiling of MREs in primary cells is critical to expand physiologically relevant targetomes and establish context-dependent miRNA targeting rules.
Asunto(s)
MicroARNs , Elementos de Respuesta , MicroARNs/genética , MicroARNs/metabolismo , Animales , Ratones , Estabilidad del ARN/genética , Regiones no Traducidas 3'/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , Linfocitos T/metabolismo , Sistemas de Lectura Abierta/genética , Ratones Endogámicos C57BLRESUMEN
Despite crucial roles of RNA-binding proteins (RBPs) in plant physiology and development, methods for determining their transcriptome-wide binding landscape are less developed than those used in other model organisms. Cross-linking and immunoprecipitation (CLIP) methods (based on UV-mediated generation of covalent bonds between RNAs and cognate RBPs in vivo, purification of the cross-linked complexes and identification of the co-purified RNAs by high-throughput sequencing) have been applied mainly in mammalian cells growing in monolayers or in translucent tissue. We have developed plant iCLIP2, an efficient protocol for performing individual-nucleotide-resolution CLIP (iCLIP) in plants, tailored to overcome the experimental hurdles posed by plant tissue. We optimized the UV dosage to efficiently cross-link RNA and proteins in plants and expressed epitope-tagged RBPs under the control of their native promoters in loss-of-function mutants. We select epitopes for which nanobodies are available, allowing stringent conditions for immunopurification of the RNA-protein complexes to be established. To overcome the inherently high RNase content of plant cells, RNase inhibitors are added and the limited RNA fragmentation step is modified. We combine the optimized isolation of RBP-bound RNAs with iCLIP2, a streamlined protocol that greatly enhances the efficiency of library preparation for high-throughput sequencing. Plant researchers with experience in molecular biology and handling of RNA can complete this iCLIP2 protocol in ~5 d. Finally, we describe a bioinformatics workflow to determine targets of Arabidopsis RBPs from iCLIP data, covering all steps from downloading sequencing reads to identifying cross-linking events ( https://github.com/malewins/Plant-iCLIPseq ), and present the R/Bioconductor package BindingSiteFinder to extract reproducible binding sites ( https://bioconductor.org/packages/release/bioc/html/BindingSiteFinder.html ).
Asunto(s)
Nucleótidos , ARN , Animales , ARN/genética , Nucleótidos/metabolismo , Unión Proteica , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismo , Sitios de Unión , Ribonucleasas/metabolismo , Inmunoprecipitación , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Mamíferos/genéticaRESUMEN
HnRNPs are ubiquitously expressed RNA-binding proteins, tightly controlling posttranscriptional gene regulation. Consequently, hnRNP networks are essential for cellular homeostasis and their dysregulation is associated with cancer and other diseases. However, the physiological function of hnRNPs in non-cancerous cell systems are poorly understood. We analyzed the importance of HNRNPDL in endothelial cell functions. Knockdown of HNRNPDL led to impaired proliferation, migration and sprouting of spheroids. Transcriptome analysis identified cyclin D1 (CCND1) and tropomyosin 4 (TPM4) as targets of HNRNPDL, reflecting the phenotypic changes after knockdown. Our findings underline the importance of HNRNPDL for the homeostasis of physiological processes in endothelial cells.
Asunto(s)
Células Endoteliales , Ribonucleoproteínas Nucleares Heterogéneas , Ribonucleoproteínas Nucleares Heterogéneas/genética , Células Endoteliales/metabolismo , Proteínas de Unión al ARN/metabolismoRESUMEN
Nuclear RNA binding proteins (RBPs) are difficult to study because they often belong to large protein families and form extensive networks of auto- and crossregulation. They are highly abundant and many localize to condensates with a slow turnover, requiring long depletion times or knockouts that cannot distinguish between direct and indirect or compensatory effects. Here, we developed a system that is optimized for the rapid degradation of nuclear RBPs, called hGRAD. It comes as a "one-fits-all" plasmid, and integration into any cell line with endogenously GFP-tagged proteins allows for an inducible, rapid, and complete knockdown. We show that the nuclear RBPs SRSF3, SRSF5, SRRM2, and NONO are completely cleared from nuclear speckles and paraspeckles within 2 h. hGRAD works in various cell types, is more efficient than previous methods, and does not require the expression of exogenous ubiquitin ligases. Combining SRSF5 hGRAD degradation with Nascent-seq uncovered transient transcript changes, compensatory mechanisms, and an effect of SRSF5 on transcript stability.
Asunto(s)
Técnicas de Silenciamiento del Gen , Proteínas de Unión al ARN , Línea Celular , Proteínas de Unión al ARN/genética , Plásmidos/genética , Ubiquitina-Proteína LigasasRESUMEN
The importance of RNA-binding proteins (RBPs) for plant responses to environmental stimuli and development is well documented. Insights into the portfolio of RNAs they recognize, however, clearly lack behind the understanding gathered in non-plant model organisms. Here, we characterize binding of the circadian clock-regulated Arabidopsis thaliana GLYCINE-RICH RNA-BINDING PROTEIN 7 (AtGRP7) to its target transcripts. We identified novel RNA targets from individual-nucleotide resolution UV crosslinking and immunoprecipitation (iCLIP) data using an improved bioinformatics pipeline that will be broadly applicable to plant RBP iCLIP data. 2705 transcripts with binding sites were identified in plants expressing AtGRP7-GFP that were not recovered in plants expressing an RNA-binding dead variant or GFP alone. A conserved RNA motif enriched in uridine residues was identified at the AtGRP7 binding sites. NMR titrations confirmed the preference of AtGRP7 for RNAs with a central U-rich motif. Among the bound RNAs, circadian clock-regulated transcripts were overrepresented. Peak abundance of the LHCB1.1 transcript encoding a chlorophyll-binding protein was reduced in plants overexpressing AtGRP7 whereas it was elevated in atgrp7 mutants, indicating that LHCB1.1 was regulated by AtGRP7 in a dose-dependent manner. In plants overexpressing AtGRP7, the LHCB1.1 half-life was shorter compared to wild-type plants whereas in atgrp7 mutant plants, the half-life was significantly longer. Thus, AtGRP7 modulates circadian oscillations of its in vivo binding target LHCB1.1 by affecting RNA stability.
Asunto(s)
Proteínas de Arabidopsis , Arabidopsis , Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Regulación de la Expresión Génica de las Plantas , Glicina/metabolismo , ARN/metabolismo , Estabilidad del ARN , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismoRESUMEN
Fungal pathogens depend on sophisticated gene expression programs for successful infection. A crucial component is RNA regulation mediated by RNA-binding proteins (RBPs). However, little is known about the spatiotemporal RNA control mechanisms during fungal pathogenicity. Here, we discover that the RBP Khd4 defines a distinct mRNA regulon to orchestrate membrane trafficking during pathogenic development of Ustilago maydis. By establishing hyperTRIBE for fungal RBPs, we generated a comprehensive transcriptome-wide map of Khd4 interactions in vivo. We identify a defined set of target mRNAs enriched for regulatory proteins involved, e.g., in GTPase signaling. Khd4 controls the stability of target mRNAs via its cognate regulatory element AUACCC present in their 3' untranslated regions. Studying individual examples reveals a unique link between Khd4 and vacuole maturation. Thus, we uncover a distinct role for an RNA stability factor defining a specific mRNA regulon for membrane trafficking during pathogenicity.
Asunto(s)
Estabilidad del ARN , Regulón , ARN Mensajero/genética , Regulón/genética , Regiones no Traducidas 3'/genéticaRESUMEN
Splicing of pre-mRNAs critically contributes to gene regulation and proteome expansion in eukaryotes, but our understanding of the recognition and pairing of splice sites during spliceosome assembly lacks detail. Here, we identify the multidomain RNA-binding protein FUBP1 as a key splicing factor that binds to a hitherto unknown cis-regulatory motif. By collecting NMR, structural, and in vivo interaction data, we demonstrate that FUBP1 stabilizes U2AF2 and SF1, key components at the 3' splice site, through multivalent binding interfaces located within its disordered regions. Transcriptional profiling and kinetic modeling reveal that FUBP1 is required for efficient splicing of long introns, which is impaired in cancer patients harboring FUBP1 mutations. Notably, FUBP1 interacts with numerous U1 snRNP-associated proteins, suggesting a unique role for FUBP1 in splice site bridging for long introns. We propose a compelling model for 3' splice site recognition of long introns, which represent 80% of all human introns.
Asunto(s)
Sitios de Empalme de ARN , Empalme del ARN , Humanos , Sitios de Empalme de ARN/genética , Intrones/genética , Factores de Empalme de ARN/genética , Factores de Empalme de ARN/metabolismo , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismo , Precursores del ARN/genética , Precursores del ARN/metabolismo , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismoRESUMEN
Circular RNAs are generated by backsplicing and control cellular signaling and phenotypes. Pericytes stabilize capillary structures and play important roles in the formation and maintenance of blood vessels. Here, we characterize hypoxia-regulated circular RNAs (circRNAs) in human pericytes and show that the circular RNA of procollagen-lysine,2-oxoglutarate 5-dioxygenase-2 (circPLOD2) is induced by hypoxia and regulates pericyte functions. Silencing of circPLOD2 affects pericytes and increases proliferation, migration, and secretion of soluble angiogenic proteins, thereby enhancing endothelial migration and network capability. Transcriptional and epigenomic profiling of circPLOD2-depleted cells reveals widespread changes in gene expression and identifies the transcription factor krüppel-like factor 4 (KLF4) as a key effector of the circPLOD2-mediated changes. KLF4 depletion mimics circPLOD2 silencing, whereas KLF4 overexpression reverses the effects of circPLOD2 depletion on proliferation and endothelial-pericyte interactions. Together, these data reveal an important function of circPLOD2 in controlling pericyte proliferation and capillary formation and show that the circPLOD2-mediated regulation of KLF4 significantly contributes to the transcriptional response to hypoxia.
Asunto(s)
Pericitos , ARN Circular , Humanos , Hipoxia/metabolismo , Pericitos/metabolismo , ARN Circular/genética , ARN Circular/metabolismoRESUMEN
Hypoxia contributes to numerous pathophysiological conditions including inflammation-associated diseases. We characterized the impact of hypoxia on the immunometabolic cross-talk between cholesterol and interferon (IFN) responses. Specifically, hypoxia reduced cholesterol biosynthesis flux and provoked a compensatory activation of sterol regulatory element-binding protein 2 (SREBP2) in monocytes. Concomitantly, a broad range of interferon-stimulated genes (ISGs) increased under hypoxia in the absence of an inflammatory stimulus. While changes in cholesterol biosynthesis intermediates and SREBP2 activity did not contribute to hypoxic ISG induction, intracellular cholesterol distribution appeared critical to enhance hypoxic expression of chemokine ISGs. Importantly, hypoxia further boosted chemokine ISG expression in monocytes upon infection with severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2). Mechanistically, hypoxia sensitized toll-like receptor 4 (TLR4) signaling to activation by SARS-CoV-2 spike protein, which emerged as a major signaling hub to enhance chemokine ISG induction following SARS-CoV-2 infection of hypoxic monocytes. These data depict a hypoxia-regulated immunometabolic mechanism with implications for the development of systemic inflammatory responses in severe cases of coronavirus disease-2019 (COVID-19).
Asunto(s)
COVID-19 , Interferones , Humanos , Interferones/farmacología , Monocitos , SARS-CoV-2 , Quimiocinas , Hipoxia , ColesterolRESUMEN
The expanding field of epitranscriptomics might rival the epigenome in the diversity of biological processes impacted. In recent years, the development of new high-throughput experimental and computational techniques has been a key driving force in discovering the properties of RNA modifications. Machine learning applications, such as for classification, clustering or de novo identification, have been critical in these advances. Nonetheless, various challenges remain before the full potential of machine learning for epitranscriptomics can be leveraged. In this review, we provide a comprehensive survey of machine learning methods to detect RNA modifications using diverse input data sources. We describe strategies to train and test machine learning methods and to encode and interpret features that are relevant for epitranscriptomics. Finally, we identify some of the current challenges and open questions about RNA modification analysis, including the ambiguity in predicting RNA modifications in transcript isoforms or in single nucleotides, or the lack of complete ground truth sets to test RNA modifications. We believe this review will inspire and benefit the rapidly developing field of epitranscriptomics in addressing the current limitations through the effective use of machine learning.
Asunto(s)
Aprendizaje Automático , Transcriptoma , ARN Mensajero , ARN/genéticaRESUMEN
In mammals, X-chromosomal genes are expressed from a single copy since males (XY) possess a single X chromosome, while females (XX) undergo X inactivation. To compensate for this reduction in dosage compared with two active copies of autosomes, it has been proposed that genes from the active X chromosome exhibit dosage compensation. However, the existence and mechanisms of X-to-autosome dosage compensation are still under debate. Here we show that X-chromosomal transcripts have fewer m6A modifications and are more stable than their autosomal counterparts. Acute depletion of m6A selectively stabilizes autosomal transcripts, resulting in perturbed dosage compensation in mouse embryonic stem cells. We propose that higher stability of X-chromosomal transcripts is directed by lower levels of m6A, indicating that mammalian dosage compensation is partly regulated by epitranscriptomic RNA modifications.
Asunto(s)
Compensación de Dosificación (Genética) , Cromosoma X , Masculino , Femenino , Animales , Ratones , Metilación , Cromosoma X/genética , Mamíferos/genética , Estabilidad del ARNRESUMEN
The RNA-binding protein PURA has been implicated in the rare, monogenetic, neurodevelopmental disorder PURA Syndrome. PURA binds both DNA and RNA and has been associated with various cellular functions. Only little is known about its main cellular roles and the molecular pathways affected upon PURA depletion. Here, we show that PURA is predominantly located in the cytoplasm, where it binds to thousands of mRNAs. Many of these transcripts change abundance in response to PURA depletion. The encoded proteins suggest a role for PURA in immune responses, mitochondrial function, autophagy and processing (P)-body activity. Intriguingly, reduced PURA levels decrease the expression of the integral P-body components LSM14A and DDX6 and strongly affect P-body formation in human cells. Furthermore, PURA knockdown results in stabilization of P-body-enriched transcripts, whereas other mRNAs are not affected. Hence, reduced PURA levels, as reported in patients with PURA Syndrome, influence the formation and composition of this phase-separated RNA processing machinery. Our study proposes PURA Syndrome as a new model to study the tight connection between P-body-associated RNA regulation and neurodevelopmental disorders.