RESUMEN
The mRNA 3' poly(A) tail plays a critical role in regulating both mRNA translation and turnover. It is bound by the cytoplasmic poly(A) binding protein (PABPC), an evolutionarily conserved protein that can interact with translation factors and mRNA decay machineries to regulate gene expression. Mammalian PABPC1, the prototypical PABPC, is expressed in most tissues and interacts with eukaryotic translation initiation factor 4G (eIF4G) to stimulate translation in specific contexts. In this study, we uncovered a new mammalian PABPC, which we named neural PABP (neuPABP), as it is predominantly expressed in the brain. neuPABP maintains a unique architecture as compared with other PABPCs, containing only two RNA recognition motifs (RRMs) and maintaining a unique N-terminal domain of unknown function. neuPABP expression is activated in neurons as they mature during synaptogenesis, where neuPABP localizes to the soma and postsynaptic densities. neuPABP interacts with the noncoding RNA BC1, as well as mRNAs coding for ribosomal and mitochondrial proteins. However, in contrast to PABPC1, neuPABP does not associate with actively translating mRNAs in the brain. In keeping with this, we show that neuPABP has evolved such that it does not bind eIF4G and as a result fails to support protein synthesis in vitro. Taken together, these results indicate that mammals have expanded their PABPC repertoire in the brain and propose that neuPABP may support the translational repression of select mRNAs.
Asunto(s)
Factor 4G Eucariótico de Iniciación , Proteínas de Unión a Poli(A) , Animales , Proteínas de Unión a Poli(A)/genética , Neuronas , Encéfalo , MamíferosRESUMEN
Alternative polyadenylation (APA) enhances gene regulatory potential by increasing the diversity of mRNA transcripts. 3' UTR shortening through APA correlates with enhanced cellular proliferation and is a widespread phenomenon in tumor cells. Here, we show that the ubiquitously expressed transcription factor Sp1 binds RNA in vivo and is a common repressor of distal poly(A) site usage. RNA sequencing identified 2,344 genes (36% of the total mapped mRNA transcripts) with lengthened 3' UTRs upon Sp1 depletion. Sp1 preferentially binds the 3' UTRs of such lengthened transcripts and inhibits cleavage at distal sites by interacting with the subunits of the core cleavage and polyadenylation (CPA) machinery. The 3' UTR lengths of Sp1 target genes in breast cancer patient RNA-seq data correlate with Sp1 expression levels, implicating Sp1-mediated APA regulation in modulating tumorigenic properties. Taken together, our findings provide insights into the mechanism for dynamic APA regulation by unraveling a previously unknown function of the DNA-binding transcription factor Sp1.
Asunto(s)
Poli A , Poliadenilación , Regiones no Traducidas 3' , Humanos , Poli A/metabolismo , ARN Mensajero/metabolismo , Factor de Transcripción Sp1/genética , Factor de Transcripción Sp1/metabolismo , Zinc/metabolismoRESUMEN
Understanding the regulatory interactions that control gene expression during the development of novel tissues is a key goal of evolutionary developmental biology. Here, we show that Mbnl3 has undergone a striking process of evolutionary specialization in eutherian mammals resulting in the emergence of a novel placental function for the gene. Mbnl3 belongs to a family of RNA-binding proteins whose members regulate multiple aspects of RNA metabolism. We find that, in eutherians, while both Mbnl3 and its paralog Mbnl2 are strongly expressed in placenta, Mbnl3 expression has been lost from nonplacental tissues in association with the evolution of a novel promoter. Moreover, Mbnl3 has undergone accelerated protein sequence evolution leading to changes in its RNA-binding specificities and cellular localization. While Mbnl2 and Mbnl3 share partially redundant roles in regulating alternative splicing, polyadenylation site usage and, in turn, placenta maturation, Mbnl3 has also acquired novel biological functions. Specifically, Mbnl3 knockout (M3KO) alone results in increased placental growth associated with higher Myc expression. Furthermore, Mbnl3 loss increases fetal resource allocation during limiting conditions, suggesting that location of Mbnl3 on the X chromosome has led to its role in limiting placental growth, favoring the maternal side of the parental genetic conflict.
Asunto(s)
Placenta , Proteínas de Unión al ARN , Empalme Alternativo/genética , Animales , Euterios/genética , Femenino , Placenta/metabolismo , Embarazo , ARN/metabolismo , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismoRESUMEN
Networks of coordinated alternative splicing (AS) events play critical roles in development and disease. However, a comprehensive knowledge of the factors that regulate these networks is lacking. We describe a high-throughput system for systematically linking trans-acting factors to endogenous RNA regulatory events. Using this system, we identify hundreds of factors associated with diverse regulatory layers that positively or negatively control AS events linked to cell fate. Remarkably, more than one-third of the regulators are transcription factors. Further analyses of the zinc finger protein Zfp871 and BTB/POZ domain transcription factor Nacc1, which regulate neural and stem cell AS programs, respectively, reveal roles in controlling the expression of specific splicing regulators. Surprisingly, these proteins also appear to regulate target AS programs via binding RNA. Our results thus uncover a large "missing cache" of splicing regulators among annotated transcription factors, some of which dually regulate AS through direct and indirect mechanisms.
Asunto(s)
Empalme Alternativo , Redes Reguladoras de Genes , Análisis de Secuencia de ARN/métodos , Factores de Transcripción/metabolismo , Animales , Línea Celular , Células Madre Embrionarias/citología , Células Madre Embrionarias/metabolismo , Células HEK293 , Humanos , Ratones , Neuronas/citología , Neuronas/metabolismo , ARN Mensajero/genéticaRESUMEN
Modelling both primary sequence and secondary structure preferences for RNA binding proteins (RBPs) remains an ongoing challenge. Current models use varied RNA structure representations and can be difficult to interpret and evaluate. To address these issues, we present a universal RNA motif-finding/scanning strategy, termed PRIESSTESS (Predictive RBP-RNA InterpretablE Sequence-Structure moTif regrESSion), that can be applied to diverse RNA binding datasets. PRIESSTESS identifies dozens of enriched RNA sequence and/or structure motifs that are subsequently reduced to a set of core motifs by logistic regression with LASSO regularization. Importantly, these core motifs are easily visualized and interpreted, and provide a measure of RBP secondary structure specificity. We used PRIESSTESS to interrogate new HTR-SELEX data for 23 RBPs with diverse RNA binding modes and captured known primary sequence and secondary structure preferences for each. Moreover, when applying PRIESSTESS to 144 RBPs across 202 RNA binding datasets, 75% showed an RNA secondary structure preference but only 10% had a preference besides unpaired bases, suggesting that most RBPs simply recognize the accessibility of primary sequences.
Asunto(s)
Algoritmos , Proteínas de Unión al ARN , Sitios de Unión , Proteínas de Unión al ARN/metabolismo , Motivos de Nucleótidos , ARN/química , Unión ProteicaRESUMEN
Alternative splicing is important for the development and function of the nervous system, but little is known about the differences in alternative splicing between distinct types of neurons. Furthermore, the factors that control cell-type-specific splicing and the physiological roles of these alternative isoforms are unclear. By monitoring alternative splicing at single-cell resolution in Caenorhabditis elegans, we demonstrate that splicing patterns in different neurons are often distinct and highly regulated. We identify two conserved RNA-binding proteins, UNC-75/CELF and EXC-7/Hu/ELAV, which regulate overlapping networks of splicing events in GABAergic and cholinergic neurons. We use the UNC-75 exon network to discover regulators of synaptic transmission and to identify unique roles for isoforms of UNC-64/Syntaxin, a protein required for synaptic vesicle fusion. Our results indicate that combinatorial regulation of alternative splicing in distinct neurons provides a mechanism to specialize metazoan nervous systems.
Asunto(s)
Empalme Alternativo/genética , Proteínas de Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/fisiología , Caenorhabditis elegans/genética , Neuronas Colinérgicas/citología , Neuronas GABAérgicas/citología , Proteínas de Unión al ARN/fisiología , Transmisión Sináptica/genética , Sintaxina 1/genética , Animales , Neuronas Colinérgicas/metabolismo , Neuronas GABAérgicas/metabolismo , Mutación , Sistema Nervioso/embriología , Sistema Nervioso/crecimiento & desarrollo , Isoformas de Proteínas/genética , Proteínas de Unión al ARN/genética , Transmisión Sináptica/fisiología , Vesículas Sinápticas/metabolismoRESUMEN
Stress hormones bind and activate the glucocorticoid receptor (GR) in many tissues including the brain. We identified arginine and glutamate rich 1 (ARGLU1) in a screen for new modulators of glucocorticoid signaling in the CNS. Biochemical studies show that the glutamate rich C-terminus of ARGLU1 coactivates multiple nuclear receptors including the glucocorticoid receptor (GR) and the arginine rich N-terminus interacts with splicing factors and binds to RNA. RNA-seq of neural cells depleted of ARGLU1 revealed significant changes in the expression and alternative splicing of distinct genes involved in neurogenesis. Loss of ARGLU1 is embryonic lethal in mice, and knockdown in zebrafish causes neurodevelopmental and heart defects. Treatment with dexamethasone, a GR activator, also induces changes in the pattern of alternatively spliced genes, many of which were lost when ARGLU1 was absent. Importantly, the genes found to be alternatively spliced in response to glucocorticoid treatment were distinct from those under transcriptional control by GR, suggesting an additional mechanism of glucocorticoid action is present in neural cells. Our results thus show that ARGLU1 is a novel factor for embryonic development that modulates basal transcription and alternative splicing in neural cells with consequences for glucocorticoid signaling.
Asunto(s)
Desarrollo Embrionario , Glucocorticoides/farmacología , Péptidos y Proteínas de Señalización Intracelular/fisiología , Empalme del ARN/genética , Activación Transcripcional/genética , Empalme Alternativo/efectos de los fármacos , Empalme Alternativo/genética , Animales , Animales Modificados Genéticamente , Células Cultivadas , Embrión no Mamífero , Desarrollo Embrionario/efectos de los fármacos , Desarrollo Embrionario/genética , Glucocorticoides/metabolismo , Células HEK293 , Humanos , Ratones , Ratones Endogámicos C57BL , Neurogénesis/efectos de los fármacos , Neurogénesis/genética , Empalme del ARN/efectos de los fármacos , Transducción de Señal/efectos de los fármacos , Transducción de Señal/genética , Estrés Fisiológico/efectos de los fármacos , Estrés Fisiológico/genética , Transactivadores/fisiología , Activación Transcripcional/efectos de los fármacos , Pez CebraRESUMEN
RNA-binding proteins are key regulators of gene expression, yet only a small fraction have been functionally characterized. Here we report a systematic analysis of the RNA motifs recognized by RNA-binding proteins, encompassing 205 distinct genes from 24 diverse eukaryotes. The sequence specificities of RNA-binding proteins display deep evolutionary conservation, and the recognition preferences for a large fraction of metazoan RNA-binding proteins can thus be inferred from their RNA-binding domain sequence. The motifs that we identify in vitro correlate well with in vivo RNA-binding data. Moreover, we can associate them with distinct functional roles in diverse types of post-transcriptional regulation, enabling new insights into the functions of RNA-binding proteins both in normal physiology and in human disease. These data provide an unprecedented overview of RNA-binding proteins and their targets, and constitute an invaluable resource for determining post-transcriptional regulatory mechanisms in eukaryotes.
Asunto(s)
Regulación de la Expresión Génica/genética , Motivos de Nucleótidos/genética , Proteínas de Unión al ARN/metabolismo , Trastorno Autístico/genética , Secuencia de Bases , Sitios de Unión/genética , Secuencia Conservada/genética , Células Eucariotas/metabolismo , Humanos , Datos de Secuencia Molecular , Estructura Terciaria de Proteína/genética , Factores de Empalme de ARN , Estabilidad del ARN/genética , Proteínas de Unión al ARN/química , Proteínas de Unión al ARN/genéticaRESUMEN
Neurogenesis requires the concerted action of numerous genes that are regulated at multiple levels. However, how different layers of gene regulation are coordinated to promote neurogenesis is not well understood. We show that the neural-specific Ser/Arg repeat-related protein of 100 kDa (nSR100/SRRM4) negatively regulates REST (NRSF), a transcriptional repressor of genes required for neurogenesis. nSR100 directly promotes alternative splicing of REST transcripts to produce a REST isoform (REST4) with greatly reduced repressive activity, thereby activating expression of REST targets in neural cells. Conversely, REST directly represses nSR100 in nonneural cells to prevent the activation of neural-specific splicing events. Consistent with a critical role for nSR100 in the inhibition of REST activity, blocking nSR100 expression in the developing mouse brain impairs neurogenesis. Our results thus reveal a fundamental role for direct regulatory interactions between a splicing activator and transcription repressor in the control of the multilayered regulatory programs required for neurogenesis.
Asunto(s)
Empalme Alternativo , Neurogénesis , Factores de Transcripción/genética , Animales , Células Cultivadas , Ratones , Ratones Endogámicos , Proteínas del Tejido Nervioso/genética , Proteínas del Tejido Nervioso/metabolismo , Neuronas/citología , Neuronas/metabolismo , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Empalme del ARN , Proteínas Represoras/genética , Proteínas Represoras/metabolismo , Factores de Transcripción/metabolismo , Transcripción GenéticaRESUMEN
RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3Î intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3Î site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping.
Asunto(s)
Proteínas de Unión al ARN/fisiología , Autoantígenos , Secuencia de Bases , Sitios de Unión , Exones , Células HEK293 , Humanos , Unión Proteica , Empalme del ARN , ARN Mensajero/química , ARN Mensajero/metabolismo , Proteínas de Unión al ARN/química , Dedos de ZincRESUMEN
RNA-binding proteins (RBPs) participate in diverse cellular processes and have important roles in human development and disease. The human genome, and that of many other eukaryotes, encodes hundreds of RBPs that contain canonical sequence-specific RNA-binding domains (RBDs) as well as numerous other unconventional RNA binding proteins (ucRBPs). ucRBPs physically associate with RNA but lack common RBDs. The degree to which these proteins bind RNA, in a sequence specific manner, is unknown. Here, we provide a detailed description of both the laboratory and data processing methods for RNAcompete, a method we have previously used to analyze the RNA binding preferences of hundreds of RBD-containing RBPs, from diverse eukaryotes. We also determine the RNA-binding preferences for two human ucRBPs, NUDT21 and CNBP, and use this analysis to exemplify the RNAcompete pipeline. The results of our RNAcompete experiments are consistent with independent RNA-binding data for these proteins and demonstrate the utility of RNAcompete for analyzing the growing repertoire of ucRBPs.
Asunto(s)
Factor de Especificidad de Desdoblamiento y Poliadenilación/genética , Análisis por Micromatrices/métodos , Proteínas de Unión al ARN/genética , ARN/química , Animales , Secuencia de Bases , Sitios de Unión , Factor de Especificidad de Desdoblamiento y Poliadenilación/metabolismo , Clonación Molecular , Cartilla de ADN/química , Cartilla de ADN/metabolismo , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo , Expresión Génica , Humanos , Unión Proteica , Dominios Proteicos , ARN/genética , ARN/metabolismo , Proteínas de Unión al ARN/metabolismo , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Alineación de SecuenciaRESUMEN
RNA-binding proteins recognize RNA sequences and structures, but there is currently no systematic and accurate method to derive large (>12base) motifs de novo that reflect a combination of intrinsic preference to both sequence and structure. To address this absence, we introduce RNAcompete-S, which couples a single-step competitive binding reaction with an excess of random RNA 40-mers to a custom computational pipeline for interrogation of the bound RNA sequences and derivation of SSMs (Sequence and Structure Models). RNAcompete-S confirms that HuR, QKI, and SRSF1 prefer binding sites that are single stranded, and recapitulates known 8-10bp sequence and structure preferences for Vts1p and RBMY. We also derive an 18-base long SSM for Drosophila SLBP, which to our knowledge has not been previously determined by selections from pure random sequence, and accurately discriminates human replication-dependent histone mRNAs. Thus, RNAcompete-S enables accurate identification of large, intrinsic sequence-structure specificities with a uniform assay.
Asunto(s)
Secuencia de Bases/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Proteínas de Unión al ARN/genética , Humanos , Proteínas de Unión al ARN/química , Análisis de Secuencia de ARN/métodosRESUMEN
It is commonly known that mammalian microRNAs (miRNAs) guide the RNA-induced silencing complex (RISC) to target mRNAs through the seed-pairing rule. However, recent experiments that coimmunoprecipitate the Argonaute proteins (AGOs), the central catalytic component of RISC, have consistently revealed extensive AGO-associated mRNAs that lack seed complementarity with miRNAs. We herein test the hypothesis that AGO has its own binding preference within target mRNAs, independent of guide miRNAs. By systematically analyzing the data from in vivo cross-linking experiments with human AGOs, we have identified a structurally accessible and evolutionarily conserved region (â¼10 nucleotides in length) that alone can accurately predict AGO-mRNA associations, independent of the presence of miRNA binding sites. Within this region, we further identified an enriched motif that was replicable on independent AGO-immunoprecipitation data sets. We used RNAcompete to enumerate the RNA-binding preference of human AGO2 to all possible 7-mer RNA sequences and validated the AGO motif in vitro. These findings reveal a novel function of AGOs as sequence-specific RNA-binding proteins, which may aid miRNAs in recognizing their targets with high specificity.
Asunto(s)
Proteínas Argonautas/metabolismo , ARN Mensajero/metabolismo , Proteínas Argonautas/química , Proteínas Argonautas/genética , Secuencia de Bases , Sitios de Unión , Humanos , MicroARNs/genética , MicroARNs/metabolismo , Datos de Secuencia Molecular , Motivos de Nucleótidos , Unión Proteica , ARN Mensajero/química , ARN Mensajero/genéticaRESUMEN
The human proteome contains a plethora of short linear motifs (SLiMs) that serve as binding interfaces for modular protein domains. Such interactions are crucial for signaling and other cellular processes, but are difficult to detect because of their low to moderate affinities. Here we developed a dedicated approach, proteomic peptide-phage display (ProP-PD), to identify domain-SLiM interactions. Specifically, we generated phage libraries containing all human and viral C-terminal peptides using custom oligonucleotide microarrays. With these libraries we screened the nine PSD-95/Dlg/ZO-1 (PDZ) domains of human Densin-180, Erbin, Scribble, and Disks large homolog 1 for peptide ligands. We identified several known and putative interactions potentially relevant to cellular signaling pathways and confirmed interactions between full-length Scribble and the target proteins ß-PIX, plakophilin-4, and guanylate cyclase soluble subunit α-2 using colocalization and coimmunoprecipitation experiments. The affinities of recombinant Scribble PDZ domains and the synthetic peptides representing the C termini of these proteins were in the 1- to 40-µM range. Furthermore, we identified several well-established host-virus protein-protein interactions, and confirmed that PDZ domains of Scribble interact with the C terminus of Tax-1 of human T-cell leukemia virus with micromolar affinity. Previously unknown putative viral protein ligands for the PDZ domains of Scribble and Erbin were also identified. Thus, we demonstrate that our ProP-PD libraries are useful tools for probing PDZ domain interactions. The method can be extended to interrogate all potential eukaryotic, bacterial, and viral SLiMs and we suggest it will be a highly valuable approach for studying cellular and pathogen-host protein-protein interactions.
Asunto(s)
Bacteriófago M13/genética , Dominios PDZ/genética , Dominios PDZ/fisiología , Biblioteca de Péptidos , Mapeo de Interacción de Proteínas/métodos , Proteómica/métodos , Biología Computacional , Cartilla de ADN/genética , Humanos , Análisis por MicromatricesRESUMEN
RNA-binding proteins (RBPs) are key regulators of gene expression. Here, we introduce EuPRI (Eukaryotic Protein-RNA Interactions) - a freely available resource of RNA motifs for 34,736 RBPs from 690 eukaryotes. EuPRI includes in vitro binding data for 504 RBPs, including newly collected RNAcompete data for 174 RBPs, along with thousands of reconstructed motifs. We reconstruct these motifs with a new computational platform - Joint Protein-Ligand Embedding (JPLE) - which can detect distant homology relationships and map specificity-determining peptides. EuPRI quadruples the number of known RBP motifs, expanding the motif repertoire across all major eukaryotic clades, and assigning motifs to the majority of human RBPs. EuPRI drastically improves knowledge of RBP motifs in flowering plants. For example, it increases the number of Arabidopsis thaliana RBP motifs 7-fold, from 14 to 105. EuPRI also has broad utility for inferring post-transcriptional function and evolutionary relationships. We demonstrate this by predicting a role for 12 Arabidopsis thaliana RBPs in RNA stability and identifying rapid and recent evolution of post-transcriptional regulatory networks in worms and plants. In contrast, the vertebrate RNA motif set has remained relatively stable after its drastic expansion between the metazoan and vertebrate ancestors. EuPRI represents a powerful resource for the study of gene regulation across eukaryotes.
RESUMEN
Tiling microarrays have proven to be a valuable tool for gaining insights into the transcriptomes of microbial organisms grown under various nutritional or stress conditions. Here, we describe the use of such an array, constructed at the level of 20 nt resolution for the Escherichia coli MG1655 genome, to observe genome-wide changes in the steady-state RNA levels in mutants defective in either RNase E or RNase III. The array data were validated by comparison to previously published results for a variety of specific transcripts as well as independent northern analysis of additional mRNAs and sRNAs. In the absence of RNase E, 60% of the annotated coding sequences showed either increases or decreases in their steady-state levels. In contrast, only 12% of the coding sequences were affected in the absence of RNase III. Unexpectedly, many coding sequences showed decreased abundance in the RNase E mutant, while more than half of the annotated sRNAs showed changes in abundance. Furthermore, the steady-state levels of many transcripts showed overlapping effects of both ribonucleases. Data are also presented demonstrating how the arrays were used to identify potential new genes, RNase III cleavage sites and the direct or indirect control of specific biological pathways.
Asunto(s)
Endorribonucleasas/metabolismo , Escherichia coli/enzimología , Escherichia coli/genética , Ribonucleasa III/metabolismo , Cisteína/biosíntesis , Endorribonucleasas/genética , Escherichia coli/metabolismo , Eliminación de Gen , Perfilación de la Expresión Génica , Genes Bacterianos , Genoma Bacteriano , Análisis de Secuencia por Matrices de Oligonucleótidos , ARN Mensajero/metabolismo , ARN Pequeño no Traducido/análisis , Ribonucleasa III/genéticaRESUMEN
Thousands of RNA-binding proteins (RBPs) crosslink to cellular mRNA. Among these are numerous unconventional RBPs (ucRBPs)-proteins that associate with RNA but lack known RNA-binding domains (RBDs). The vast majority of ucRBPs have uncharacterized RNA-binding specificities. We analyzed 492 human ucRBPs for intrinsic RNA-binding in vitro and identified 23 that bind specific RNA sequences. Most (17/23), including 8 ribosomal proteins, were previously associated with RNA-related function. We identified the RBDs responsible for sequence-specific RNA-binding for several of these 23 ucRBPs and surveyed whether corresponding domains from homologous proteins also display RNA sequence specificity. CCHC-zf domains from seven human proteins recognized specific RNA motifs, indicating that this is a major class of RBD. For Nudix, HABP4, TPR, RanBP2-zf, and L7Ae domains, however, only isolated members or closely related homologs yielded motifs, consistent with RNA-binding as a derived function. The lack of sequence specificity for most ucRBPs is striking, and we suggest that many may function analogously to chromatin factors, which often crosslink efficiently to cellular DNA, presumably via indirect recruitment. Finally, we show that ucRBPs tend to be highly abundant proteins and suggest their identification in RNA interactome capture studies could also result from weak nonspecific interactions with RNA.
Asunto(s)
Proteínas de Unión al ARN , ARN , Humanos , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismo , ARN/metabolismo , Proteínas Ribosómicas/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , Motivos de Unión al ARN/genética , Unión Proteica , Factores Reguladores Miogénicos/metabolismoRESUMEN
Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequence associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.
Asunto(s)
Secuencia de Aminoácidos , Sitios de Unión , Conformación Proteica , Proteínas de Unión al ARN , Algoritmos , Secuencias de Aminoácidos , Área Bajo la Curva , Secuencia de Bases , Bases de Datos de Proteínas , Modelos Genéticos , Modelos Estadísticos , Conformación de Ácido Nucleico , Unión Proteica , ARN Mensajero/química , ARN Mensajero/metabolismo , Proteínas de Unión al ARN/química , Proteínas de Unión al ARN/genética , Proteínas de Unión al ARN/metabolismoRESUMEN
BACKGROUND: RNA-binding proteins (RBPs) function as master regulators of gene expression. Alterations in RBP expression and function are often observed in cancer and influence critical pathways implicated in tumor initiation and growth. Identification and characterization of oncogenic RBPs and their regulatory networks provide new opportunities for targeted therapy. RESULTS: We identify the RNA-binding protein SERBP1 as a novel regulator of glioblastoma (GBM) development. High SERBP1 expression is prevalent in GBMs and correlates with poor patient survival and poor response to chemo- and radiotherapy. SERBP1 knockdown causes delay in tumor growth and impacts cancer-relevant phenotypes in GBM and glioma stem cell lines. RNAcompete identifies a GC-rich region as SERBP1-binding motif; subsequent genomic and functional analyses establish SERBP1 regulation role in metabolic routes preferentially used by cancer cells. An important consequence of these functions is SERBP1 impact on methionine production. SERBP1 knockdown decreases methionine levels causing a subsequent reduction in histone methylation as shown for H3K27me3 and upregulation of genes associated with neurogenesis, neuronal differentiation, and function. Further analysis demonstrates that several of these genes are downregulated in GBM, potentially through epigenetic silencing as indicated by the presence of H3K27me3 sites. CONCLUSIONS: SERBP1 is the first example of an RNA-binding protein functioning as a central regulator of cancer metabolism and indirect modulator of epigenetic regulation in GBM. By bridging these two processes, SERBP1 enhances glioma stem cell phenotypes and contributes to GBM poorly differentiated state.
Asunto(s)
Neoplasias Encefálicas/metabolismo , Glioblastoma/metabolismo , Proteínas de Unión al ARN/metabolismo , Animales , Neoplasias Encefálicas/etiología , Neoplasias Encefálicas/mortalidad , Neoplasias Encefálicas/terapia , Epigénesis Genética , Femenino , Glioblastoma/etiología , Glioblastoma/mortalidad , Glioblastoma/terapia , Humanos , Masculino , Ratones , Neurogénesis , Fenotipo , Pronóstico , Estados Unidos/epidemiologíaRESUMEN
Proper regulation of germline gene expression is essential for fertility and maintaining species integrity. In the C. elegans germline, a diverse repertoire of regulatory pathways promote the expression of endogenous germline genes and limit the expression of deleterious transcripts to maintain genome homeostasis. Here we show that the conserved TRIM-NHL protein, NHL-2, plays an essential role in the C. elegans germline, modulating germline chromatin and meiotic chromosome organization. We uncover a role for NHL-2 as a co-factor in both positively (CSR-1) and negatively (HRDE-1) acting germline 22G-small RNA pathways and the somatic nuclear RNAi pathway. Furthermore, we demonstrate that NHL-2 is a bona fide RNA binding protein and, along with RNA-seq data point to a small RNA independent role for NHL-2 in regulating transcripts at the level of RNA stability. Collectively, our data implicate NHL-2 as an essential hub of gene regulatory activity in both the germline and soma.