RESUMO
Many proteins regulate the expression of genes by binding to specific regions encoded in the genome1. Here we introduce a new data set of RNA elements in the human genome that are recognized by RNA-binding proteins (RBPs), generated as part of the Encyclopedia of DNA Elements (ENCODE) project phase III. This class of regulatory elements functions only when transcribed into RNA, as they serve as the binding sites for RBPs that control post-transcriptional processes such as splicing, cleavage and polyadenylation, and the editing, localization, stability and translation of mRNAs. We describe the mapping and characterization of RNA elements recognized by a large collection of human RBPs in K562 and HepG2 cells. Integrative analyses using five assays identify RBP binding sites on RNA and chromatin in vivo, the in vitro binding preferences of RBPs, the function of RBP binding sites and the subcellular localization of RBPs, producing 1,223 replicated data sets for 356 RBPs. We describe the spectrum of RBP binding throughout the transcriptome and the connections between these interactions and various aspects of RNA biology, including RNA stability, splicing regulation and RNA localization. These data expand the catalogue of functional elements encoded in the human genome by the addition of a large set of elements that function at the RNA level by interacting with RBPs.
Assuntos
Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/metabolismo , Transcriptoma/genética , Processamento Alternativo/genética , Sequência de Bases , Sítios de Ligação , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Bases de Dados Genéticas , Feminino , Técnicas de Silenciamento de Genes , Humanos , Espaço Intracelular/genética , Masculino , Ligação Proteica , RNA Mensageiro/química , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Proteínas de Ligação a RNA/genética , Especificidade por SubstratoRESUMO
RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif.
Assuntos
Proteínas com Motivo de Reconhecimento de RNA/metabolismo , RNA/metabolismo , Sequência de Bases , Sítios de Ligação , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Conformação de Ácido Nucleico , Motivos de Nucleotídeos , Ligação Proteica , RNA/química , RNA/genética , Proteínas com Motivo de Reconhecimento de RNA/química , Proteínas com Motivo de Reconhecimento de RNA/genética , Relação Estrutura-AtividadeRESUMO
RNA metabolism is controlled by an expanding, yet incomplete, catalog of RNA-binding proteins (RBPs), many of which lack characterized RNA binding domains. Approaches to expand the RBP repertoire to discover non-canonical RBPs are currently needed. Here, HaloTag fusion pull down of 12 nuclear and cytoplasmic RBPs followed by quantitative mass spectrometry (MS) demonstrates that proteins interacting with multiple RBPs in an RNA-dependent manner are enriched for RBPs. This motivated SONAR, a computational approach that predicts RNA binding activity by analyzing large-scale affinity precipitation-MS protein-protein interactomes. Without relying on sequence or structure information, SONAR identifies 1,923 human, 489 fly, and 745 yeast RBPs, including over 100 human candidate RBPs that contain zinc finger domains. Enhanced CLIP confirms RNA binding activity and identifies transcriptome-wide RNA binding sites for SONAR-predicted RBPs, revealing unexpected RNA binding activity for disease-relevant proteins and DNA binding proteins.
Assuntos
Algoritmos , Anotação de Sequência Molecular , Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/classificação , RNA/química , Animais , Sítios de Ligação , Núcleo Celular/química , Núcleo Celular/metabolismo , Citoplasma/química , Citoplasma/metabolismo , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Expressão Gênica , Ontologia Genética , Células HEK293 , Humanos , Motivos de Nucleotídeos , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , RNA/genética , RNA/metabolismo , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Software , Dedos de ZincoRESUMO
Transcriptome-wide maps of RNA binding protein (RBP)-RNA interactions by immunoprecipitation (IP)-based methods such as RNA IP (RIP) and crosslinking and IP (CLIP) are key starting points for evaluating the molecular roles of the thousands of human RBPs. A significant bottleneck to the application of these methods in diverse cell lines, tissues, and developmental stages is the availability of validated IP-quality antibodies. Using IP followed by immunoblot assays, we have developed a validated repository of 438 commercially available antibodies that interrogate 365 unique RBPs. In parallel, 362 short-hairpin RNA (shRNA) constructs against 276 unique RBPs were also used to confirm specificity of these antibodies. These antibodies can characterize subcellular RBP localization. With the burgeoning interest in the roles of RBPs in cancer, neurobiology, and development, these resources are invaluable to the broad scientific community. Detailed information about these resources is publicly available at the ENCODE portal (https://www.encodeproject.org/).
Assuntos
Bases de Dados Genéticas , Proteínas de Ligação a RNA/genética , RNA/metabolismo , Transcriptoma/genética , Sítios de Ligação , Humanos , Ligação Proteica , RNA/genética , RNA Interferente Pequeno/classificação , RNA Interferente Pequeno/genética , Proteínas de Ligação a RNA/metabolismoRESUMO
RNA quality-control pathways get rid of faulty RNAs and therefore must be able to discriminate these RNAs from those that are normal. Here we present evidence that the adenosine triphosphatase (ATPase) cycle of the SF1 helicase Upf1 is required for mRNA discrimination during nonsense-mediated decay (NMD). Mutations affecting the Upf1 ATPase cycle disrupt the mRNA selectivity of Upf1, leading to indiscriminate accumulation of NMD complexes on both NMD target and non-target mRNAs. In addition, two modulators of NMD-translation and termination codon-proximal poly(A) binding protein-depend on the ATPase activity of Upf1 to limit Upf1-non-target association. Preferential ATPase-dependent dissociation of Upf1 from non-target mRNAs in vitro suggests that selective release of Upf1 contributes to the ATPase dependence of Upf1 target discrimination. Given the prevalence of helicases in RNA regulation, ATP hydrolysis may be a widely used activity in target RNA discrimination.
Assuntos
Trifosfato de Adenosina/metabolismo , Degradação do RNAm Mediada por Códon sem Sentido , RNA Mensageiro/metabolismo , Transativadores/genética , Transativadores/metabolismo , Regiões 3' não Traduzidas , Domínio Catalítico , Células HEK293 , Humanos , Técnicas In Vitro , Dados de Sequência Molecular , Mutação , RNA Helicases , RNA Mensageiro/genética , Especificidade por SubstratoRESUMO
Alternative splicing of pre-messenger RNA transcripts enables the generation of multiple protein isoforms from the same gene locus, providing a major source of protein diversity in mammalian genomes. RNA binding proteins (RBPs) bind to RNA to control splice site choice and define which exons are included in the resulting mature RNA transcript. However, depending on where the RBPs bind relative to splice sites, they can activate or repress splice site usage. To explore this position-specific regulation, in vivo binding sites identified by methods such as cross-linking and immunoprecipitation (CLIP) are integrated with alternative splicing events identified by RNA-seq or microarray. Merging these data sets enables the generation of a "splicing map," where CLIP signal relative to a merged meta-exon provides a simple summary of the position-specific effect of binding on splicing regulation. Here, we provide RBP-Maps, a software tool to simplify generation of these maps and enable researchers to rapidly query regulatory patterns of an RBP of interest. Further, we discuss various alternative approaches to generate such splicing maps, focusing on how decisions in construction (such as the use of peak versus read density, or whole-reads versus only single-nucleotide candidate crosslink positions) can affect the interpretation of these maps using example eCLIP data from the 150 RBPs profiled by the ENCODE consortium.
Assuntos
Processamento Alternativo/genética , Biologia Computacional/métodos , Isoformas de Proteínas/genética , Sítios de Splice de RNA/genética , Proteínas de Ligação a RNA/química , Software , Regulação da Expressão Gênica/genética , Humanos , RNA Mensageiro/genética , Análise de Sequência de RNARESUMO
As RNA-binding proteins (RBPs) play essential roles in cellular physiology by interacting with target RNA molecules, binding site identification by UV crosslinking and immunoprecipitation (CLIP) of ribonucleoprotein complexes is critical to understanding RBP function. However, current CLIP protocols are technically demanding and yield low-complexity libraries with high experimental failure rates. We have developed an enhanced CLIP (eCLIP) protocol that decreases requisite amplification by â¼1,000-fold, decreasing discarded PCR duplicate reads by â¼60% while maintaining single-nucleotide binding resolution. By simplifying the generation of paired IgG and size-matched input controls, eCLIP improves specificity in the discovery of authentic binding sites. We generated 102 eCLIP experiments for 73 diverse RBPs in HepG2 and K562 cells (available at https://www.encodeproject.org), demonstrating that eCLIP enables large-scale and robust profiling, with amplification and sample requirements similar to those of ChIP-seq. eCLIP enables integrative analysis of diverse RBPs to reveal factor-specific profiles, common artifacts for CLIP and RNA-centric perspectives on RBP activity.
Assuntos
Perfilação da Expressão Gênica/métodos , Imunoprecipitação/métodos , Proteínas de Ligação a RNA/genética , Transcriptoma , Sítios de Ligação , Reagentes de Ligações Cruzadas/química , Células Hep G2 , Humanos , Células K562 , Processos Fotoquímicos , Raios UltravioletaRESUMO
Crosslinking and immunoprecipitation (CLIP) followed by high-throughput sequencing identifies the binding sites of RNA binding proteins on RNAs. The covalent RNA-amino acid adducts produced by UV irradiation can cause premature reverse transcription termination and deletions (referred to as crosslink-induced mutation sites (CIMS)), which may decrease overall cDNA yield but are exploited in state-of-the-art CLIP methods to identify these crosslink sites at single-nucleotide resolution. Here, we show the ratio of both crosslinked base deletions and read-through versus termination are highly dependent on the identity of the reverse transcriptase enzyme as well as on buffer conditions used. AffinityScript and TGIRT showed a lack of deletion of the crosslinked base with other enzymes showing variable rates, indicating that utilization and interpretation of CIMS analysis requires knowledge of the reverse transcriptase enzyme used. Commonly used enzymes, including Superscript III and AffinityScript, show high termination rates in standard magnesium buffer conditions, but show a single base difference in the position of termination for TARDBP motifs. In contrast, manganese-containing buffer promoted read-through at the adduct site. These results validate the use of standard enzymes and also propose alternative enzyme and buffer choices for particularly challenging samples that contain extensive RNA adducts or other modifications that inhibit standard reverse transcription.
Assuntos
Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Transcrição Reversa/fisiologia , Análise de Sequência de RNA/métodos , Sequência de Bases/fisiologia , Sítios de Ligação/fisiologia , HumanosRESUMO
Identification of in vivo direct RNA targets for RNA binding proteins (RBPs) provides critical insight into their regulatory activities and mechanisms. Recently, we described a methodology for enhanced crosslinking and immunoprecipitation followed by high-throughput sequencing (eCLIP) using antibodies against endogenous RNA binding proteins. However, in many cases it is desirable to profile targets of an RNA binding protein for which an immunoprecipitation-grade antibody is lacking. Here we describe a scalable method for using CRISPR/Cas9-mediated homologous recombination to insert a peptide tag into the endogenous RNA binding protein locus. Further, we show that TAG-eCLIP performed using tag-specific antibodies can yield the same robust binding profiles after proper control normalization as eCLIP with antibodies against endogenous proteins. Finally, we note that antibodies against commonly used tags can immunoprecipitate significant amounts of antibody-specific RNA, emphasizing the need for paired controls alongside each experiment for normalization. TAG-eCLIP enables eCLIP profiling of new native proteins where no suitable antibody exists, expanding the RBP-RNA interaction landscape.
Assuntos
Sistemas CRISPR-Cas , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Proteínas de Ligação a RNA/genética , RNA/química , Coloração e Rotulagem/métodos , Anticorpos/química , Sequência de Bases , Sítios de Ligação , Clonagem Molecular , Endonucleases/química , Células HEK293 , Recombinação Homóloga , Humanos , Células K562 , Peptídeos/química , Reação em Cadeia da Polimerase , Ligação Proteica , RNA/genética , RNA/metabolismo , Proteínas de Ligação a RNA/metabolismo , Análise de Sequência de RNA/métodos , TranscriptomaRESUMO
Duplex sequencing (DS) is an error-corrected next-generation sequencing method in which molecular barcodes informatically link PCR-copies back to their source DNA strands, enabling computational removal of errors in consensus sequences. The resulting background of less than one artifactual mutation per 107 nucleotides allows for direct detection of somatic mutations. TwinStrand Biosciences, Inc. has developed a DS-based mutagenesis assay to sample the rat genome, which can be applied to genetic toxicity testing. To evaluate this assay for early detection of mutagenesis, a time-course study was conducted using male Hsd:Sprague Dawley SD rats (3 per group) administered a single dose of 40 mg/kg N-ethyl-N-nitrosourea (ENU) via gavage, with mutation frequency (MF) and spectrum analyzed in stomach, bone marrow, blood, and liver tissues at 3 h, 24 h, 7 d, and 28 d post-exposure. Significant increases in MF were observed in ENU-exposed rats as early as 24 h for stomach (site of contact) and bone marrow (a highly proliferative tissue) and at 7 d for liver and blood. The canonical, mutational signature of ENU was established by 7 d post-exposure in all four tissues. Interlaboratory analysis of a subset of samples from different tissues and time points demonstrated remarkable reproducibility for both MF and spectrum. These results demonstrate that MF and spectrum can be evaluated successfully by directly sequencing targeted regions of DNA obtained from various tissuesâ , a considerable advancement compared to currently used in vivo gene mutation assays.
Assuntos
Etilnitrosoureia , Compostos de Nitrosoureia , Ratos , Masculino , Animais , Etilnitrosoureia/toxicidade , Reprodutibilidade dos Testes , Ratos Sprague-Dawley , Mutagênese , Mutação , Mutagênicos/toxicidadeRESUMO
Duplex sequencing (DuplexSeq) is an error-corrected next-generation sequencing (ecNGS) method in which molecular barcodes informatically link PCR-copies back to their source DNA strands, enabling computational removal of errors by comparing grouped strand sequencing reads. The resulting background of less than one artifactual mutation per 10 7 nucleotides allows for direct detection of somatic mutations. TwinStrand Biosciences, Inc. has developed a DuplexSeq-based mutagenesis assay to sample the rat genome, which can be applied to genetic toxicity testing. To evaluate this assay for early detection of mutagenesis, a time-course study was conducted using male Hsd:Sprague Dawley SD rats (3 per group) administered a single dose of 40 mg/kg N-ethyl-N-nitrosourea (ENU) via gavage, with mutation frequency (MF) and spectrum analyzed in stomach, bone marrow, blood, and liver tissues at 3 h, 24 h, 7 d, and 28 d post-exposure. Significant increases in MF were observed in ENU-exposed rats as early as 24 h for stomach (site of contact) and bone marrow (a highly proliferative tissue) and at 7 d for liver and blood. The canonical, mutational signature of ENU was established by 7 d post-exposure in all four tissues. Interlaboratory analysis of a subset of samples from different tissues and time points demonstrated remarkable reproducibility for both MF and spectrum. These results demonstrate that MF and spectrum can be evaluated successfully by directly sequencing targeted regions of DNA obtained from various tissues, a considerable advancement compared to currently used in vivo gene mutation assays. HIGHLIGHTS: DuplexSeq is an ultra-accurate NGS technology that directly quantifies mutationsENU-dependent mutagenesis was detected 24 h post-exposure in proliferative tissuesMultiple tissues exhibited the canonical ENU mutation spectrum 7 d after exposureResults obtained with DuplexSeq were highly concordant between laboratoriesThe Rat-50 Mutagenesis Assay is promising for applications in genetic toxicology.
RESUMO
Discovery of interaction sites between RNA-binding proteins (RBPs) and their RNA targets plays a critical role in enabling our understanding of how these RBPs control RNA processing and regulation. Cross-linking and immunoprecipitation (CLIP) provides a generalizable, transcriptome-wide method by which RBP/RNA complexes are purified and sequenced to identify sites of intermolecular contact. By simplifying technical challenges in prior CLIP methods and incorporating the generation of and quantitative comparison against size-matched input controls, the single-end enhanced CLIP (seCLIP) protocol allows for the profiling of these interactions with high resolution, efficiency and scalability. Here, we present a step-by-step guide to the seCLIP method, detailing critical steps and offering insights regarding troubleshooting and expected results while carrying out the ~4-d protocol. Furthermore, we describe a comprehensive bioinformatics pipeline that offers users the tools necessary to process two replicate datasets and identify reproducible and significant peaks for an RBP of interest in ~2 d.
Assuntos
RNA , Transcriptoma , Sítios de Ligação , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Imunoprecipitação , Ligação Proteica , RNA/genética , Proteínas de Ligação a RNA/metabolismoRESUMO
The molecular functions of the majority of RNA-binding proteins (RBPs) remain unclear, highlighting a major bottleneck to a full understanding of gene expression regulation. Here, we develop a plasmid resource of 690 human RBPs that we subject to luciferase-based 3'-untranslated-region tethered function assays to pinpoint RBPs that regulate RNA stability or translation. Enhanced UV-cross-linking and immunoprecipitation of these RBPs identifies thousands of endogenous mRNA targets that respond to changes in RBP level, recapitulating effects observed in tethered function assays. Among these RBPs, the ubiquitin-associated protein 2-like (UBAP2L) protein interacts with RNA via its RGG domain and cross-links to mRNA and rRNA. Fusion of UBAP2L to RNA-targeting CRISPR-Cas9 demonstrates programmable translational enhancement. Polysome profiling indicates that UBAP2L promotes translation of target mRNAs, particularly global regulators of translation. Our tethering survey allows rapid assignment of the molecular activity of proteins, such as UBAP2L, to specific steps of mRNA metabolism.
Assuntos
Proteínas de Transporte/metabolismo , Biossíntese de Proteínas , Estabilidade de RNA , Proteínas de Ligação a RNA/metabolismo , Regiões 3' não Traduzidas , Sítios de Ligação , Sistemas CRISPR-Cas , Proteínas de Transporte/química , Proteínas de Transporte/genética , Linhagem Celular , Humanos , Luciferases/genética , Luciferases/metabolismo , Fases de Leitura Aberta , Polirribossomos/genética , Polirribossomos/metabolismo , Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/genética , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Raios UltravioletaRESUMO
BACKGROUND: A critical step in uncovering rules of RNA processing is to study the in vivo regulatory networks of RNA binding proteins (RBPs). Crosslinking and immunoprecipitation (CLIP) methods enable mapping RBP targets transcriptome-wide, but methodological differences present challenges to large-scale analysis across datasets. The development of enhanced CLIP (eCLIP) enabled the mapping of targets for 150 RBPs in K562 and HepG2, creating a unique resource of RBP interactomes profiled with a standardized methodology in the same cell types. RESULTS: Our analysis of 223 eCLIP datasets reveals a range of binding modalities, including highly resolved positioning around splicing signals and mRNA untranslated regions that associate with distinct RBP functions. Quantification of enrichment for repetitive and abundant multicopy elements reveals 70% of RBPs have enrichment for non-mRNA element classes, enables identification of novel ribosomal RNA processing factors and sites, and suggests that association with retrotransposable elements reflects multiple RBP mechanisms of action. Analysis of spliceosomal RBPs indicates that eCLIP resolves AQR association after intronic lariat formation, enabling identification of branch points with single-nucleotide resolution, and provides genome-wide validation for a branch point-based scanning model for 3' splice site recognition. Finally, we show that eCLIP peak co-occurrences across RBPs enable the discovery of novel co-interacting RBPs. CONCLUSIONS: This work reveals novel insights into RNA biology by integrated analysis of eCLIP profiling of 150 RBPs with distinct functions. Further, our quantification of both mRNA and other element association will enable further research to identify novel roles of RBPs in regulating RNA processing.
Assuntos
Processamento Pós-Transcricional do RNA , Proteínas de Ligação a RNA/metabolismo , Sítios de Ligação , Células Hep G2 , Humanos , Imunoprecipitação , Íntrons , Células K562 , RNA/metabolismo , Splicing de RNA , RNA Ribossômico/metabolismo , Sequências Repetitivas de Ácido Nucleico , Retroelementos , Spliceossomos/metabolismoRESUMO
Adenosine-to-inosine (A-to-I) editing, mediated by the ADAR enzymes, diversifies the transcriptome by altering RNA sequences. Recent studies reported global changes in RNA editing in disease and development. Such widespread editing variations necessitate an improved understanding of the regulatory mechanisms of RNA editing. Here, we study the roles of >200 RNA-binding proteins (RBPs) in mediating RNA editing in two human cell lines. Using RNA-sequencing and global protein-RNA binding data, we identify a number of RBPs as key regulators of A-to-I editing. These RBPs, such as TDP-43, DROSHA, NF45/90 and Ro60, mediate editing through various mechanisms including regulation of ADAR1 expression, interaction with ADAR1, and binding to Alu elements. We highlight that editing regulation by Ro60 is consistent with the global up-regulation of RNA editing in systemic lupus erythematosus. Additionally, most key editing regulators act in a cell type-specific manner. Together, our work provides insights for the regulatory mechanisms of RNA editing.
Assuntos
Adenosina Desaminase/genética , Adenosina Desaminase/metabolismo , Regulação Neoplásica da Expressão Gênica , Edição de RNA/genética , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Adenosina/genética , Elementos Alu , Autoantígenos/genética , Técnicas de Silenciamento de Genes , Células Hep G2 , Humanos , Inosina/genética , Células K562 , Lúpus Eritematoso Sistêmico/genética , RNA Citoplasmático Pequeno/genética , Ribonucleoproteínas/genética , Análise de Sequência de RNA , Transcrição Gênica , TransfecçãoRESUMO
Allele-specific protein-RNA binding is an essential aspect that may reveal functional genetic variants (GVs) mediating post-transcriptional regulation. Recently, genome-wide detection of in vivo binding of RNA-binding proteins is greatly facilitated by the enhanced crosslinking and immunoprecipitation (eCLIP) method. We developed a new computational approach, called BEAPR, to identify allele-specific binding (ASB) events in eCLIP-Seq data. BEAPR takes into account crosslinking-induced sequence propensity and variations between replicated experiments. Using simulated and actual data, we show that BEAPR largely outperforms often-used count analysis methods. Importantly, BEAPR overcomes the inherent overdispersion problem of these methods. Complemented by experimental validations, we demonstrate that the application of BEAPR to ENCODE eCLIP-Seq data of 154 proteins helps to predict functional GVs that alter splicing or mRNA abundance. Moreover, many GVs with ASB patterns have known disease relevance. Overall, BEAPR is an effective method that helps to address the outstanding challenge of functional interpretation of GVs.
Assuntos
Alelos , Variação Genética , Proteínas de Ligação a RNA/metabolismo , RNA/genética , Regiões 3' não Traduzidas/genética , Motivos de Aminoácidos , Sequência de Bases , Biologia Computacional , Simulação por Computador , Doença/genética , Predisposição Genética para Doença , Células Hep G2 , Humanos , Células K562 , Polimorfismo de Nucleotídeo Único/genética , Ligação Proteica , Locos de Características Quantitativas/genética , RNA Helicases/metabolismo , Splicing de RNA/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reprodutibilidade dos Testes , Transativadores/metabolismoRESUMO
Transcriptomic analyses of postmortem brains have begun to elucidate molecular abnormalities in autism spectrum disorder (ASD). However, a crucial pathway involved in synaptic development, RNA editing, has not yet been studied on a genome-wide scale. Here we profiled global patterns of adenosine-to-inosine (A-to-I) editing in a large cohort of postmortem brains of people with ASD. We observed a global bias for hypoediting in ASD brains, which was shared across brain regions and involved many synaptic genes. We show that the Fragile X proteins FMRP and FXR1P interact with RNA-editing enzymes (ADAR proteins) and modulate A-to-I editing. Furthermore, we observed convergent patterns of RNA-editing alterations in ASD and Fragile X syndrome, establishing this as a molecular link between these related diseases. Our findings, which are corroborated across multiple data sets, including dup15q (genomic duplication of 15q11.2-13.1) cases associated with intellectual disability, highlight RNA-editing dysregulation in ASD and reveal new mechanisms underlying this disorder.
Assuntos
Transtorno Autístico/metabolismo , Encéfalo/metabolismo , Edição de RNA , Adenosina Desaminase/genética , Adenosina Desaminase/metabolismo , Transtorno Autístico/genética , Proteína do X Frágil da Deficiência Intelectual/genética , Proteína do X Frágil da Deficiência Intelectual/metabolismo , Perfilação da Expressão Gênica , Humanos , Neurônios/metabolismo , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismoRESUMO
Profiling of RNA binding protein targets in vivo provides critical insights into the mechanistic roles they play in regulating RNA processing. The enhanced crosslinking and immunoprecipitation (eCLIP) methodology provides a framework for robust, reproducible identification of transcriptome-wide protein-RNA interactions, with dramatically improved efficiency over previous methods. Here we provide a step-by-step description of the eCLIP method, along with insights into optimal performance of critical steps in the protocol. In particular, we describe improvements to the adaptor strategy that enables single-end enhanced CLIP (seCLIP), which removes the requirement for paired-end sequencing of eCLIP libraries. Further, we describe the observation of contaminating RNA present in standard nitrocellulose membrane suppliers, and present options with significantly reduced contamination for sensitive applications. These notes further refine the eCLIP methodology, simplifying robust RNA binding protein studies for all users.
Assuntos
Reagentes de Ligações Cruzadas/química , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Imunoprecipitação/métodos , Proteínas de Ligação a RNA/química , Animais , HumanosRESUMO
Human pluripotent stem cells (hPSCs) require precise control of post-transcriptional RNA networks to maintain proliferation and survival. Using enhanced UV crosslinking and immunoprecipitation (eCLIP), we identify RNA targets of the IMP/IGF2BP family of RNA-binding proteins in hPSCs. At the broad region and binding site levels, IMP1 and IMP2 show reproducible binding to a large and overlapping set of 3' UTR-enriched targets. RNA Bind-N-seq applied to recombinant full-length IMP1 and IMP2 reveals CA-rich motifs that are enriched in eCLIP-defined binding sites. We observe that IMP1 loss in hPSCs recapitulates IMP1 phenotypes, including a reduction in cell adhesion and increase in cell death. For cell adhesion, we find IMP1 maintains levels of integrin mRNA specifically regulating RNA stability of ITGB5 in hPSCs. Additionally, we show that IMP1 can be linked to hPSC survival via direct target BCL2. Thus, transcriptome-wide binding profiles identify hPSC targets modulating well-characterized IMP1 roles.