RESUMEN
RNA interactome studies have revealed that hundreds of zinc-finger proteins (ZFPs) are candidate RNA-binding proteins (RBPs), yet their RNA substrates and functional significance remain largely uncharacterized. Here, we present a systematic multi-omics analysis of the DNA- and RNA-binding targets and regulatory roles of more than 100 ZFPs representing 37 zinc-finger families. We show that multiple ZFPs are previously unknown regulators of RNA splicing, alternative polyadenylation, stability, or translation. The examined ZFPs show widespread sequence-specific RNA binding and preferentially bind proximal to transcription start sites. Additionally, several ZFPs associate with their targets at both the DNA and RNA levels. We highlight ZNF277, a C2H2 ZFP that binds thousands of RNA targets and acts as a multi-functional RBP. We also show that ZNF473 is a DNA/RNA-associated protein that regulates the expression and splicing of cell cycle genes. Our results reveal diverse roles for ZFPs in transcriptional and post-transcriptional gene regulation.
Asunto(s)
Empalme del ARN , Proteínas de Unión al ARN , Dedos de Zinc , Humanos , Proteínas de Unión al ARN/metabolismo , Proteínas de Unión al ARN/genética , Regulación de la Expresión Génica , Proteínas de Unión al ADN/metabolismo , Proteínas de Unión al ADN/genética , ARN/metabolismo , ARN/genética , Unión Proteica , Células HEK293 , Células HeLa , Empalme Alternativo , Sitios de Unión , MultiómicaRESUMEN
RNA-binding proteins (RBPs) regulate key aspects of RNA processing including alternative splicing, mRNA degradation and localization by physically binding RNA molecules. Current methods to map these interactions, such as CLIP, rely on purifying single proteins at a time. Our new method, ePRINT, maps RBP-RNA interaction networks on a global scale without purifying individual RBPs. ePRINT uses exoribonuclease XRN1 to precisely map the 5' end of the RBP binding site and uncovers direct and indirect targets of an RBP of interest. Importantly, ePRINT can also uncover RBPs that are differentially activated between cell fate transitions, including neural progenitor differentiation into neurons.
Asunto(s)
Proteínas de Unión al ARN , Proteínas de Unión al ARN/metabolismo , Sitios de Unión , Exorribonucleasas/metabolismo , Humanos , ARN/metabolismo , Animales , Unión ProteicaRESUMEN
The uncovering of protein-RNA interactions enables a deeper understanding of RNA processing. Recent multiplexed crosslinking and immunoprecipitation (CLIP) technologies such as antibody-barcoded eCLIP (ABC) dramatically increase the throughput of mapping RNA binding protein (RBP) binding sites. However, multiplex CLIP datasets are multivariate, and each RBP suffers non-uniform signal-to-noise ratio. To address this, we developed Mudskipper, a versatile computational suite comprising two components: a Dirichlet multinomial mixture model to account for the multivariate nature of ABC datasets and a softmasking approach that identifies and removes non-specific protein-RNA interactions in RBPs with low signal-to-noise ratio. Mudskipper demonstrates superior precision and recall over existing tools on multiplex datasets and supports analysis of repetitive elements and small non-coding RNAs. Our findings unravel splicing outcomes and variant-associated disruptions, enabling higher-throughput investigations into diseases and regulation mediated by RBPs.
Asunto(s)
Proteínas de Unión al ARN , Proteínas de Unión al ARN/metabolismo , Proteínas de Unión al ARN/genética , Humanos , Inmunoprecipitación/métodos , Sitios de Unión , Programas Informáticos , Biología Computacional/métodos , ARN/metabolismo , ARN/genética , Unión ProteicaRESUMEN
Here, we present a protocol for using Skipper, a pipeline designed to process crosslinking and immunoprecipitation (CLIP) data into annotated binding sites. We describe steps for partitioning annotated transcript regions and fitting data to a beta-binomial model to call windows of enriched binding. From raw CLIP data, we detail how users can map reproducible RNA-binding sites to call enriched windows and perform downstream analysis. This protocol supports optional customizations for different use cases. For complete details on the use and execution of this protocol, please refer to Boyle et al.1.
Asunto(s)
Inmunoprecipitación , Sitios de Unión , Inmunoprecipitación/métodos , Humanos , Programas Informáticos , Reactivos de Enlaces Cruzados/química , ARN/metabolismo , ARN/genéticaRESUMEN
Technology for crosslinking and immunoprecipitation (CLIP) followed by sequencing (CLIP-seq) has identified the transcriptomic targets of hundreds of RNA-binding proteins in cells. To increase the power of existing and future CLIP-seq datasets, we introduce Skipper, an end-to-end workflow that converts unprocessed reads into annotated binding sites using an improved statistical framework. Compared with existing methods, Skipper on average calls 210%-320% more transcriptomic binding sites and sometimes >1,000% more sites, providing deeper insight into post-transcriptional gene regulation. Skipper also calls binding to annotated repetitive elements and identifies bound elements for 99% of enhanced CLIP experiments. We perform nine translation factor enhanced CLIPs and apply Skipper to learn determinants of translation factor occupancy, including transcript region, sequence, and subcellular localization. Furthermore, we observe depletion of genetic variation in occupied sites and nominate transcripts subject to selective constraint because of translation factor occupancy. Skipper offers fast, easy, customizable, and state-of-the-art analysis of CLIP-seq data.