ABSTRACT
mRNAs interact with RNA-binding proteins (RBPs) throughout their processing and maturation. While efforts have assigned RBPs to RNA substrates, less exploration has leveraged protein-protein interactions (PPIs) to study proteins in mRNA life-cycle stages. We generated an RNA-aware, RBP-centric PPI map across the mRNA life cycle in human cells by immunopurification-mass spectrometry (IP-MS) of ~100 endogenous RBPs with and without RNase, augmented by size exclusion chromatography-mass spectrometry (SEC-MS). We identify 8,742 known and 20,802 unreported interactions between 1,125 proteins and determine that 73% of the IP-MS-identified interactions are RNA regulated. Our interactome links many proteins, some with unknown functions, to specific mRNA life-cycle stages, with nearly half associated with multiple stages. We demonstrate the value of this resource by characterizing the splicing and export functions of enhancer of rudimentary homolog (ERH), and by showing that small nuclear ribonucleoprotein U5 subunit 200 (SNRNP200) interacts with stress granule proteins and binds cytoplasmic RNA differently during stress.
ABSTRACT
RNA-binding proteins (RBPs) control RNA metabolism to orchestrate gene expression and, when dysfunctional, underlie human diseases. Proteome-wide discovery efforts predict thousands of RBP candidates, many of which lack canonical RNA-binding domains (RBDs). Here, we present a hybrid ensemble RBP classifier (HydRA), which leverages information from both intermolecular protein interactions and internal protein sequence patterns to predict RNA-binding capacity with unparalleled specificity and sensitivity using support vector machines (SVMs), convolutional neural networks (CNNs), and Transformer-based protein language models. Occlusion mapping by HydRA robustly detects known RBDs and predicts hundreds of uncharacterized RNA-binding associated domains. Enhanced CLIP (eCLIP) for HydRA-predicted RBP candidates reveals transcriptome-wide RNA targets and confirms RNA-binding activity for HydRA-predicted RNA-binding associated domains. HydRA accelerates construction of a comprehensive RBP catalog and expands the diversity of RNA-binding associated domains.
Subject(s)
Deep Learning, Hydra, Animals, Humans, RNA/metabolism, Protein Binding, Binding Sites/genetics, Hydra/genetics, Hydra/metabolism
ABSTRACT
Messenger RNAs (mRNAs) interact with RNA-binding proteins (RBPs) in diverse ribonucleoprotein complexes (RNPs) at distinct life-cycle stages during their processing and maturation. While substantial attention has focused on understanding RNA regulation by assigning proteins, particularly RBPs, to specific RNA substrates, far less work has used protein-protein interaction (PPI) methodologies to identify and study the roles of proteins in mRNA life-cycle stages. To address this gap, we generated an RNA-aware, RBP-centric PPI map across the mRNA life cycle by immunopurification-mass spectrometry (IP-MS) of ~100 endogenous RBPs in the presence or absence of RNase, augmented by size exclusion chromatography-mass spectrometry (SEC-MS). In addition to confirming 8,700 known interactions and discovering 20,359 novel interactions between 1,125 proteins, we determined that 73% of our IP-MS interactions are regulated by the presence of RNA. Our PPI data enable us to link proteins to life-cycle stage functions, highlighting that nearly half of the proteins participate in at least two distinct stages. We show that one of the most highly interconnected proteins, ERH, engages in multiple RNA processes, including through interactions with nuclear speckles and the mRNA export machinery. We also demonstrate that the spliceosomal protein SNRNP200 participates in distinct stress granule-associated RNPs and occupies different RNA target regions in the cytoplasm during stress. Our comprehensive RBP-focused PPI network is a novel resource for identifying multi-stage RBPs and exploring RBP complexes in RNA maturation.