RESUMO
RNA-binding proteins (RBPs) play key roles in a wide range of physiological and pathological processes. To facilitate the investigation of RBP functions and disease associations, we updated the EuRBPDB and renamed it as RBPWorld (http://research.gzsys.org.cn/rbpworld/#/home). Leveraging 998 RNA-binding domains (RBDs) and 87 RNA-binding Proteome (RBPome) datasets, we successfully identified 1 393 413 RBPs from 445 species, including 3030 human RBPs (hRBPs). RBPWorld includes primary RNA targets of diverse hRBPs, as well as potential downstream regulatory pathways and alternative splicing patterns governed by various hRBPs. These insights were derived from analyses of 1515 crosslinking immunoprecipitation-seq datasets and 616 RNA-seq datasets from cells with hRBP gene knockdown or knockout. Furthermore, we systematically identified 929 RBPs with multi-functions, including acting as metabolic enzymes and transcription factors. RBPWorld includes 838 disease-associated hRBPs and 970 hRBPs that interact with 12 disease-causing RNA viruses. This provision allows users to explore the regulatory roles of hRBPs within the context of diseases. Finally, we developed an intuitive interface for RBPWorld, facilitating users easily access all the included data. We believe that RBPWorld will be a valuable resource in advancing our understanding of the biological roles of RBPs across different species.
RESUMO
Complex RNA-RNA interactions underlie fundamental biological processes. However, a large number of RNA-RNA interactions remain unknown. Most existing methods used to map RNA-RNA interactions are based on proximity ligation, but these strategies also capture a huge amount of intramolecular RNA secondary structures, making it almost impossible to detect most RNA-RNA interactions. To overcome this limitation, we developed an efficient, genome-wide method, Capture Interacting RNA and Deep Sequencing (CIRDES) for in vivo capturing of the RNA interactome. We designed multiple 20-nt CIRDES probes tiling the whole RNA sequence of interest. This strategy obtained high selectivity and low background noise proved by qRT-PCR data. CIRDES enriched target RNA and its interacting RNAs from cells crosslinked by formaldehyde in high efficiency. After hybridization and purification, the captured RNAs were converted to the cDNA library after a highly efficient ligation to a 3' end infrared-dye-conjugated RNA adapter based on adapter ligation library construction. Using CIRDES, we detected highly abundant known interacting RNA, as well as a large number of novel targets of U6 snRNA. The enrichment of U4 snRNA, which interacts with U6, confirmed the robustness of the identification of the RNA-RNA interaction by CIRDES. These results suggest that the CIRDES is an efficient strategy for genome-wide RNA-RNA interactome analysis.
Assuntos
Genoma , Sondas RNA/metabolismo , RNA Nuclear Pequeno/metabolismo , Biblioteca Gênica , Células Hep G2 , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Hibridização de Ácido Nucleico , Sondas RNA/genética , RNA Nuclear Pequeno/genética , RNA Nuclear Pequeno/isolamento & purificação , Análise de Sequência de RNARESUMO
Several studies have shown that long non-coding RNAs (lncRNAs) may play an essential role in Epithelial-Mesenchymal Transition (EMT), which is an important step in tumor metastasis; however, little is known about the global change of lncRNA transcriptome during EMT. To investigate how lncRNA transcriptome alterations contribute to EMT progression regulation, we deep-sequenced the whole-transcriptome of MCF10A as the cells underwent TGF-ß-induced EMT. RESULTS: Deep-sequencing results showed that the long RNA transcriptome of MCF10A had undergone global changes as early as 8h after treatment with TGF-ß. The expression of 3403 known and novel lncRNAs, and 570 known and novel circRNAs were altered during EMT. To identify the key lncRNA-regulator, we constructed the co-expression network and found all junction nodes in the network are lncRNAs. One junction node, RP6-65G23.5, was further verified as a key regulator of EMT. Intriguingly, we identified 216 clusters containing lncRNAs which were located in "gene desert" regions. The expressions of all lncRNAs in these clusters changed concurrently during EMT, strongly suggesting that these clusters might play important roles in EMT. Our study reveals a global reprogramming of lncRNAs transcriptome during EMT and provides clues for the future study of the molecular mechanism of EMT.
Assuntos
Neoplasias da Mama/genética , Transição Epitelial-Mesenquimal/genética , Sequenciamento de Nucleotídeos em Larga Escala , RNA Longo não Codificante/biossíntese , Neoplasias da Mama/patologia , Linhagem Celular Tumoral , Reprogramação Celular/genética , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Metástase Neoplásica , RNA Longo não Codificante/genética , Transcriptoma/genéticaRESUMO
Estrogen receptor (ER) plays important roles in cell growth, development and tumorigenesis. Although ER-regulated genes have been extensively investigated, little is known about roles of ER-regulated lncRNAs in breast cancer. Here, we conducted genome-wide study of ER-regulated lncRNAs by using RNA-seq, ChIP-seq and TCGA data. A total of identified 114 ER-regulated lncRNAs were identified, many of them were overexpressed in ER+ breast cancer and co-expressed with some key regulators. Silencing one of most prominent lncRNA, AP000439.3, resulted in inhibition of cell cycle progression and proliferation. Further study revealed AP000439.3 can regulate expression of CCND1 through enhancing estrogen receptor induction of CCND1. This finding revealed lncRNAs may serve as important effectors of ER in regulation of gene expression and cell phenotype in breast cancer.