RESUMO
Small nucleolar RNAs (snoRNAs) are structured noncoding RNAs present in multiple copies within eukaryotic genomes. snoRNAs guide chemical modifications on their target RNA and regulate processes like ribosome assembly and splicing. Most human snoRNAs are embedded within host gene introns, the remainder being independently expressed from intergenic regions. We recently characterized the abundance of snoRNAs and their host gene across several healthy human tissues and found that the level of most snoRNAs does not correlate with that of their host gene, with the observation that snoRNAs embedded within the same host gene often differ drastically in abundance. To better understand the determinants of snoRNA expression, we trained machine learning models to predict whether snoRNAs are expressed or not in human tissues based on more than 30 collected features related to snoRNAs and their genomic context. By interpreting the models' predictions, we find that snoRNAs rely on conserved motifs, a stable global structure and terminal stem, and a transcribed locus to be expressed. We observe that these features explain well the varying abundance of snoRNAs embedded within the same host gene. By predicting the expression status of snoRNAs across several vertebrates, we notice that only one-third of all annotated snoRNAs are expressed per genome, as in humans. Our results suggest that ancestral snoRNAs disseminated within vertebrate genomes, sometimes leading to the development of new functions and a probable gain in fitness and thereby conserving features favorable to the expression of these few snoRNAs, the large remainder often degenerating into pseudogenes.
Assuntos
RNA Nucleolar Pequeno , Vertebrados , Animais , Humanos , RNA Nucleolar Pequeno/genética , RNA Nucleolar Pequeno/química , Vertebrados/genética , Eucariotos/genética , RNA não Traduzido , Expressão GênicaAssuntos
Simulação por Computador , Perfilação da Expressão Gênica , RNA-Seq , RNA/biossíntese , RNA/genética , Animais , HumanosRESUMO
DNA replication is a fundamental cellular process that ensures the transfer of genetic information during cell division. Genome duplication takes place in S phase and requires a dynamic and highly coordinated recruitment of multiple proteins at replication forks. Various genotoxic stressors lead to fork instability and collapse, hence the need for DNA repair pathways. By identifying the multitude of protein interactions implicated in those events, we can better grasp the complex and dynamic molecular mechanisms that facilitate DNA replication and repair. Proximity-dependent biotin identification was used to identify associations with 17 proteins within four core replication components, namely the CDC45/MCM2-7/GINS helicase that unwinds DNA, the DNA polymerases, replication protein A subunits, and histone chaperones needed to disassemble and reassemble chromatin. We further investigated the impact of genotoxic stress on these interactions. This analysis revealed a vast proximity association network with 108 nuclear proteins further modulated in the presence of hydroxyurea; 45 being enriched and 63 depleted. Interestingly, hydroxyurea treatment also caused a redistribution of associations with 11 interactors, meaning that the replisome is dynamically reorganized when stressed. The analysis identified several poorly characterized proteins, thereby uncovering new putative players in the cellular response to DNA replication arrest. It also provides a new comprehensive proteomic framework to understand how cells respond to obstacles during DNA replication.
Assuntos
Replicação do DNA , Hidroxiureia , Proteômica , Hidroxiureia/farmacologia , Proteômica/métodos , Humanos , Dano ao DNA , Proteínas de Ciclo Celular/metabolismo , Proteínas de Ciclo Celular/genética , Proteoma/metabolismoRESUMO
The OpenProt proteogenomic resource (https://www.openprot.org/) provides users with a complete and freely accessible set of non-canonical or alternative open reading frames (AltORFs) within the transcriptome of various species, as well as functional annotations of the corresponding protein sequences not found in standard databases. Enhancements in this update are largely the result of user feedback and include the prediction of structure, subcellular localization, and intrinsic disorder, using cutting-edge algorithms based on machine learning techniques. The mass spectrometry pipeline now integrates a machine learning-based peptide rescoring method to improve peptide identification. We continue to help users explore this cryptic proteome by providing OpenCustomDB, a tool that enables users to build their own customized protein databases, and OpenVar, a genomic annotator including genetic variants within AltORFs and protein sequences. A new interface improves the visualization of all functional annotations, including a spectral viewer and the prediction of multicoding genes. All data on OpenProt are freely available and downloadable. Overall, OpenProt continues to establish itself as an important resource for the exploration and study of new proteins.
Assuntos
Bases de Dados de Proteínas , Peptídeos , Proteômica , Sequência de Aminoácidos , Genômica , Internet , Peptídeos/genética , Proteoma/genética , Proteômica/métodos , HumanosRESUMO
snoDB is an interactive database of human small nucleolar RNAs (snoRNAs) that includes up-to-date information on snoRNA features, genomic location, conservation, host gene, snoRNA-RNA targets and snoRNA abundance and provides links to other resources. In the second edition of this database (snoDB 2.0), we added an entirely new section on ribosomal RNA (rRNA) chemical modifications guided by snoRNAs with easy navigation between the different rRNA versions used in the literature and experimentally measured levels of modification. We also included new layers of information, including snoRNA motifs, secondary structure prediction, snoRNA-protein interactions, copy annotations and low structure bias expression data in a wide panel of tissues and cell lines to bolster functional probing of snoRNA biology. Version 2.0 features updated identifiers, more links to external resources and duplicate entry resolution. As a result, snoDB 2.0, which is freely available at https://bioinfo-scottgroup.med.usherbrooke.ca/snoDB/, represents a one-stop shop for snoRNA features, rRNA modification targets, functional impact and potential regulators.
Assuntos
Bases de Dados Genéticas , RNA Nucleolar Pequeno , Humanos , Genômica , RNA Ribossômico/metabolismo , RNA Nucleolar Pequeno/metabolismoRESUMO
The field of RNA research has provided profound insights into the basic mechanisms modulating the function and adaption of biological systems. RNA has also been at the center stage in the development of transformative biotechnological and medical applications, perhaps most notably was the advent of mRNA vaccines that were critical in helping humanity through the Covid-19 pandemic. Unbeknownst to many, Canada boasts a diverse community of RNA scientists, spanning multiple disciplines and locations, whose cutting-edge research has established a rich track record of contributions across various aspects of RNA science over many decades. Through this position paper, we seek to highlight key contributions made by Canadian investigators to the RNA field, via both thematic and historical viewpoints. We also discuss initiatives underway to organize and enhance the impact of the Canadian RNA research community, particularly focusing on the creation of the not-for-profit organization RNA Canada ARN. Considering the strategic importance of RNA research in biology and medicine, and its considerable potential to help address major challenges facing humanity, sustained support of this sector will be critical to help Canadian scientists play key roles in the ongoing RNA revolution and the many benefits this could bring about to Canada.
RESUMO
Small nucleolar RNAs (snoRNAs) are a class of conserved noncoding RNAs forming complexes with proteins to catalyse site-specific modifications on ribosomal RNA. Besides this canonical role, several snoRNAs are now known to regulate diverse levels of gene expression. While these functions are carried out in trans by mature snoRNAs, evidence has also been emerging of regulatory roles of snoRNAs in cis, either within their genomic locus or as longer transcription intermediates during their maturation. Herein, we review recent findings that snoRNAs can interact in cis with their intron to regulate the expression of their host gene. We also explore the ever-growing diversity of longer host-derived snoRNA extensions and their functional impact across the transcriptome. Finally, we discuss the role of snoRNA duplications into forging these new layers of snoRNA-mediated regulation, as well as their involvement in the genomic imprinting of their host locus.
Assuntos
RNA Nucleolar Pequeno , RNA não Traduzido , RNA Nucleolar Pequeno/genética , RNA Nucleolar Pequeno/metabolismo , RNA não Traduzido/genética , RNA Ribossômico/genética , ÍntronsRESUMO
Box C/D small nucleolar RNAs (snoRNAs) are a conserved class of RNA known for their role in guiding ribosomal RNA 2'-O-ribose methylation. Recently, C/D snoRNAs were also implicated in regulating the expression of non-ribosomal genes through different modes of binding. Large scale RNA-RNA interaction datasets detect many snoRNAs binding messenger RNA, but are limited by specific experimental conditions. To enable a more comprehensive study of C/D snoRNA interactions, we created snoGloBe, a human C/D snoRNA interaction predictor based on a gradient boosting classifier. SnoGloBe considers the target type, position and sequence of the interactions, enabling it to outperform existing predictors. Interestingly, for specific snoRNAs, snoGloBe identifies strong enrichment of interactions near gene expression regulatory elements including splice sites. Abundance and splicing of predicted targets were altered upon the knockdown of their associated snoRNA. Strikingly, the predicted snoRNA interactions often overlap with the binding sites of functionally related RNA binding proteins, reinforcing their role in gene expression regulation. SnoGloBe is also an excellent tool for discovering viral RNA targets, as shown by its capacity to identify snoRNAs targeting the heavily methylated SARS-CoV-2 RNA. Overall, snoGloBe is capable of identifying experimentally validated binding sites and predicting novel sites with shared regulatory function.
Assuntos
RNA Nucleolar Pequeno , Software , Sequência de Bases , Humanos , RNA Ribossômico/metabolismo , RNA Nucleolar Pequeno/metabolismo , RNA Viral , SARS-CoV-2RESUMO
Ribonucleic acid sequencing (RNA-seq) identifies and quantifies RNA molecules from a biological sample. Transformation from raw sequencing data to meaningful gene or isoform counts requires an in silico bioinformatics pipeline. Such pipelines are modular in nature, built using selected software and biological references. Software is usually chosen and parameterized according to the sequencing protocol and biological question. However, while biological and technical noise is alleviated through replicates, biases due to the pipeline and choice of biological references are often overlooked. Here, we show that the current standard practice prevents reproducibility in RNA-seq studies by failing to specify required methodological information. Peer-reviewed articles are intended to apply currently accepted scientific and methodological standards. Inasmuch as the bias-less and optimal RNA-seq pipeline is not perfectly defined, methodological information holds a meaningful role in defining the results. This work illustrates the need for a standardized and explicit display of methodological information in RNA-seq experiments.
Assuntos
RNA-Seq/métodos , Animais , Humanos , RNA-Seq/normas , Valores de Referência , Reprodutibilidade dos TestesRESUMO
RBFOX2 controls the splicing of a large number of transcripts implicated in cell differentiation and development. Parsing RNA-binding protein datasets, we uncover that RBFOX2 can interact with hnRNPC, hnRNPM and SRSF1 to regulate splicing of a broad range of splicing events using different sequence motifs and binding modes. Using immunoprecipitation, specific RBP knockdown, RNA-seq and splice-sensitive PCR, we show that RBFOX2 can target splice sites using three binding configurations: single, multiple or secondary modes. In the single binding mode RBFOX2 is recruited to its target splice sites through a single canonical binding motif, while in the multiple binding mode RBFOX2 binding sites include the adjacent binding of at least one other RNA binding protein partner. Finally, in the secondary binding mode RBFOX2 likely does not bind the RNA directly but is recruited to splice sites lacking its canonical binding motif through the binding of one of its protein partners. These dynamic modes bind distinct sets of transcripts at different positions and distances relative to alternative splice sites explaining the heterogeneity of RBFOX2 targets and splicing outcomes.
Assuntos
Ribonucleoproteínas Nucleares Heterogêneas Grupo C/genética , Ribonucleoproteínas Nucleares Heterogêneas Grupo M/genética , Fatores de Processamento de RNA/genética , Splicing de RNA/genética , Proteínas de Ligação a RNA/genética , Proteínas Repressoras/genética , Processamento Alternativo/genética , Sítios de Ligação , Humanos , RNA/genética , Sítios de Splice de RNA/genéticaRESUMO
OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one's bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.
Assuntos
Bases de Dados de Proteínas , Eucariotos/genética , Genoma , Anotação de Sequência Molecular , Fases de Leitura Aberta/genética , Espectrometria de Massas , Isoformas de Proteínas/genética , Proteogenômica , Ribossomos/metabolismo , Interface Usuário-ComputadorRESUMO
BACKGROUND: Alternative splicing can increase the diversity of gene functions by generating multiple isoforms with different sequences and functions. However, the extent to which splicing events have functional consequences remains unclear and predicting the impact of splicing events on protein activity is limited to gene-specific analysis. RESULTS: To accelerate the identification of functionally relevant alternative splicing events we created SAPFIR, a predictor of protein features associated with alternative splicing events. This webserver tool uses InterProScan to predict protein features such as functional domains, motifs and sites in the human and mouse genomes and link them to alternative splicing events. Alternative protein features are displayed as functions of the transcripts and splice sites. SAPFIR could be used to analyze proteins generated from a single gene or a group of genes and can directly identify alternative protein features in large sequence data sets. The accuracy and utility of SAPFIR was validated by its ability to rediscover previously validated alternative protein domains. In addition, our de novo analysis of public datasets using SAPFIR indicated that only a small portion of alternative protein domains was conserved between human and mouse, and that in human, genes involved in nervous system process, regulation of DNA-templated transcription and aging are more likely to produce isoforms missing functional domains due to alternative splicing. CONCLUSION: Overall SAPFIR represents a new tool for the rapid identification of functional alternative splicing events and enables the identification of cellular functions affected by a defined splicing program. SAPFIR is freely available at https://bioinfo-scottgroup.med.usherbrooke.ca/sapfir/ , a website implemented in Python, with all major browsers supported. The source code is available at https://github.com/DelongZHOU/SAPFIR .
Assuntos
Processamento Alternativo , Splicing de RNA , Animais , Genoma , Camundongos , Isoformas de Proteínas/genética , SoftwareRESUMO
BACKGROUND: During pregnancy, maternal metabolism undergoes substantial changes to support the developing fetus. Such changes are finely regulated by different mechanisms carried out by effectors such as microRNAs (miRNAs). These small non-coding RNAs regulate numerous biological functions, mostly through post-transcriptional repression of gene expression. miRNAs are also secreted in circulation by numerous organs, such as the placenta. However, the complete plasmatic microtranscriptome of pregnant women has still not been fully described, although some miRNA clusters from the chromosome 14 (C14MC) and the chromosome 19 (C19MC and miR-371-3 cluster) have been proposed as being specific to pregnancy. Our aims were thus to describe the plasma microtranscriptome during the first trimester of pregnancy, by assessing the differences with non-pregnant women, and how it varies between the 4th and the 16th week of pregnancy. METHODS: Plasmatic miRNAs from 436 pregnant (gestational week 4 to 16) and 15 non-pregnant women were quantified using Illumina HiSeq next-generation sequencing platform. Differentially abundant miRNAs were identified using DESeq2 package (FDR q-value ≤ 0.05) and their targeted biological pathways were assessed with DIANA-miRpath. RESULTS: A total of 2101 miRNAs were detected, of which 191 were differentially abundant (fold change < 0.05 or > 2, FDR q-value ≤ 0.05) between pregnant and non-pregnant women. Of these, 100 miRNAs were less and 91 miRNAs were more abundant in pregnant women. Additionally, the abundance of 57 miRNAs varied according to gestational age at first trimester, of which 47 were positively and 10 were negatively associated with advancing gestational age. miRNAs from the C19MC were positively associated with both pregnancy and gestational age variation during the first trimester. Biological pathway analysis revealed that these 191 (pregnancy-specific) and 57 (gestational age markers) miRNAs targeted genes involved in fatty acid metabolism, ECM-receptor interaction and TGF-beta signaling pathways. CONCLUSION: We have identified circulating miRNAs specific to pregnancy and/or that varied with gestational age in first trimester. These miRNAs target biological pathways involved in lipid metabolism as well as placenta and embryo development, suggesting a contribution to the maternal metabolic adaptation to pregnancy and fetal growth.
Assuntos
MicroRNAs/genética , Primeiro Trimestre da Gravidez/genética , Adolescente , Adulto , Estudos de Casos e Controles , Estudos de Coortes , Feminino , Expressão Gênica , Perfilação da Expressão Gênica , Idade Gestacional , Humanos , Masculino , MicroRNAs/sangue , Pessoa de Meia-Idade , Gravidez , Primeiro Trimestre da Gravidez/sangue , Fatores de Tempo , Adulto JovemRESUMO
Radio-chemotherapy with 5-flu orouracil (5-FU) is the standard of care treatment for patients with colorectal cancer, but it is only effective for a third of them. Despite our understanding of the mechanism of action of 5-FU, drug resistance remains a significant limitation to the clinical use of 5-FU, as both intrinsic and acquired chemoresistance represents the major obstacles for the success of 5-FU-based chemotherapy. In order to identify the mechanism of acquired resistance, 5-FU chemoresistance was induced in CRC cell lines by passaging cells with increasing concentrations of 5-FU. To study global molecular changes, quantitative proteomics and transcriptomics analyses were performed on these cell lines, comparing the resistant cells as well as the effect of chemo and radiotherapy. Interestingly, a very high proportion of downregulated genes were annotated as transcription factors coding for Krüppel-associated box (KRAB) domain-containing zinc-finger proteins (KZFPs), the largest family of transcriptional repressors. Among nearly 350 KRAB-ZFPs, almost a quarter were downregulated after the induction of a 5-FU-resistance including a common one between the three CRC cell lines, ZNF649, whose role is still unknown. To confirm the observations of the proteomic and transcriptomic approaches, the abundance of 20 different KZFPs and control mRNAs was validated by RT-qPCR. In fact, several KZFPs were no longer detectable using qPCR in cell lines resistant to 5-FU, and the KZFPs that were downregulated only in one or two cell lines showed similar pattern of expression as measured by the omics approaches. This proteomic, transcriptomic and genomic analysis of intrinsic and acquired resistance highlights a possible new mechanism involved in the cellular adaptation to 5-FU and therefore identifies potential new therapeutic targets to overcome this resistance.
Assuntos
Neoplasias Colorretais , Fluoruracila , Linhagem Celular Tumoral , Neoplasias Colorretais/tratamento farmacológico , Neoplasias Colorretais/genética , Regulação para Baixo , Resistencia a Medicamentos Antineoplásicos/genética , Fluoruracila/farmacologia , Fluoruracila/uso terapêutico , Humanos , Proteômica , Dedos de Zinco/genéticaRESUMO
Serine-rich splicing factor 3 (SRSF3) was recently reported as being necessary to preserve RNA stability via an mTOR mechanism in a cardiac mouse model in adulthood. Here, we demonstrate the link between Srsf3 and mitochondrial integrity in an embryonic cardiomyocyte-specific Srsf3 conditional knockout (cKO) mouse model. Fifteen-day-old Srsf3 cKO mice showed dramatically reduced (below 50%) survival and reduced the left ventricular systolic performance, and histological analysis of these hearts revealed a significant increase in cardiomyocyte size, confirming the severe remodeling induced by Srsf3 deletion. RNA-seq analysis of the hearts of 5-day-old Srsf3 cKO mice revealed early changes in expression levels and alternative splicing of several transcripts related to mitochondrial integrity and oxidative phosphorylation. Likewise, the levels of several protein complexes of the electron transport chain decreased, and mitochondrial complex I-driven respiration of permeabilized cardiac muscle fibers from the left ventricle was impaired. Furthermore, transmission electron microscopy analysis showed disordered mitochondrial length and cristae structure. Together with its indispensable role in the physiological maintenance of mouse hearts, these results highlight the previously unrecognized function of Srsf3 in regulating the mitochondrial integrity.
Assuntos
Regulação da Expressão Gênica , Mitocôndrias Cardíacas/patologia , Miócitos Cardíacos/patologia , Fatores de Processamento de Serina-Arginina/fisiologia , Processamento Alternativo , Animais , Perfilação da Expressão Gênica , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , Mitocôndrias Cardíacas/metabolismo , Miócitos Cardíacos/metabolismo , Fosforilação Oxidativa , RNA-SeqRESUMO
HNF4α is a nuclear receptor produced as 12 isoforms from two promoters by alternative splicing. To characterize the transcriptional capacities of all 12 HNF4α isoforms, stable lines expressing each isoform were generated. The entire transcriptome associated with each isoform was analyzed as well as their respective interacting proteome. Major differences were noted in the transcriptional function of these isoforms. The α1 and α2 isoforms were the strongest regulators of gene expression whereas the α3 isoform exhibited significantly reduced activity. The α4, α5, and α6 isoforms, which use an alternative first exon, were characterized for the first time, and showed a greatly reduced transcriptional potential with an inability to recognize the consensus response element of HNF4α. Several transcription factors and coregulators were identified as potential specific partners for certain HNF4α isoforms. An analysis integrating the vast amount of omics data enabled the identification of transcriptional regulatory mechanisms specific to certain HNF4α isoforms, hence demonstrating the importance of considering all isoforms given their seemingly diverse functions.
Assuntos
Fator 4 Nuclear de Hepatócito/metabolismo , Transcrição Gênica , Linhagem Celular Tumoral , DNA/metabolismo , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Proteínas de Fluorescência Verde/metabolismo , Fator 4 Nuclear de Hepatócito/genética , Humanos , Ligação Proteica , Isoformas de Proteínas/metabolismo , Receptores Citoplasmáticos e Nucleares/metabolismo , Elementos de Resposta/genética , Fatores de Transcrição/metabolismo , Transcriptoma/genéticaRESUMO
The study of RNA expression is the fastest growing area of genomic research. However, despite the dramatic increase in the number of sequenced transcriptomes, we still do not have accurate estimates of the number and expression levels of non-coding RNA genes. Non-coding transcripts are often overlooked due to incomplete genome annotation. In this study, we use annotation-independent detection of RNA reads generated using a reverse transcriptase with low structure bias to identify non-coding RNA. Transcripts between 20 and 500 nucleotides were filtered and crosschecked with non-coding RNA annotations revealing 111 non-annotated non-coding RNAs expressed in different cell lines and tissues. Inspecting the sequence and structural features of these transcripts indicated that 60% of these transcripts correspond to new snoRNA and tRNA-like genes. The identified genes exhibited features of their respective families in terms of structure, expression, conservation and response to depletion of interacting proteins. Together, our data reveal a new group of RNA that are difficult to detect using standard gene prediction and RNA sequencing techniques, suggesting that reliance on actual gene annotation and sequencing techniques distorts the perceived architecture of the human transcriptome.
Assuntos
Anotação de Sequência Molecular/métodos , RNA Mensageiro/genética , RNA Nucleolar Pequeno/genética , RNA de Transferência/genética , RNA não Traduzido/genética , Transcriptoma , Animais , Pareamento de Bases , Sequência de Bases , Linhagem Celular Tumoral , Conjuntos de Dados como Assunto , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Conformação de Ácido Nucleico , Filogenia , RNA Mensageiro/classificação , RNA Mensageiro/metabolismo , RNA Nucleolar Pequeno/classificação , RNA Nucleolar Pequeno/metabolismo , RNA de Transferência/classificação , RNA de Transferência/metabolismo , RNA não Traduzido/classificação , RNA não Traduzido/metabolismo , Análise de Sequência de RNA , Sequenciamento do ExomaRESUMO
Small nucleolar RNAs (snoRNAs) are an abundant type of non-coding RNA with conserved functions in all known eukaryotes. Classified into two main families, the box C/D and H/ACA snoRNAs, they enact their most well characterized role of guiding site specific modifications in ribosomal RNA, through the formation of specific ribonucleoprotein complexes, with fundamental implications in ribosome biogenesis. However, it is becoming increasingly clear that the landscape of snoRNA cellular functionality is much broader than it once seemed with novel members, non-uniform expression patterns, new and diverse targets as well as several emerging non-canonical functions ranging from the modulation of alternative splicing to the regulation of chromatin architecture. In order to facilitate the further characterization of human snoRNAs in a holistic manner, we introduce an online interactive database tool: snoDB. Its purpose is to consolidate information on human snoRNAs from different sources such as sequence databases, target information, both canonical and non-canonical from the literature and from high-throughput RNA-RNA interaction datasets, as well as high-throughput sequencing data that can be visualized interactively.
Assuntos
Bases de Dados de Ácidos Nucleicos , RNA Nucleolar Pequeno/química , RNA Nucleolar Pequeno/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Análise de Sequência de RNA , Interface Usuário-ComputadorRESUMO
BACKGROUND: Small nucleolar RNAs (snoRNAs) are an abundant class of noncoding RNAs present in all eukaryotes and best known for their involvement in ribosome biogenesis. In mammalian genomes, many snoRNAs exist in multiple copies, resulting from recombination and retrotransposition from an ancestral snoRNA. To gain insight into snoRNA copy regulation, we used Rfam classification and normal human tissue expression datasets generated using low structure bias RNA-seq to characterize snoRNA families. RESULTS: We found that although box H/ACA families are on average larger than box C/D families, the number of expressed members is similar for both types. Family members can cover a wide range of average abundance values, but importantly, expression variability of individual members of a family is preferred over the total variability of the family, especially for box H/ACA snoRNAs, suggesting that while members are likely differentially regulated, mechanisms exist to ensure uniformity of the total family abundance across tissues. Box C/D snoRNA family members are mostly embedded in the same host gene while box H/ACA family members tend to be encoded in more than one different host, supporting a model in which box C/D snoRNA duplication occurred mostly by cis recombination while box H/ACA snoRNA families have gained copy members through retrotransposition. And unexpectedly, snoRNAs encoded in the same host gene can be regulated independently, as some snoRNAs within the same family vary in abundance in a divergent way between tissues. CONCLUSIONS: SnoRNA copy regulation affects family sizes, genomic location of the members and controls simultaneously member and total family abundance to respond to the needs of individual tissues.
Assuntos
Genoma , RNA Nucleolar Pequeno , Animais , Características da Família , Genômica , Humanos , RNA Nucleolar Pequeno/genéticaRESUMO
Stress hormones bind and activate the glucocorticoid receptor (GR) in many tissues including the brain. We identified arginine and glutamate rich 1 (ARGLU1) in a screen for new modulators of glucocorticoid signaling in the CNS. Biochemical studies show that the glutamate rich C-terminus of ARGLU1 coactivates multiple nuclear receptors including the glucocorticoid receptor (GR) and the arginine rich N-terminus interacts with splicing factors and binds to RNA. RNA-seq of neural cells depleted of ARGLU1 revealed significant changes in the expression and alternative splicing of distinct genes involved in neurogenesis. Loss of ARGLU1 is embryonic lethal in mice, and knockdown in zebrafish causes neurodevelopmental and heart defects. Treatment with dexamethasone, a GR activator, also induces changes in the pattern of alternatively spliced genes, many of which were lost when ARGLU1 was absent. Importantly, the genes found to be alternatively spliced in response to glucocorticoid treatment were distinct from those under transcriptional control by GR, suggesting an additional mechanism of glucocorticoid action is present in neural cells. Our results thus show that ARGLU1 is a novel factor for embryonic development that modulates basal transcription and alternative splicing in neural cells with consequences for glucocorticoid signaling.