Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Genome Res ; 33(4): 525-540, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-37072185

RESUMEN

Small nucleolar RNAs (snoRNAs) are structured noncoding RNAs present in multiple copies within eukaryotic genomes. snoRNAs guide chemical modifications on their target RNA and regulate processes like ribosome assembly and splicing. Most human snoRNAs are embedded within host gene introns, the remainder being independently expressed from intergenic regions. We recently characterized the abundance of snoRNAs and their host gene across several healthy human tissues and found that the level of most snoRNAs does not correlate with that of their host gene, with the observation that snoRNAs embedded within the same host gene often differ drastically in abundance. To better understand the determinants of snoRNA expression, we trained machine learning models to predict whether snoRNAs are expressed or not in human tissues based on more than 30 collected features related to snoRNAs and their genomic context. By interpreting the models' predictions, we find that snoRNAs rely on conserved motifs, a stable global structure and terminal stem, and a transcribed locus to be expressed. We observe that these features explain well the varying abundance of snoRNAs embedded within the same host gene. By predicting the expression status of snoRNAs across several vertebrates, we notice that only one-third of all annotated snoRNAs are expressed per genome, as in humans. Our results suggest that ancestral snoRNAs disseminated within vertebrate genomes, sometimes leading to the development of new functions and a probable gain in fitness and thereby conserving features favorable to the expression of these few snoRNAs, the large remainder often degenerating into pseudogenes.


Asunto(s)
ARN Nucleolar Pequeño , Vertebrados , Animales , Humanos , ARN Nucleolar Pequeño/genética , ARN Nucleolar Pequeño/química , Vertebrados/genética , Eucariontes/genética , ARN no Traducido , Expresión Génica
3.
Mol Cell Proteomics ; 23(5): 100767, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38615877

RESUMEN

DNA replication is a fundamental cellular process that ensures the transfer of genetic information during cell division. Genome duplication takes place in S phase and requires a dynamic and highly coordinated recruitment of multiple proteins at replication forks. Various genotoxic stressors lead to fork instability and collapse, hence the need for DNA repair pathways. By identifying the multitude of protein interactions implicated in those events, we can better grasp the complex and dynamic molecular mechanisms that facilitate DNA replication and repair. Proximity-dependent biotin identification was used to identify associations with 17 proteins within four core replication components, namely the CDC45/MCM2-7/GINS helicase that unwinds DNA, the DNA polymerases, replication protein A subunits, and histone chaperones needed to disassemble and reassemble chromatin. We further investigated the impact of genotoxic stress on these interactions. This analysis revealed a vast proximity association network with 108 nuclear proteins further modulated in the presence of hydroxyurea; 45 being enriched and 63 depleted. Interestingly, hydroxyurea treatment also caused a redistribution of associations with 11 interactors, meaning that the replisome is dynamically reorganized when stressed. The analysis identified several poorly characterized proteins, thereby uncovering new putative players in the cellular response to DNA replication arrest. It also provides a new comprehensive proteomic framework to understand how cells respond to obstacles during DNA replication.


Asunto(s)
Replicación del ADN , Hidroxiurea , Proteómica , Hidroxiurea/farmacología , Proteómica/métodos , Humanos , Daño del ADN , Proteínas de Ciclo Celular/metabolismo , Proteínas de Ciclo Celular/genética , Proteoma/metabolismo
4.
Nucleic Acids Res ; 52(D1): D522-D528, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-37956315

RESUMEN

The OpenProt proteogenomic resource (https://www.openprot.org/) provides users with a complete and freely accessible set of non-canonical or alternative open reading frames (AltORFs) within the transcriptome of various species, as well as functional annotations of the corresponding protein sequences not found in standard databases. Enhancements in this update are largely the result of user feedback and include the prediction of structure, subcellular localization, and intrinsic disorder, using cutting-edge algorithms based on machine learning techniques. The mass spectrometry pipeline now integrates a machine learning-based peptide rescoring method to improve peptide identification. We continue to help users explore this cryptic proteome by providing OpenCustomDB, a tool that enables users to build their own customized protein databases, and OpenVar, a genomic annotator including genetic variants within AltORFs and protein sequences. A new interface improves the visualization of all functional annotations, including a spectral viewer and the prediction of multicoding genes. All data on OpenProt are freely available and downloadable. Overall, OpenProt continues to establish itself as an important resource for the exploration and study of new proteins.


Asunto(s)
Bases de Datos de Proteínas , Péptidos , Proteómica , Secuencia de Aminoácidos , Genómica , Internet , Péptidos/genética , Proteoma/genética , Proteómica/métodos , Humanos
5.
Nucleic Acids Res ; 51(D1): D291-D296, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36165892

RESUMEN

snoDB is an interactive database of human small nucleolar RNAs (snoRNAs) that includes up-to-date information on snoRNA features, genomic location, conservation, host gene, snoRNA-RNA targets and snoRNA abundance and provides links to other resources. In the second edition of this database (snoDB 2.0), we added an entirely new section on ribosomal RNA (rRNA) chemical modifications guided by snoRNAs with easy navigation between the different rRNA versions used in the literature and experimentally measured levels of modification. We also included new layers of information, including snoRNA motifs, secondary structure prediction, snoRNA-protein interactions, copy annotations and low structure bias expression data in a wide panel of tissues and cell lines to bolster functional probing of snoRNA biology. Version 2.0 features updated identifiers, more links to external resources and duplicate entry resolution. As a result, snoDB 2.0, which is freely available at https://bioinfo-scottgroup.med.usherbrooke.ca/snoDB/, represents a one-stop shop for snoRNA features, rRNA modification targets, functional impact and potential regulators.


Asunto(s)
Bases de Datos Genéticas , ARN Nucleolar Pequeño , Humanos , Genómica , ARN Ribosómico/metabolismo , ARN Nucleolar Pequeño/metabolismo
6.
RNA Biol ; 21(1): 1-11, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38626213

RESUMEN

Small nucleolar RNAs (snoRNAs) are a class of conserved noncoding RNAs forming complexes with proteins to catalyse site-specific modifications on ribosomal RNA. Besides this canonical role, several snoRNAs are now known to regulate diverse levels of gene expression. While these functions are carried out in trans by mature snoRNAs, evidence has also been emerging of regulatory roles of snoRNAs in cis, either within their genomic locus or as longer transcription intermediates during their maturation. Herein, we review recent findings that snoRNAs can interact in cis with their intron to regulate the expression of their host gene. We also explore the ever-growing diversity of longer host-derived snoRNA extensions and their functional impact across the transcriptome. Finally, we discuss the role of snoRNA duplications into forging these new layers of snoRNA-mediated regulation, as well as their involvement in the genomic imprinting of their host locus.


Asunto(s)
ARN Nucleolar Pequeño , ARN no Traducido , ARN Nucleolar Pequeño/genética , ARN Nucleolar Pequeño/metabolismo , ARN no Traducido/genética , ARN Ribosómico/genética , Intrones
7.
Nucleic Acids Res ; 50(11): 6067-6083, 2022 06 24.
Artículo en Inglés | MEDLINE | ID: mdl-35657102

RESUMEN

Box C/D small nucleolar RNAs (snoRNAs) are a conserved class of RNA known for their role in guiding ribosomal RNA 2'-O-ribose methylation. Recently, C/D snoRNAs were also implicated in regulating the expression of non-ribosomal genes through different modes of binding. Large scale RNA-RNA interaction datasets detect many snoRNAs binding messenger RNA, but are limited by specific experimental conditions. To enable a more comprehensive study of C/D snoRNA interactions, we created snoGloBe, a human C/D snoRNA interaction predictor based on a gradient boosting classifier. SnoGloBe considers the target type, position and sequence of the interactions, enabling it to outperform existing predictors. Interestingly, for specific snoRNAs, snoGloBe identifies strong enrichment of interactions near gene expression regulatory elements including splice sites. Abundance and splicing of predicted targets were altered upon the knockdown of their associated snoRNA. Strikingly, the predicted snoRNA interactions often overlap with the binding sites of functionally related RNA binding proteins, reinforcing their role in gene expression regulation. SnoGloBe is also an excellent tool for discovering viral RNA targets, as shown by its capacity to identify snoRNAs targeting the heavily methylated SARS-CoV-2 RNA. Overall, snoGloBe is capable of identifying experimentally validated binding sites and predicting novel sites with shared regulatory function.


Asunto(s)
ARN Nucleolar Pequeño , Programas Informáticos , Secuencia de Bases , Humanos , ARN Ribosómico/metabolismo , ARN Nucleolar Pequeño/metabolismo , ARN Viral , SARS-CoV-2
8.
Brief Bioinform ; 22(1): 140-145, 2021 01 18.
Artículo en Inglés | MEDLINE | ID: mdl-31813948

RESUMEN

Ribonucleic acid sequencing (RNA-seq) identifies and quantifies RNA molecules from a biological sample. Transformation from raw sequencing data to meaningful gene or isoform counts requires an in silico bioinformatics pipeline. Such pipelines are modular in nature, built using selected software and biological references. Software is usually chosen and parameterized according to the sequencing protocol and biological question. However, while biological and technical noise is alleviated through replicates, biases due to the pipeline and choice of biological references are often overlooked. Here, we show that the current standard practice prevents reproducibility in RNA-seq studies by failing to specify required methodological information. Peer-reviewed articles are intended to apply currently accepted scientific and methodological standards. Inasmuch as the bias-less and optimal RNA-seq pipeline is not perfectly defined, methodological information holds a meaningful role in defining the results. This work illustrates the need for a standardized and explicit display of methodological information in RNA-seq experiments.


Asunto(s)
RNA-Seq/métodos , Animales , Humanos , RNA-Seq/normas , Valores de Referencia , Reproducibilidad de los Resultados
9.
Nucleic Acids Res ; 49(14): 8370-8383, 2021 08 20.
Artículo en Inglés | MEDLINE | ID: mdl-34244793

RESUMEN

RBFOX2 controls the splicing of a large number of transcripts implicated in cell differentiation and development. Parsing RNA-binding protein datasets, we uncover that RBFOX2 can interact with hnRNPC, hnRNPM and SRSF1 to regulate splicing of a broad range of splicing events using different sequence motifs and binding modes. Using immunoprecipitation, specific RBP knockdown, RNA-seq and splice-sensitive PCR, we show that RBFOX2 can target splice sites using three binding configurations: single, multiple or secondary modes. In the single binding mode RBFOX2 is recruited to its target splice sites through a single canonical binding motif, while in the multiple binding mode RBFOX2 binding sites include the adjacent binding of at least one other RNA binding protein partner. Finally, in the secondary binding mode RBFOX2 likely does not bind the RNA directly but is recruited to splice sites lacking its canonical binding motif through the binding of one of its protein partners. These dynamic modes bind distinct sets of transcripts at different positions and distances relative to alternative splice sites explaining the heterogeneity of RBFOX2 targets and splicing outcomes.


Asunto(s)
Ribonucleoproteína Heterogénea-Nuclear Grupo C/genética , Ribonucleoproteína Heterogénea-Nuclear Grupo M/genética , Factores de Empalme de ARN/genética , Empalme del ARN/genética , Proteínas de Unión al ARN/genética , Proteínas Represoras/genética , Empalme Alternativo/genética , Sitios de Unión , Humanos , ARN/genética , Sitios de Empalme de ARN/genética
10.
Nucleic Acids Res ; 49(D1): D380-D388, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33179748

RESUMEN

OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one's bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.


Asunto(s)
Bases de Datos de Proteínas , Eucariontes/genética , Genoma , Anotación de Secuencia Molecular , Sistemas de Lectura Abierta/genética , Espectrometría de Masas , Isoformas de Proteínas/genética , Proteogenómica , Ribosomas/metabolismo , Interfaz Usuario-Computador
11.
BMC Bioinformatics ; 23(1): 250, 2022 Jun 24.
Artículo en Inglés | MEDLINE | ID: mdl-35751026

RESUMEN

BACKGROUND: Alternative splicing can increase the diversity of gene functions by generating multiple isoforms with different sequences and functions. However, the extent to which splicing events have functional consequences remains unclear and predicting the impact of splicing events on protein activity is limited to gene-specific analysis. RESULTS: To accelerate the identification of functionally relevant alternative splicing events we created SAPFIR, a predictor of protein features associated with alternative splicing events. This webserver tool uses InterProScan to predict protein features such as functional domains, motifs and sites in the human and mouse genomes and link them to alternative splicing events. Alternative protein features are displayed as functions of the transcripts and splice sites. SAPFIR could be used to analyze proteins generated from a single gene or a group of genes and can directly identify alternative protein features in large sequence data sets. The accuracy and utility of SAPFIR was validated by its ability to rediscover previously validated alternative protein domains. In addition, our de novo analysis of public datasets using SAPFIR indicated that only a small portion of alternative protein domains was conserved between human and mouse, and that in human, genes involved in nervous system process, regulation of DNA-templated transcription and aging are more likely to produce isoforms missing functional domains due to alternative splicing. CONCLUSION: Overall SAPFIR represents a new tool for the rapid identification of functional alternative splicing events and enables the identification of cellular functions affected by a defined splicing program. SAPFIR is freely available at https://bioinfo-scottgroup.med.usherbrooke.ca/sapfir/ , a website implemented in Python, with all major browsers supported. The source code is available at https://github.com/DelongZHOU/SAPFIR .


Asunto(s)
Empalme Alternativo , Empalme del ARN , Animales , Genoma , Ratones , Isoformas de Proteínas/genética , Programas Informáticos
12.
Reprod Biol Endocrinol ; 20(1): 14, 2022 Jan 14.
Artículo en Inglés | MEDLINE | ID: mdl-35031065

RESUMEN

BACKGROUND: During pregnancy, maternal metabolism undergoes substantial changes to support the developing fetus. Such changes are finely regulated by different mechanisms carried out by effectors such as microRNAs (miRNAs). These small non-coding RNAs regulate numerous biological functions, mostly through post-transcriptional repression of gene expression. miRNAs are also secreted in circulation by numerous organs, such as the placenta. However, the complete plasmatic microtranscriptome of pregnant women has still not been fully described, although some miRNA clusters from the chromosome 14 (C14MC) and the chromosome 19 (C19MC and miR-371-3 cluster) have been proposed as being specific to pregnancy. Our aims were thus to describe the plasma microtranscriptome during the first trimester of pregnancy, by assessing the differences with non-pregnant women, and how it varies between the 4th and the 16th week of pregnancy. METHODS: Plasmatic miRNAs from 436 pregnant (gestational week 4 to 16) and 15 non-pregnant women were quantified using Illumina HiSeq next-generation sequencing platform. Differentially abundant miRNAs were identified using DESeq2 package (FDR q-value ≤ 0.05) and their targeted biological pathways were assessed with DIANA-miRpath. RESULTS: A total of 2101 miRNAs were detected, of which 191 were differentially abundant (fold change < 0.05 or > 2, FDR q-value ≤ 0.05) between pregnant and non-pregnant women. Of these, 100 miRNAs were less and 91 miRNAs were more abundant in pregnant women. Additionally, the abundance of 57 miRNAs varied according to gestational age at first trimester, of which 47 were positively and 10 were negatively associated with advancing gestational age. miRNAs from the C19MC were positively associated with both pregnancy and gestational age variation during the first trimester. Biological pathway analysis revealed that these 191 (pregnancy-specific) and 57 (gestational age markers) miRNAs targeted genes involved in fatty acid metabolism, ECM-receptor interaction and TGF-beta signaling pathways. CONCLUSION: We have identified circulating miRNAs specific to pregnancy and/or that varied with gestational age in first trimester. These miRNAs target biological pathways involved in lipid metabolism as well as placenta and embryo development, suggesting a contribution to the maternal metabolic adaptation to pregnancy and fetal growth.


Asunto(s)
MicroARNs/genética , Primer Trimestre del Embarazo/genética , Adolescente , Adulto , Estudios de Casos y Controles , Estudios de Cohortes , Femenino , Expresión Génica , Perfilación de la Expresión Génica , Edad Gestacional , Humanos , Masculino , MicroARNs/sangre , Persona de Mediana Edad , Embarazo , Primer Trimestre del Embarazo/sangre , Factores de Tiempo , Adulto Joven
13.
BMC Cancer ; 22(1): 363, 2022 Apr 04.
Artículo en Inglés | MEDLINE | ID: mdl-35379199

RESUMEN

Radio-chemotherapy with 5-flu orouracil (5-FU) is the standard of care treatment for patients with colorectal cancer, but it is only effective for a third of them. Despite our understanding of the mechanism of action of 5-FU, drug resistance remains a significant limitation to the clinical use of 5-FU, as both intrinsic and acquired chemoresistance represents the major obstacles for the success of 5-FU-based chemotherapy. In order to identify the mechanism of acquired resistance, 5-FU chemoresistance was induced in CRC cell lines by passaging cells with increasing concentrations of 5-FU. To study global molecular changes, quantitative proteomics and transcriptomics analyses were performed on these cell lines, comparing the resistant cells as well as the effect of chemo and radiotherapy. Interestingly, a very high proportion of downregulated genes were annotated as transcription factors coding for Krüppel-associated box (KRAB) domain-containing zinc-finger proteins (KZFPs), the largest family of transcriptional repressors. Among nearly 350 KRAB-ZFPs, almost a quarter were downregulated after the induction of a 5-FU-resistance including a common one between the three CRC cell lines, ZNF649, whose role is still unknown. To confirm the observations of the proteomic and transcriptomic approaches, the abundance of 20 different KZFPs and control mRNAs was validated by RT-qPCR. In fact, several KZFPs were no longer detectable using qPCR in cell lines resistant to 5-FU, and the KZFPs that were downregulated only in one or two cell lines showed similar pattern of expression as measured by the omics approaches. This proteomic, transcriptomic and genomic analysis of intrinsic and acquired resistance highlights a possible new mechanism involved in the cellular adaptation to 5-FU and therefore identifies potential new therapeutic targets to overcome this resistance.


Asunto(s)
Neoplasias Colorrectales , Fluorouracilo , Línea Celular Tumoral , Neoplasias Colorrectales/tratamiento farmacológico , Neoplasias Colorrectales/genética , Regulación hacia Abajo , Resistencia a Antineoplásicos/genética , Fluorouracilo/farmacología , Fluorouracilo/uso terapéutico , Humanos , Proteómica , Dedos de Zinc/genética
14.
FASEB J ; 35(5): e21544, 2021 05.
Artículo en Inglés | MEDLINE | ID: mdl-33819356

RESUMEN

Serine-rich splicing factor 3 (SRSF3) was recently reported as being necessary to preserve RNA stability via an mTOR mechanism in a cardiac mouse model in adulthood. Here, we demonstrate the link between Srsf3 and mitochondrial integrity in an embryonic cardiomyocyte-specific Srsf3 conditional knockout (cKO) mouse model. Fifteen-day-old Srsf3 cKO mice showed dramatically reduced (below 50%) survival and reduced the left ventricular systolic performance, and histological analysis of these hearts revealed a significant increase in cardiomyocyte size, confirming the severe remodeling induced by Srsf3 deletion. RNA-seq analysis of the hearts of 5-day-old Srsf3 cKO mice revealed early changes in expression levels and alternative splicing of several transcripts related to mitochondrial integrity and oxidative phosphorylation. Likewise, the levels of several protein complexes of the electron transport chain decreased, and mitochondrial complex I-driven respiration of permeabilized cardiac muscle fibers from the left ventricle was impaired. Furthermore, transmission electron microscopy analysis showed disordered mitochondrial length and cristae structure. Together with its indispensable role in the physiological maintenance of mouse hearts, these results highlight the previously unrecognized function of Srsf3 in regulating the mitochondrial integrity.


Asunto(s)
Regulación de la Expresión Génica , Mitocondrias Cardíacas/patología , Miocitos Cardíacos/patología , Factores de Empalme Serina-Arginina/fisiología , Empalme Alternativo , Animales , Perfilación de la Expresión Génica , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados , Mitocondrias Cardíacas/metabolismo , Miocitos Cardíacos/metabolismo , Fosforilación Oxidativa , RNA-Seq
15.
Mol Cell Proteomics ; 19(5): 808-827, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-32123031

RESUMEN

HNF4α is a nuclear receptor produced as 12 isoforms from two promoters by alternative splicing. To characterize the transcriptional capacities of all 12 HNF4α isoforms, stable lines expressing each isoform were generated. The entire transcriptome associated with each isoform was analyzed as well as their respective interacting proteome. Major differences were noted in the transcriptional function of these isoforms. The α1 and α2 isoforms were the strongest regulators of gene expression whereas the α3 isoform exhibited significantly reduced activity. The α4, α5, and α6 isoforms, which use an alternative first exon, were characterized for the first time, and showed a greatly reduced transcriptional potential with an inability to recognize the consensus response element of HNF4α. Several transcription factors and coregulators were identified as potential specific partners for certain HNF4α isoforms. An analysis integrating the vast amount of omics data enabled the identification of transcriptional regulatory mechanisms specific to certain HNF4α isoforms, hence demonstrating the importance of considering all isoforms given their seemingly diverse functions.


Asunto(s)
Factor Nuclear 4 del Hepatocito/metabolismo , Transcripción Genética , Línea Celular Tumoral , ADN/metabolismo , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Proteínas Fluorescentes Verdes/metabolismo , Factor Nuclear 4 del Hepatocito/genética , Humanos , Unión Proteica , Isoformas de Proteínas/metabolismo , Receptores Citoplasmáticos y Nucleares/metabolismo , Elementos de Respuesta/genética , Factores de Transcripción/metabolismo , Transcriptoma/genética
16.
Nucleic Acids Res ; 48(5): 2271-2286, 2020 03 18.
Artículo en Inglés | MEDLINE | ID: mdl-31980822

RESUMEN

The study of RNA expression is the fastest growing area of genomic research. However, despite the dramatic increase in the number of sequenced transcriptomes, we still do not have accurate estimates of the number and expression levels of non-coding RNA genes. Non-coding transcripts are often overlooked due to incomplete genome annotation. In this study, we use annotation-independent detection of RNA reads generated using a reverse transcriptase with low structure bias to identify non-coding RNA. Transcripts between 20 and 500 nucleotides were filtered and crosschecked with non-coding RNA annotations revealing 111 non-annotated non-coding RNAs expressed in different cell lines and tissues. Inspecting the sequence and structural features of these transcripts indicated that 60% of these transcripts correspond to new snoRNA and tRNA-like genes. The identified genes exhibited features of their respective families in terms of structure, expression, conservation and response to depletion of interacting proteins. Together, our data reveal a new group of RNA that are difficult to detect using standard gene prediction and RNA sequencing techniques, suggesting that reliance on actual gene annotation and sequencing techniques distorts the perceived architecture of the human transcriptome.


Asunto(s)
Anotación de Secuencia Molecular/métodos , ARN Mensajero/genética , ARN Nucleolar Pequeño/genética , ARN de Transferencia/genética , ARN no Traducido/genética , Transcriptoma , Animales , Emparejamiento Base , Secuencia de Bases , Línea Celular Tumoral , Conjuntos de Datos como Asunto , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Humanos , Conformación de Ácido Nucleico , Filogenia , ARN Mensajero/clasificación , ARN Mensajero/metabolismo , ARN Nucleolar Pequeño/clasificación , ARN Nucleolar Pequeño/metabolismo , ARN de Transferencia/clasificación , ARN de Transferencia/metabolismo , ARN no Traducido/clasificación , ARN no Traducido/metabolismo , Análisis de Secuencia de ARN , Secuenciación del Exoma
17.
Nucleic Acids Res ; 48(D1): D220-D225, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31598696

RESUMEN

Small nucleolar RNAs (snoRNAs) are an abundant type of non-coding RNA with conserved functions in all known eukaryotes. Classified into two main families, the box C/D and H/ACA snoRNAs, they enact their most well characterized role of guiding site specific modifications in ribosomal RNA, through the formation of specific ribonucleoprotein complexes, with fundamental implications in ribosome biogenesis. However, it is becoming increasingly clear that the landscape of snoRNA cellular functionality is much broader than it once seemed with novel members, non-uniform expression patterns, new and diverse targets as well as several emerging non-canonical functions ranging from the modulation of alternative splicing to the regulation of chromatin architecture. In order to facilitate the further characterization of human snoRNAs in a holistic manner, we introduce an online interactive database tool: snoDB. Its purpose is to consolidate information on human snoRNAs from different sources such as sequence databases, target information, both canonical and non-canonical from the literature and from high-throughput RNA-RNA interaction datasets, as well as high-throughput sequencing data that can be visualized interactively.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , ARN Nucleolar Pequeño/química , ARN Nucleolar Pequeño/metabolismo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Análisis de Secuencia de ARN , Interfaz Usuario-Computador
18.
BMC Genomics ; 22(1): 414, 2021 Jun 05.
Artículo en Inglés | MEDLINE | ID: mdl-34090325

RESUMEN

BACKGROUND: Small nucleolar RNAs (snoRNAs) are an abundant class of noncoding RNAs present in all eukaryotes and best known for their involvement in ribosome biogenesis. In mammalian genomes, many snoRNAs exist in multiple copies, resulting from recombination and retrotransposition from an ancestral snoRNA. To gain insight into snoRNA copy regulation, we used Rfam classification and normal human tissue expression datasets generated using low structure bias RNA-seq to characterize snoRNA families. RESULTS: We found that although box H/ACA families are on average larger than box C/D families, the number of expressed members is similar for both types. Family members can cover a wide range of average abundance values, but importantly, expression variability of individual members of a family is preferred over the total variability of the family, especially for box H/ACA snoRNAs, suggesting that while members are likely differentially regulated, mechanisms exist to ensure uniformity of the total family abundance across tissues. Box C/D snoRNA family members are mostly embedded in the same host gene while box H/ACA family members tend to be encoded in more than one different host, supporting a model in which box C/D snoRNA duplication occurred mostly by cis recombination while box H/ACA snoRNA families have gained copy members through retrotransposition. And unexpectedly, snoRNAs encoded in the same host gene can be regulated independently, as some snoRNAs within the same family vary in abundance in a divergent way between tissues. CONCLUSIONS: SnoRNA copy regulation affects family sizes, genomic location of the members and controls simultaneously member and total family abundance to respond to the needs of individual tissues.


Asunto(s)
Genoma , ARN Nucleolar Pequeño , Animales , Composición Familiar , Genómica , Humanos , ARN Nucleolar Pequeño/genética
19.
Nucleic Acids Res ; 47(D1): D403-D410, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30299502

RESUMEN

Advances in proteomics and sequencing have highlighted many non-annotated open reading frames (ORFs) in eukaryotic genomes. Genome annotations, cornerstones of today's research, mostly rely on protein prior knowledge and on ab initio prediction algorithms. Such algorithms notably enforce an arbitrary criterion of one coding sequence (CDS) per transcript, leading to a substantial underestimation of the coding potential of eukaryotes. Here, we present OpenProt, the first database fully endorsing a polycistronic model of eukaryotic genomes to date. OpenProt contains all possible ORFs longer than 30 codons across 10 species, and cumulates supporting evidence such as protein conservation, translation and expression. OpenProt annotates all known proteins (RefProts), novel predicted isoforms (Isoforms) and novel predicted proteins from alternative ORFs (AltProts). It incorporates cutting-edge algorithms to evaluate protein orthology and re-interrogate publicly available ribosome profiling and mass spectrometry datasets, supporting the annotation of thousands of predicted ORFs. The constantly growing database currently cumulates evidence from 87 ribosome profiling and 114 mass spectrometry studies from several species, tissues and cell lines. All data is freely available and downloadable from a web platform (www.openprot.org) supporting a genome browser and advanced queries for each species. Thus, OpenProt enables a more comprehensive landscape of eukaryotic genomes' coding potential.


Asunto(s)
Eucariontes/genética , Genes/genética , Genoma , Sistemas de Lectura Abierta/genética , Proteoma/genética , Algoritmos , Animales , Humanos , Espectrometría de Masas , Anotación de Secuencia Molecular , Isoformas de Proteínas/genética , Proteómica/métodos , Ribosomas/metabolismo , Homología de Secuencia de Aminoácido
20.
Nucleic Acids Res ; 47(6): 2856-2870, 2019 04 08.
Artículo en Inglés | MEDLINE | ID: mdl-30698747

RESUMEN

Stress hormones bind and activate the glucocorticoid receptor (GR) in many tissues including the brain. We identified arginine and glutamate rich 1 (ARGLU1) in a screen for new modulators of glucocorticoid signaling in the CNS. Biochemical studies show that the glutamate rich C-terminus of ARGLU1 coactivates multiple nuclear receptors including the glucocorticoid receptor (GR) and the arginine rich N-terminus interacts with splicing factors and binds to RNA. RNA-seq of neural cells depleted of ARGLU1 revealed significant changes in the expression and alternative splicing of distinct genes involved in neurogenesis. Loss of ARGLU1 is embryonic lethal in mice, and knockdown in zebrafish causes neurodevelopmental and heart defects. Treatment with dexamethasone, a GR activator, also induces changes in the pattern of alternatively spliced genes, many of which were lost when ARGLU1 was absent. Importantly, the genes found to be alternatively spliced in response to glucocorticoid treatment were distinct from those under transcriptional control by GR, suggesting an additional mechanism of glucocorticoid action is present in neural cells. Our results thus show that ARGLU1 is a novel factor for embryonic development that modulates basal transcription and alternative splicing in neural cells with consequences for glucocorticoid signaling.


Asunto(s)
Desarrollo Embrionario , Glucocorticoides/farmacología , Péptidos y Proteínas de Señalización Intracelular/fisiología , Empalme del ARN/genética , Activación Transcripcional/genética , Empalme Alternativo/efectos de los fármacos , Empalme Alternativo/genética , Animales , Animales Modificados Genéticamente , Células Cultivadas , Embrión no Mamífero , Desarrollo Embrionario/efectos de los fármacos , Desarrollo Embrionario/genética , Glucocorticoides/metabolismo , Células HEK293 , Humanos , Ratones , Ratones Endogámicos C57BL , Neurogénesis/efectos de los fármacos , Neurogénesis/genética , Empalme del ARN/efectos de los fármacos , Transducción de Señal/efectos de los fármacos , Transducción de Señal/genética , Estrés Fisiológico/efectos de los fármacos , Estrés Fisiológico/genética , Transactivadores/fisiología , Activación Transcripcional/efectos de los fármacos , Pez Cebra
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA