Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 130
Filter
1.
Nucleic Acids Res ; 52(9): 5152-5165, 2024 May 22.
Article in English | MEDLINE | ID: mdl-38647067

ABSTRACT

Structured noncoding RNAs (ncRNAs) contribute to many important cellular processes involving chemical catalysis, molecular recognition and gene regulation. Few ncRNA classes are broadly distributed among organisms from all three domains of life, but the list of rarer classes that exhibit surprisingly diverse functions is growing. We previously developed a computational pipeline that enables the near-comprehensive identification of structured ncRNAs expressed from individual bacterial genomes. The regions between protein coding genes are first sorted based on length and the fraction of guanosine and cytidine nucleotides. Long, GC-rich intergenic regions are then examined for sequence and structural similarity to other bacterial genomes. Herein, we describe the implementation of this pipeline on 50 bacterial genomes from varied phyla. More than 4700 candidate intergenic regions with the desired characteristics were identified, which yielded 44 novel riboswitch candidates and numerous other putative ncRNA motifs. Although experimental validation studies have yet to be conducted, this rate of riboswitch candidate discovery is consistent with predictions that many hundreds of novel riboswitch classes remain to be discovered among the bacterial species whose genomes have already been sequenced. Thus, many thousands of additional novel ncRNA classes likely remain to be discovered in the bacterial domain of life.


Subject(s)
Genome, Bacterial , RNA, Bacterial , RNA, Untranslated , DNA, Intergenic/genetics , Genome, Bacterial/genetics , Genomics/methods , Riboswitch/genetics , RNA, Bacterial/genetics , RNA, Bacterial/chemistry , RNA, Untranslated/genetics , RNA, Untranslated/classification , RNA, Untranslated/chemistry
2.
Nucleic Acids Res ; 50(D1): D222-D230, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34850920

ABSTRACT

MicroRNAs (miRNAs) are noncoding RNAs with 18-26 nucleotides; they pair with target mRNAs to regulate gene expression and produce significant changes in various physiological and pathological processes. In recent years, the interaction between miRNAs and their target genes has become one of the mainstream directions for drug development. As a large-scale biological database that mainly provides miRNA-target interactions (MTIs) verified by biological experiments, miRTarBase has undergone five revisions and enhancements. The database has accumulated >2 200 449 verified MTIs from 13 389 manually curated articles and CLIP-seq data. An optimized scoring system is adopted to enhance this update's critical recognition of MTI-related articles and corresponding disease information. In addition, single-nucleotide polymorphisms and disease-related variants related to the binding efficiency of miRNA and target were characterized in miRNAs and gene 3' untranslated regions. miRNA expression profiles across extracellular vesicles, blood and different tissues, including exosomal miRNAs and tissue-specific miRNAs, were integrated to explore miRNA functions and biomarkers. For the user interface, we have classified attributes, including RNA expression, specific interaction, protein expression and biological function, for various validation experiments related to the role of miRNA. We also used seed sequence information to evaluate the binding sites of miRNA. In summary, these enhancements render miRTarBase as one of the most research-amicable MTI databases that contain comprehensive and experimentally verified annotations. The newly updated version of miRTarBase is now available at https://miRTarBase.cuhk.edu.cn/.


Subject(s)
3' Untranslated Regions , Databases, Nucleic Acid , Gene Regulatory Networks , MicroRNAs/genetics , Neoplasms/genetics , RNA, Untranslated/genetics , Animals , Binding Sites , Biomarkers/metabolism , Data Mining/statistics & numerical data , Exosomes/chemistry , Exosomes/metabolism , Gene Expression Regulation , Humans , Internet , Mice , MicroRNAs/classification , MicroRNAs/metabolism , Molecular Sequence Annotation , Neoplasms/metabolism , Neoplasms/pathology , Polymorphism, Single Nucleotide , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Tumor Cells, Cultured , User-Computer Interface
3.
Nucleic Acids Res ; 50(D1): D333-D339, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34551440

ABSTRACT

Resolving the spatial distribution of the transcriptome at a subcellular level can increase our understanding of biology and diseases. To facilitate studies of biological functions and molecular mechanisms in the transcriptome, we updated RNALocate, a resource for RNA subcellular localization analysis that is freely accessible at http://www.rnalocate.org/ or http://www.rna-society.org/rnalocate/. Compared to RNALocate v1.0, the new features in version 2.0 include (i) expansion of the data sources and the coverage of species; (ii) incorporation and integration of RNA-seq datasets containing information about subcellular localization; (iii) addition and reorganization of RNA information (RNA subcellular localization conditions and descriptive figures for method, RNA homology information, RNA interaction and ncRNA disease information) and (iv) three additional prediction tools: DM3Loc, iLoc-lncRNA and iLoc-mRNA. Overall, RNALocate v2.0 provides a comprehensive RNA subcellular localization resource for researchers to deconvolute the highly complex architecture of the cell.


Subject(s)
Databases, Nucleic Acid , RNA, Untranslated/genetics , Software , Transcriptome , Animals , Base Sequence , Cell Compartmentation , Datasets as Topic , Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Eukaryotic Cells/cytology , Eukaryotic Cells/metabolism , Gene Expression Regulation , Gene Ontology , Humans , Internet , Mice , Molecular Sequence Annotation , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Rats , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Sequence Alignment , Sequence Homology, Nucleic Acid , Subcellular Fractions/chemistry , Subcellular Fractions/metabolism , Zebrafish/genetics , Zebrafish/metabolism
4.
Nucleic Acids Res ; 50(D1): D950-D955, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34723317

ABSTRACT

The rapid development of single-molecule long-read sequencing (LRS) and single-cell assay for transposase accessible chromatin sequencing (scATAC-seq) technologies presents both challenges and opportunities for the annotation of noncoding variants. Here, we updated 3DSNP, a comprehensive database for human noncoding variant annotation, to expand its applications to structural variation (SV) and to implement variant annotation down to single-cell resolution. The updates of 3DSNP include (i) annotation of 108 317 SVs from a full spectrum of functions, especially their potential effects on three-dimensional chromatin structures, (ii) evaluation of the accessible chromatin peaks flanking the variants across 126 cell types/subtypes in 15 human fetal tissues and 54 cell types/subtypes in 25 human adult tissues by integrating scATAC-seq data and (iii) expansion of Hi-C data to 49 human cell types. In summary, this version is a significant and comprehensive improvement over the previous version. The 3DSNP v2.0 database is freely available at https://omic.tech/3dsnpv2/.


Subject(s)
Chromatin/chemistry , Databases, Genetic , Molecular Sequence Annotation , RNA, Untranslated/genetics , Software , Adult , Cell Lineage/genetics , Chromatin/metabolism , Chromosome Mapping , Eukaryotic Cells/cytology , Eukaryotic Cells/metabolism , Fetus , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , Internet , Polymorphism, Single Nucleotide , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Single Molecule Imaging/methods , Single-Cell Analysis/methods
5.
Nucleic Acids Res ; 50(D1): D928-D933, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34723320

ABSTRACT

As a means to aid in the investigation of viral infection mechanisms and identification of more effective antivirus targets, the availability of a source which continually collects and updates information on the virus and host ncRNA-associated interaction resources is essential. Here, we update the ViRBase database to version 3.0 (http://www.virbase.org/ or http://www.rna-society.org/virbase/). This update represents a major revision: (i) the total number of interaction entries is now greater than 820,000, an approximately 70-fold increment, involving 116 virus and 36 host organisms, (ii) it supplements and provides more details on RNA annotations (including RNA editing, RNA localization and RNA modification), ncRNA SNP and ncRNA-drug related information and (iii) it provides two additional tools for predicting binding sites (IntaRNA and PRIdictor), a visual plug-in to display interactions and a website which is optimized for more practical and user-friendly operation. Overall, ViRBase v3.0 provides a more comprehensive resource for virus and host ncRNA-associated interactions enabling researchers a more effective means for investigation of viral infections.


Subject(s)
Databases, Genetic , Genome, Viral , Host-Pathogen Interactions/genetics , RNA, Untranslated/genetics , Software , Viruses/genetics , Binding Sites , Chromatin/chemistry , Chromatin/metabolism , Humans , Internet , Molecular Sequence Annotation , Polymorphism, Single Nucleotide , RNA Editing , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Signal Transduction , Virus Diseases/genetics , Virus Diseases/metabolism , Virus Diseases/pathology , Virus Diseases/virology , Viruses/classification , Viruses/metabolism , Viruses/pathogenicity
6.
Nucleic Acids Res ; 50(D1): D279-D286, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34747466

ABSTRACT

RNA polymerase III (Pol III) transcribes hundreds of non-coding RNA genes (ncRNAs), which involve in a variety of cellular processes. However, the expression, functions, regulatory networks and evolution of these Pol III-transcribed ncRNAs are still largely unknown. In this study, we developed a novel resource, Pol3Base (http://rna.sysu.edu.cn/pol3base/), to decode the interactome, expression, evolution, epitranscriptome and disease variations of Pol III-transcribed ncRNAs. The current release of Pol3Base includes thousands of regulatory relationships between ∼79 000 ncRNAs and transcription factors by mining 56 ChIP-seq datasets. By integrating CLIP-seq datasets, we deciphered the interactions of these ncRNAs with >240 RNA binding proteins. Moreover, Pol3Base contains ∼9700 RNA modifications located within thousands of Pol III-transcribed ncRNAs. Importantly, we characterized expression profiles of ncRNAs in >70 tissues and 28 different tumor types. In addition, by comparing these ncRNAs from human and mouse, we revealed about 4000 evolutionary conserved ncRNAs. We also identified ∼11 403 tRNA-derived small RNAs (tsRNAs) in 32 different tumor types. Finally, by analyzing somatic mutation data, we investigated the mutation map of these ncRNAs to help uncover their potential roles in diverse diseases. This resource will help expand our understanding of potential functions and regulatory networks of Pol III-transcribed ncRNAs.


Subject(s)
Databases, Genetic , Neoplasms/genetics , RNA Polymerase III/genetics , RNA, Untranslated/genetics , RNA-Binding Proteins/genetics , Software , Transcription Factors/genetics , Animals , Data Mining , Datasets as Topic , Evolution, Molecular , Gene Expression Regulation , Gene Regulatory Networks , Humans , Internet , Mice , Mutation , Neoplasms/classification , Neoplasms/metabolism , Neoplasms/pathology , RNA Polymerase III/metabolism , RNA, Transfer/classification , RNA, Transfer/genetics , RNA, Transfer/metabolism , RNA, Untranslated/classification , RNA, Untranslated/metabolism , RNA-Binding Proteins/classification , RNA-Binding Proteins/metabolism , Transcription Factors/classification , Transcription Factors/metabolism , Transcription, Genetic
7.
Nat Rev Rheumatol ; 17(11): 692-705, 2021 11.
Article in English | MEDLINE | ID: mdl-34588660

ABSTRACT

Non-coding RNAs have distinct regulatory roles in the pathogenesis of joint diseases including osteoarthritis (OA) and rheumatoid arthritis (RA). As the amount of high-throughput profiling studies and mechanistic investigations of microRNAs, long non-coding RNAs and circular RNAs in joint tissues and biofluids has increased, data have emerged that suggest complex interactions among non-coding RNAs that are often overlooked as critical regulators of gene expression. Identifying these non-coding RNAs and their interactions is useful for understanding both joint health and disease. Non-coding RNAs regulate signalling pathways and biological processes that are important for normal joint development but, when dysregulated, can contribute to disease. The specific expression profiles of non-coding RNAs in various disease states support their roles as promising candidate biomarkers, mediators of pathogenic mechanisms and potential therapeutic targets. This Review synthesizes literature published in the past 2 years on the role of non-coding RNAs in OA and RA with a focus on inflammation, cell death, cell proliferation and extracellular matrix dysregulation. Research to date makes it apparent that 'non-coding' does not mean 'non-essential' and that non-coding RNAs are important parts of a complex interactome that underlies OA and RA.


Subject(s)
Gene Expression Regulation , Joint Diseases , Joints , RNA, Untranslated , Arthritis, Rheumatoid/genetics , Arthritis, Rheumatoid/immunology , Arthritis, Rheumatoid/physiopathology , Biomarkers/analysis , Epigenesis, Genetic/immunology , Epigenesis, Genetic/physiology , Gene Expression Regulation/physiology , Genomics , Humans , Inflammation/genetics , Inflammation/immunology , Inflammation/physiopathology , Inflammation/therapy , Joint Diseases/genetics , Joint Diseases/immunology , Joint Diseases/physiopathology , Joint Diseases/therapy , Joints/immunology , Joints/physiology , Joints/physiopathology , Osteoarthritis/genetics , Osteoarthritis/immunology , Osteoarthritis/physiopathology , RNA/classification , RNA/physiology , RNA, Untranslated/biosynthesis , RNA, Untranslated/classification , RNA, Untranslated/physiology
8.
Int J Mol Sci ; 22(16)2021 Aug 13.
Article in English | MEDLINE | ID: mdl-34445436

ABSTRACT

Apart from protein-coding Ribonucleic acids (RNAs), there exists a variety of non-coding RNAs (ncRNAs) which regulate complex cellular and molecular processes. High-throughput sequencing technologies and bioinformatics approaches have largely promoted the exploration of ncRNAs which revealed their crucial roles in gene regulation, miRNA binding, protein interactions, and splicing. Furthermore, ncRNAs are involved in the development of complicated diseases like cancer. Categorization of ncRNAs is essential to understand the mechanisms of diseases and to develop effective treatments. Sub-cellular localization information of ncRNAs demystifies diverse functionalities of ncRNAs. To date, several computational methodologies have been proposed to precisely identify the class as well as sub-cellular localization patterns of RNAs). This paper discusses different types of ncRNAs, reviews computational approaches proposed in the last 10 years to distinguish coding-RNA from ncRNA, to identify sub-types of ncRNAs such as piwi-associated RNA, micro RNA, long ncRNA, and circular RNA, and to determine sub-cellular localization of distinct ncRNAs and RNAs. Furthermore, it summarizes diverse ncRNA classification and sub-cellular localization determination datasets along with benchmark performance to aid the development and evaluation of novel computational methodologies. It identifies research gaps, heterogeneity, and challenges in the development of computational approaches for RNA sequence analysis. We consider that our expert analysis will assist Artificial Intelligence researchers with knowing state-of-the-art performance, model selection for various tasks on one platform, dominantly used sequence descriptors, neural architectures, and interpreting inter-species and intra-species performance deviation.


Subject(s)
Computational Biology/methods , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Animals , Artificial Intelligence , Databases, Factual , High-Throughput Nucleotide Sequencing , Humans , RNA, Untranslated/genetics , Sequence Analysis, RNA , Tissue Distribution
9.
RNA Biol ; 18(12): 2168-2182, 2021 12.
Article in English | MEDLINE | ID: mdl-34110970

ABSTRACT

Mitochondrial noncoding RNAs (mt-ncRNAs) include noncoding RNAs inside the mitochondria that are transcribed from the mitochondrial genome or nuclear genome, and noncoding RNAs transcribed from the mitochondrial genome that are transported to the cytosol or nucleus. Recent findings have revealed that mt-ncRNAs play important roles in not only mitochondrial functions, but also other cellular activities. This review proposes a classification of mt-ncRNAs and outlines the emerging understanding of mitochondrial circular RNAs (mt-circRNAs), mitochondrial microRNAs (mitomiRs), and mitochondrial long noncoding RNAs (mt-lncRNAs), with an emphasis on their identification and functions.


Subject(s)
Mitochondria/genetics , RNA, Untranslated/genetics , Animals , Epigenesis, Genetic , Gene Expression Regulation , Humans , RNA, Mitochondrial/genetics , RNA, Untranslated/classification
10.
Adv Biol Regul ; 80: 100809, 2021 05.
Article in English | MEDLINE | ID: mdl-33932728

ABSTRACT

Non-coding RNAs (ncRNAs) play important and diverse roles in mammalian cell biology and pathology. Although the functions of an increasing number of ncRNAs have been identified, the mechanisms underlying ncRNA gene expression remain elusive and are incompletely understood. Here, we investigated ncRNA gene expression in Michigan cancer foundation 7 (MCF7), a malignant breast cancer cell line, on treatment of tetraarsenic oxide (TAO), a potential anti-cancer drug. Our genomic analyses found that TAO up- or down-regulated ncRNA genes genome-wide. A subset of identified ncRNAs with critical biological and clinical functions were validated by real-time quantitative polymerase chain reaction. Intriguingly, these TAO-regulated genes included CDKN2B-AS, HOXA11-AS, SHH, and DUSP5 that are known to interact with or be targeted by polycomb repressive complexes (PRCs). In addition, the PRC subunits were enriched in these TAO-regulated ncRNA genes and TAO treatment deregulated the expression of PRC subunits. Strikingly, TAO decreased the cellular and gene-specific levels of EZH2 expression and H3K27me3. In particular, TAO reduced EZH2 and H3K27me3 and increased transcription at MALAT1 gene. Inhibiting the catalytic activity of EZH2 using GSK343 increased representative TAO-inducible ncRNA genes. Together, our findings suggest that the expression of a subset of ncRNA genes is regulated by PRC2 and that TAO could be a potent epigenetic regulator through PRCs to modulate the ncRNA gene expression in MCF7 cells.


Subject(s)
Antineoplastic Agents/pharmacology , Arsenic Trioxide/pharmacology , Histones/genetics , Polycomb-Group Proteins/genetics , RNA, Untranslated/genetics , Transcriptome , Autophagy/drug effects , Autophagy/genetics , Cell Cycle/drug effects , Cell Cycle/genetics , Computational Biology/methods , DNA Repair/drug effects , Enhancer of Zeste Homolog 2 Protein/genetics , Enhancer of Zeste Homolog 2 Protein/metabolism , Exocytosis/drug effects , Gene Expression Regulation, Neoplastic , Gene Ontology , Genome, Human , HEK293 Cells , Histones/metabolism , Humans , MCF-7 Cells , Molecular Sequence Annotation , Polycomb-Group Proteins/classification , Polycomb-Group Proteins/metabolism , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , RNA, Untranslated/classification , RNA, Untranslated/metabolism
11.
Methods Mol Biol ; 2300: 3-9, 2021.
Article in English | MEDLINE | ID: mdl-33792866

ABSTRACT

The discovery of new classes of non-coding RNAs has always been preceded or accompanied by technological breakthroughs, and these outstanding progresses in transcriptomics approaches enabled to regularly add new members to the list. From the first detection of tRNAs, through the revolution of miRNAs discovery, to the recent identification of eRNAs or the identification of new functions for already known ncRNAs, this introductive review provides a very concise historical and functional overview of most prominent small regulatory non-coding RNA families.


Subject(s)
RNA, Untranslated/classification , RNA, Untranslated/history , Animals , Gene Expression Regulation , History, 20th Century , Humans , Multigene Family , RNA, Untranslated/genetics
12.
Artif Cells Nanomed Biotechnol ; 49(1): 204-218, 2021 Dec.
Article in English | MEDLINE | ID: mdl-33645342

ABSTRACT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a zoo tonic, highly pathogenic virus. The new type of coronavirus with contagious nature spread from Wuhan (China) to the whole world in a very short time and caused the new coronavirus disease (COVID-19). COVID-19 has turned into a global public health crisis due to spreading by close person-to-person contact with high transmission capacity. Thus, research about the treatment of the damages caused by the virus or prevention from infection increases everyday. Besides, there is still no approved and definitive, standardized treatment for COVID-19. However, this disaster experienced by human beings has made us realize the significance of having a system ready for use to prevent humanity from viral attacks without wasting time. As is known, nanocarriers can be targeted to the desired cells in vitro and in vivo. The nano-carrier system targeting a specific protein, containing the enzyme inhibiting the action of the virus can be developed. The system can be used by simple modifications when we encounter another virus epidemic in the future. In this review, we present a potential treatment method consisting of a nanoparticle-ribozyme conjugate, targeting ACE-2 receptors by reviewing the virus-associated ribozymes, their structures, types and working mechanisms.


Subject(s)
COVID-19 Drug Treatment , Nanoparticles/administration & dosage , RNA, Catalytic/therapeutic use , RNA, Viral/antagonists & inhibitors , SARS-CoV-2/drug effects , Angiotensin-Converting Enzyme 2/antagonists & inhibitors , Clinical Trials as Topic , Drug Carriers , Drug Compounding , Drug Design , HIV Infections/drug therapy , HIV-1/drug effects , HIV-1/genetics , Humans , Middle East Respiratory Syndrome Coronavirus/drug effects , Middle East Respiratory Syndrome Coronavirus/genetics , Models, Molecular , Nucleic Acid Conformation , RNA Interference , RNA, Catalytic/administration & dosage , RNA, Catalytic/chemistry , RNA, Catalytic/classification , RNA, Untranslated/classification , RNA, Untranslated/genetics , RNA, Untranslated/therapeutic use , Receptors, Coronavirus/antagonists & inhibitors , Severe acute respiratory syndrome-related coronavirus/drug effects , Severe acute respiratory syndrome-related coronavirus/genetics , SARS-CoV-2/genetics , SARS-CoV-2/physiology , Spike Glycoprotein, Coronavirus/physiology , Virus Replication/drug effects
13.
Article in English | MEDLINE | ID: mdl-32224462

ABSTRACT

Evidence has accumulated enough to prove non-coding RNAs (ncRNAs) play important roles in cellular biological processes and disease pathogenesis. High throughput techniques have produced a large number of ncRNAs whose function remains unknown. Since the accurate identification of ncRNAs family is helpful to the research of their function, it is of necessity and urgency to predict the family of each ncRNAs. Although several traditional excellent methods are applicable to predict the family of ncRNAs, their complex procedures or inaccurate performance remain major problems confronting us. The main idea of those methods is first to predict the secondary structure, and then identify ncRNAs family according to properties of the secondary structure. Unfortunately, the multi-step error superposition, especially the imperfection of RNA secondary structure prediction tools, maybe the cause of low accuracy. In this paper, a novel end-to-end method 'ncRFP' was proposed to complete the prediction task based on Deep Learning. Instead of predicting the secondary structure, ncRFP predicts the ncRNAs family by automatically extracting features from ncRNAs sequences. Compared with other methods, ncRFP not only simplifies the process but also improves accuracy. The source code of ncRFP can be available at https://github.com/linyuwangPHD/ncRFP.


Subject(s)
Computational Biology/methods , Deep Learning , RNA, Untranslated , Sequence Analysis, RNA/methods , Software , Nucleic Acid Conformation , RNA, Untranslated/chemistry , RNA, Untranslated/classification , RNA, Untranslated/genetics
14.
Nucleic Acids Res ; 49(D1): D160-D164, 2021 01 08.
Article in English | MEDLINE | ID: mdl-32833025

ABSTRACT

Many studies have indicated that non-coding RNA (ncRNA) dysfunction is closely related to numerous diseases. Recently, accumulated ncRNA-disease associations have made related databases insufficient to meet the demands of biomedical research. The constant updating of ncRNA-disease resources has become essential. Here, we have updated the mammal ncRNA-disease repository (MNDR, http://www.rna-society.org/mndr/) to version 3.0, containing more than one million entries, four-fold increment in data compared to the previous version. Experimental and predicted circRNA-disease associations have been integrated, increasing the number of categories of ncRNAs to five, and the number of mammalian species to 11. Moreover, ncRNA-disease related drug annotations and associations, as well as ncRNA subcellular localizations and interactions, were added. In addition, three ncRNA-disease (miRNA/lncRNA/circRNA) prediction tools were provided, and the website was also optimized, making it more practical and user-friendly. In summary, MNDR v3.0 will be a valuable resource for the investigation of disease mechanisms and clinical treatment strategies.


Subject(s)
Databases, Nucleic Acid , MicroRNAs/genetics , Neoplasms/genetics , RNA, Circular/genetics , RNA, Untranslated/genetics , Animals , Humans , Internet , Mammals , MicroRNAs/classification , MicroRNAs/metabolism , Molecular Sequence Annotation , Neoplasms/classification , Neoplasms/metabolism , Neoplasms/pathology , RNA, Circular/classification , RNA, Circular/metabolism , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Software
15.
Nucleic Acids Res ; 49(D1): D192-D200, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33211869

ABSTRACT

Rfam is a database of RNA families where each of the 3444 families is represented by a multiple sequence alignment of known RNA sequences and a covariance model that can be used to search for additional members of the family. Recent developments have involved expert collaborations to improve the quality and coverage of Rfam data, focusing on microRNAs, viral and bacterial RNAs. We have completed the first phase of synchronising microRNA families in Rfam and miRBase, creating 356 new Rfam families and updating 40. We established a procedure for comprehensive annotation of viral RNA families starting with Flavivirus and Coronaviridae RNAs. We have also increased the coverage of bacterial and metagenome-based RNA families from the ZWD database. These developments have enabled a significant growth of the database, with the addition of 759 new families in Rfam 14. To facilitate further community contribution to Rfam, expert users are now able to build and submit new families using the newly developed Rfam Cloud family curation system. New Rfam website features include a new sequence similarity search powered by RNAcentral, as well as search and visualisation of families with pseudoknots. Rfam is freely available at https://rfam.org.


Subject(s)
Databases, Nucleic Acid , Metagenome , MicroRNAs/genetics , RNA, Bacterial/genetics , RNA, Untranslated/genetics , RNA, Viral/genetics , Bacteria/genetics , Bacteria/metabolism , Base Pairing , Base Sequence , Humans , Internet , MicroRNAs/classification , MicroRNAs/metabolism , Molecular Sequence Annotation , Nucleic Acid Conformation , RNA, Bacterial/classification , RNA, Bacterial/metabolism , RNA, Untranslated/classification , RNA, Untranslated/metabolism , RNA, Viral/classification , RNA, Viral/metabolism , Sequence Alignment , Sequence Analysis, RNA , Software , Viruses/genetics , Viruses/metabolism
16.
Nucleic Acids Res ; 49(D1): D212-D220, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33106848

ABSTRACT

RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world's largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community. RNAcentral is freely available at https://rnacentral.org.


Subject(s)
Databases, Nucleic Acid/organization & administration , Molecular Sequence Annotation , RNA, Untranslated/genetics , Software , Animals , Apicomplexa/classification , Apicomplexa/genetics , Base Sequence , Betacoronavirus/classification , Betacoronavirus/genetics , Databases, Nucleic Acid/supply & distribution , Fungi/classification , Fungi/genetics , Gene Ontology , Humans , Internet , Nucleic Acid Conformation , RNA, Untranslated/classification , RNA, Untranslated/metabolism , Sequence Analysis, RNA
17.
Nucleic Acids Res ; 49(D1): D1094-D1101, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33095860

ABSTRACT

Most mutations in cancer genomes occur in the non-coding regions with unknown impact on tumor development. Although the increase in the number of cancer whole-genome sequences has revealed numerous putative non-coding cancer drivers, their information is dispersed across multiple studies making it difficult to understand their roles in tumorigenesis of different cancer types. We have developed CNCDatabase, Cornell Non-coding Cancer driver Database (https://cncdatabase.med.cornell.edu/) that contains detailed information about predicted non-coding drivers at gene promoters, 5' and 3' UTRs (untranslated regions), enhancers, CTCF insulators and non-coding RNAs. CNCDatabase documents 1111 protein-coding genes and 90 non-coding RNAs with reported drivers in their non-coding regions from 32 cancer types by computational predictions of positive selection using whole-genome sequences; differential gene expression in samples with and without mutations; or another set of experimental validations including luciferase reporter assays and genome editing. The database can be easily modified and scaled as lists of non-coding drivers are revised in the community with larger whole-genome sequencing studies, CRISPR screens and further experimental validations. Overall, CNCDatabase provides a helpful resource for researchers to explore the pathological role of non-coding alterations in human cancers.


Subject(s)
Carcinogenesis/genetics , Databases, Genetic , Gene Expression Regulation, Neoplastic , Genome, Human , Neoplasms/genetics , 3' Untranslated Regions , 5' Untranslated Regions , Carcinogenesis/metabolism , Carcinogenesis/pathology , Clustered Regularly Interspaced Short Palindromic Repeats , Enhancer Elements, Genetic , Genes, Reporter , Humans , Insulator Elements , Luciferases/genetics , Luciferases/metabolism , Mutation , Neoplasms/metabolism , Neoplasms/pathology , Open Reading Frames , Promoter Regions, Genetic , RNA, Untranslated/classification , RNA, Untranslated/genetics , RNA, Untranslated/metabolism , Untranslated Regions , Whole Genome Sequencing
18.
Sci Rep ; 10(1): 18863, 2020 11 02.
Article in English | MEDLINE | ID: mdl-33139759

ABSTRACT

Recent studies show that non-coding RNAs (ncRNAs) can regulate the expression of protein-coding genes and play important roles in mammalian development. Previous studies have revealed that during C. elegans (Caenorhabditis elegans) embryo development, numerous genes in each cell are spatiotemporally regulated, causing the cell to differentiate into distinct cell types and tissues. We ask whether ncRNAs participate in the spatiotemporal regulation of genes in different types of cells and tissues during the embryogenesis of C. elegans. Here, by using marker-free full-length high-depth single-cell RNA sequencing (scRNA-seq) technique, we sequence the whole transcriptomes from 1031 embryonic cells of C. elegans and detect 20,431 protein-coding genes, including 22 cell-type-specific protein-coding markers, and 9843 ncRNAs including 11 cell-type-specific ncRNA markers. We induce a ncRNAs-based clustering strategy as a complementary strategy to the protein-coding gene-based clustering strategy for single-cell classification. We identify 94 ncRNAs that have never been reported to regulate gene expressions, are co-expressed with 1208 protein-coding genes in cell type specific and/or embryo time specific manners. Our findings suggest that these ncRNAs could potentially influence the spatiotemporal expression of the corresponding genes during the embryogenesis of C. elegans.


Subject(s)
Caenorhabditis elegans/genetics , Embryonic Development/genetics , RNA, Untranslated/genetics , Transcriptome/genetics , Animals , Caenorhabditis elegans/growth & development , Caenorhabditis elegans Proteins/genetics , Gene Expression Regulation, Developmental/genetics , RNA, Untranslated/classification , Single-Cell Analysis
19.
Comput Biol Chem ; 88: 107364, 2020 Oct.
Article in English | MEDLINE | ID: mdl-32890916

ABSTRACT

A non-coding RNA (ncRNA) is a kind of RNA that is not converted into protein, however, it is involved in many biological processes, diseases, and cancers. Numerous ncRNAs have been identified and classified with high throughput sequencing technology. Hence, accurate ncRNAs class prediction is important and necessary for further study of their functions. Several computation techniques have been employed to predict the class of ncRNAs. Recent classification methods used the secondary structure as their primary input. However, the computational tools of RNA secondary structure are not accurate enough which affects the final performance of ncRNAs predictors. In this paper, we propose a simple yet efficient method, called ncRDeep, for ncRNAs prediction. It uses a simple convolutional neural network and RNA sequence information only. The ncRDeep was evaluated on benchmark datasets and the comparison results showed that the ncRDeep outperforms the state-of-the-art methods significantly. More specifically, the average accuracy was improved by 8.32%. Finally, we built a freely accessible web server for the developed tool ncRDeep at http://home.jbnu.ac.kr/NSCL/ncRDeep.htm.


Subject(s)
Neural Networks, Computer , RNA, Untranslated/classification , RNA, Untranslated/genetics , Databases, Genetic , Humans
20.
Genes Genomics ; 42(11): 1259-1265, 2020 11.
Article in English | MEDLINE | ID: mdl-32946063

ABSTRACT

BACKGROUND: Down syndrome (DS), caused by trisomy 21, is the most common human chromosomal disorder. Hippocampal abnormalities have been believed to be responsible for the DS developmental cognitive deficits. Cumulative evidences indicated that non-coding RNAs (ncRNAs) participated in brain development and function. Currently, few was known whether dysregulated ncRNAs existed in DS whether the dysregulated ncRNAs played important pathology roles in DS. OBJECTIVE: The purpose of this study was generating an overview map of the dysregulated ncRNAs in DS, including the microRNA (miRNA), long ncRNA (lncRNA) and circular RNA (circRNAs). DS mouse models are invaluable tools for further mechanism and therapy studies. METHODS: The well-studied DS mouse model Dp(16)1/Yey was used in this study as it contains the trisomy of the whole human chromosome 21 syntenic region on mouse chromosomes 16. Hippocampi were isolated from pups of seven-days-old. Libraries for miRNA, lncRNA and circRNAs were constructed separately, and the next generation sequencing method was utilized. RESULTS: Differentially expressed (DE) miRNAs, lncRNAs and circRNAs were reported. Relative few regulating relationship were found between the DE miRNAs and DE mRNAs. LncRNAs originated from the trisomic regions expressed in clusters, but not all of them were 1.5-fold increased expressed. Dramatic DE circular RNAs were found in the DS hippocampus. The host genes of the DE circRNAs were enriched on functions which were well-known impaired in DS, e.g. long-term-potentiation, glutamatergic synapse, and GABAergic synapse. CONCLUSIONS: We generated the first DS developmental hippocampal ncRNA transcriptome map. This work laid foundations for further investigations on role of ncRNAs in hippocampal functions.


Subject(s)
Down Syndrome/genetics , RNA, Untranslated/genetics , Transcriptome/genetics , Animals , Disease Models, Animal , Down Syndrome/pathology , Gene Expression Profiling/methods , Gene Expression Regulation , High-Throughput Nucleotide Sequencing , Hippocampus/metabolism , Hippocampus/pathology , Humans , Mice , MicroRNAs/genetics , MicroRNAs/isolation & purification , RNA, Circular/genetics , RNA, Circular/isolation & purification , RNA, Long Noncoding/genetics , RNA, Long Noncoding/isolation & purification , RNA, Untranslated/classification , RNA, Untranslated/isolation & purification
SELECTION OF CITATIONS
SEARCH DETAIL
...