Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 78
Filter
Add more filters

Publication year range
1.
Nature ; 622(7981): 41-47, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37794265

ABSTRACT

Scientists have been trying to identify every gene in the human genome since the initial draft was published in 2001. In the years since, much progress has been made in identifying protein-coding genes, currently estimated to number fewer than 20,000, with an ever-expanding number of distinct protein-coding isoforms. Here we review the status of the human gene catalogue and the efforts to complete it in recent years. Beside the ongoing annotation of protein-coding genes, their isoforms and pseudogenes, the invention of high-throughput RNA sequencing and other technological breakthroughs have led to a rapid growth in the number of reported non-coding RNA genes. For most of these non-coding RNAs, the functional relevance is currently unclear; we look at recent advances that offer paths forward to identifying their functions and towards eventually completing the human gene catalogue. Finally, we examine the need for a universal annotation standard that includes all medically significant genes and maintains their relationships with different reference genomes for the use of the human gene catalogue in clinical settings.


Subject(s)
Genes , Genome, Human , Molecular Sequence Annotation , Protein Isoforms , Humans , Genome, Human/genetics , Molecular Sequence Annotation/standards , Molecular Sequence Annotation/trends , Protein Isoforms/genetics , Human Genome Project , Pseudogenes , RNA/genetics
2.
Nucleic Acids Res ; 52(D1): D304-D310, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37986224

ABSTRACT

TarBase is a reference database dedicated to produce, curate and deliver high quality experimentally-supported microRNA (miRNA) targets on protein-coding transcripts. In its latest version (v9.0, https://dianalab.e-ce.uth.gr/tarbasev9), it pushes the envelope by introducing virally-encoded miRNAs, interactions leading to target-directed miRNA degradation (TDMD) events and the largest collection of miRNA-gene interactions to date in a plethora of experimental settings, tissues and cell-types. It catalogues ∼6 million entries, comprising ∼2 million unique miRNA-gene pairs, supported by 37 experimental (high- and low-yield) protocols in 172 tissues and cell-types. Interactions are annotated with rich metadata including information on genes/transcripts, miRNAs, samples, experimental contexts and publications, while millions of miRNA-binding locations are also provided at cell-type resolution. A completely re-designed interface with state-of-the-art web technologies, incorporates more features, and allows flexible and ingenious use. The new interface provides the capability to design sophisticated queries with numerous filtering criteria including cell lines, experimental conditions, cell types, experimental methods, species and/or tissues of interest. Additionally, a plethora of fine-tuning capacities have been integrated to the platform, offering the refinement of the returned interactions based on miRNA confidence and expression levels, while boundless local retrieval of the offered interactions and metadata is enabled.


Subject(s)
Databases, Nucleic Acid , MicroRNAs , Genes, Viral/genetics , Internet , MicroRNAs/genetics , MicroRNAs/metabolism , Animals
3.
Nucleic Acids Res ; 51(W1): W148-W153, 2023 07 05.
Article in English | MEDLINE | ID: mdl-37094027

ABSTRACT

DIANA-microT-CDS is a state-of-the-art miRNA target prediction algorithm catering the scientific community since 2009. It is one of the first algorithms to predict miRNA binding sites in both the 3' Untranslated Region (3'-UTR) and the coding sequence (CDS) of transcripts, with increased performance. Its current version, DIANA-microT 2023 (www.microrna.gr/microt_webserver/), brings forward a significantly updated set of interactions. DIANA-microT-CDS has been executed utilizing annotation information from Ensembl v102, miRBase 22.1 and, for the first time, MirGeneDB 2.1, yielding more than 83 million interactions in human, mouse, rat, chicken, fly and worm species. Additionally, this version delivers predicted interactions of miRNAs encoded from 20 viruses against host transcripts from human, mouse and chicken species. Numerous resources have been interconnected into DIANA-microT, including DIANA-TarBase, plasmiR, HMDD, UCSC, dbSNP, ClinVar, as well as miRNA/gene abundance values for 369 distinct cell-lines/tissues. The server interface has been redesigned allowing users to use smart filtering options, identify abundance patterns of interest, pinpoint known SNPs residing on binding sites and obtain miRNA-disease information. The contents of DIANA-microT webserver are freely accessible and can also be locally downloaded without any login requirements.


Subject(s)
MicroRNAs , Humans , Mice , Rats , Animals , MicroRNAs/metabolism , Software , Algorithms , Binding Sites , Polymorphism, Single Nucleotide , 3' Untranslated Regions/genetics
4.
Nucleic Acids Res ; 51(W1): W154-W159, 2023 07 05.
Article in English | MEDLINE | ID: mdl-37260078

ABSTRACT

DIANA-miRPath is an online miRNA analysis platform harnessing predicted or experimentally supported miRNA interactions towards the exploration of combined miRNA effects. In its latest version (v4.0, http://www.microrna.gr/miRPathv4), DIANA-miRPath breaks new ground by introducing the capacity to tailor its target-based miRNA functional analysis engine to specific biological and/or experimental contexts. Via a redesigned modular interface with rich interaction, annotation and parameterization options, users can now perform enrichment analysis on Gene Ontology (GO) terms, KEGG and REACTOME pathways, sets from Molecular Signatures Database (MSigDB) and PFAM. Included miRNA interaction sets are derived from state-of-the-art resources of experimentally supported (DIANA-TarBase v8.0, miRTarBase and microCLIP cell-type-specific interactions) or from in silico miRNA-target interactions (updated DIANA-microT-CDS and TargetScan predictions). Bulk and single-cell expression datasets from The Cancer Genome Atlas (TCGA), the Genotype-Tissue Expression project (GTEx) and adult/fetal single-cell atlases are integrated and can be used to assess the expression of enriched term components across a wide range of states. A discrete module enabling enrichment analyses using CRISPR knock-out screen datasets enables the detection of selected miRNAs with potentially crucial roles within conditions under study. Notably, the option to upload custom interaction, term, expression and screen sets further expands the versatility of miRPath webserver.


Subject(s)
MicroRNAs , Software , Cell Communication , Databases, Chemical , MicroRNAs/genetics , MicroRNAs/metabolism
5.
Curr Issues Mol Biol ; 46(8): 9047-9073, 2024 Aug 18.
Article in English | MEDLINE | ID: mdl-39194752

ABSTRACT

Tomato (Solanum lycopersicum) is a major horticultural crop of high economic importance. Phelipanche and Orobanche genera (broomrapes) are parasitic weeds, constituting biotic stressors that impact tomato production. Developing varieties with tolerance to broomrapes has become imperative for sustainable agriculture. Solanum pennellii, a wild relative of cultivated tomato, has been utilized as breeding material for S. lycopersicum. In the present study, it is the first time that an in-depth analysis has been conducted for these two specific introgression lines (ILs), IL6-2 and IL6-3 (S. lycopersicum X S. pennellii), which were employed to identify genes and metabolic pathways associated with resistance against broomrape. Comparative transcriptomic analysis revealed a multitude of differentially expressed genes (DEGs) in roots, especially in the resistant genotype IL6-3, several of which were validated by quantitative PCR. DEG and pathway enrichment analysis (PEA) revealed diverse molecular mechanisms that can potentially be implicated in the host's defense response and the establishment of resistance. The identified DEGs were mostly up-regulated in response to broomrape parasitism and play crucial roles in various processes different from strigolactone regulation. Our findings indicate that, in addition to the essential role of strigolactone metabolism, multiple cellular processes may be involved in the tomato's response to broomrapes. The insights gained from this study will enhance our understanding and facilitate molecular breeding methods regarding broomrape parasitism. Moreover, they will assist in developing sustainable strategies and providing alternative solutions for weed management in tomatoes and other agronomically important crops.

6.
Nucleic Acids Res ; 50(D1): D1055-D1061, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34469540

ABSTRACT

microRNAs (miRNAs) are short (∼23nt) single-stranded non-coding RNAs that act as potent post-transcriptional gene expression regulators. Information about miRNA expression and distribution across cell types and tissues is crucial to the understanding of their function and for their translational use as biomarkers or therapeutic targets. DIANA-miTED is the most comprehensive and systematic collection of miRNA expression values derived from the analysis of 15 183 raw human small RNA-Seq (sRNA-Seq) datasets from the Sequence Read Archive (SRA) and The Cancer Genome Atlas (TCGA). Metadata quality maximizes the utility of expression atlases, therefore we manually curated SRA and TCGA-derived information to deliver a comprehensive and standardized set, incorporating in total 199 tissues, 82 anatomical sublocations, 267 cell lines and 261 diseases. miTED offers rich instant visualizations of the expression and sample distributions of requested data across variables, as well as study-wide diagrams and graphs enabling efficient content exploration. Queries also generate links towards state-of-the-art miRNA functional resources, deeming miTED an ideal starting point for expression retrieval, exploration, comparison, and downstream analysis, without requiring bioinformatics support or expertise. DIANA-miTED is freely available at http://www.microrna.gr/mited.


Subject(s)
Databases, Genetic , Databases, Nucleic Acid , MicroRNAs/genetics , Software , Binding Sites/genetics , Gene Expression Regulation/genetics , Genome/genetics , Humans , MicroRNAs/classification , Tissue Distribution/genetics , Transcriptome/genetics
7.
Nucleic Acids Res ; 49(D1): D1328-D1333, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33080028

ABSTRACT

We present Peryton (https://dianalab.e-ce.uth.gr/peryton/), a database of experimentally supported microbe-disease associations. Its first version constitutes a novel resource hosting more than 7900 entries linking 43 diseases with 1396 microorganisms. Peryton's content is exclusively sustained by manual curation of biomedical articles. Diseases and microorganisms are provided in a systematic, standardized manner using reference resources to create database dictionaries. Information about the experimental design, study cohorts and the applied high- or low-throughput techniques is meticulously annotated and catered to users. Several functionalities are provided to enhance user experience and enable ingenious use of Peryton. One or more microorganisms and/or diseases can be queried at the same time. Advanced filtering options and direct text-based filtering of results enable refinement of returned information and the conducting of tailored queries suitable to different research questions. Peryton also provides interactive visualizations to effectively capture different aspects of its content and results can be directly downloaded for local storage and downstream analyses. Peryton will serve as a valuable source, enabling scientists of microbe-related disease fields to form novel hypotheses but, equally importantly, to assist in cross-validation of findings.


Subject(s)
Bacterial Infections/microbiology , Databases, Factual , Gastrointestinal Diseases/microbiology , Host-Pathogen Interactions , Mycoses/microbiology , Neoplasms/microbiology , Neurodegenerative Diseases/microbiology , Bacterial Infections/classification , Bacterial Infections/genetics , Bacterial Infections/pathology , Cohort Studies , Data Mining , Gastrointestinal Diseases/classification , Gastrointestinal Diseases/genetics , Gastrointestinal Diseases/pathology , Humans , Internet , Mycoses/classification , Mycoses/genetics , Mycoses/pathology , Neoplasms/classification , Neoplasms/genetics , Neoplasms/pathology , Neurodegenerative Diseases/classification , Neurodegenerative Diseases/genetics , Neurodegenerative Diseases/pathology , Research Design , Software
8.
BMC Bioinformatics ; 23(Suppl 2): 395, 2022 Dec 12.
Article in English | MEDLINE | ID: mdl-36510136

ABSTRACT

BACKGROUND: The widespread usage of Cap Analysis of Gene Expression (CAGE) has led to numerous breakthroughs in understanding the transcription mechanisms. Recent evidence in the literature, however, suggests that CAGE suffers from transcriptional and technical noise. Regardless of the sample quality, there is a significant number of CAGE peaks that are not associated with transcription initiation events. This type of signal is typically attributed to technical noise and more frequently to random five-prime capping or transcription bioproducts. Thus, the need for computational methods emerges, that can accurately increase the signal-to-noise ratio in CAGE data, resulting in error-free transcription start site (TSS) annotation and quantification of regulatory region usage. In this study, we present DeepTSS, a novel computational method for processing CAGE samples, that combines genomic signal processing (GSP), structural DNA features, evolutionary conservation evidence and raw DNA sequence with Deep Learning (DL) to provide single-nucleotide TSS predictions with unprecedented levels of performance. RESULTS: To evaluate DeepTSS, we utilized experimental data, protein-coding gene annotations and computationally-derived genome segmentations by chromatin states. DeepTSS was found to outperform existing algorithms on all benchmarks, achieving 98% precision and 96% sensitivity (accuracy 95.4%) on the protein-coding gene strategy, with 96.66% of its positive predictions overlapping active chromatin, 98.27% and 92.04% co-localized with at least one transcription factor and H3K4me3 peak. CONCLUSIONS: CAGE is a key protocol in deciphering the language of transcription, however, as every experimental protocol, it suffers from biological and technical noise that can severely affect downstream analyses. DeepTSS is a novel DL-based method for effectively removing noisy CAGE signal. In contrast to existing software, DeepTSS does not require feature selection since the embedded convolutional layers can readily identify patterns and only utilize the important ones for the classification task. This study highlights the key role that DL can play in Molecular Biology, by removing the inherent flaws of experimental protocols, that form the backbone of contemporary research. Here, we show how DeepTSS can unleash the full potential of an already popular and mature method such as CAGE, and push the boundaries of coding and non-coding gene expression regulator research even further.


Subject(s)
Neural Networks, Computer , Software , Transcription Initiation Site , Promoter Regions, Genetic , Chromatin
9.
Nucleic Acids Res ; 48(D1): D101-D110, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31732741

ABSTRACT

DIANA-LncBase v3.0 (www.microrna.gr/LncBase) is a reference repository with experimentally supported miRNA targets on non-coding transcripts. Its third version provides approximately half a million entries, corresponding to ∼240 000 unique tissue and cell type specific miRNA-lncRNA pairs. This compilation of interactions is derived from the manual curation of publications and the analysis of >300 high-throughput datasets. miRNA targets are supported by 14 experimental methodologies, applied to 243 distinct cell types and tissues in human and mouse. The largest part of the database is highly confident, AGO-CLIP-derived miRNA-binding events. LncBase v3.0 is the first relevant database to employ a robust CLIP-Seq-guided algorithm, microCLIP framework, to analyze 236 AGO-CLIP-Seq libraries and catalogue ∼370 000 miRNA binding events. The database was redesigned from the ground up, providing new functionalities. Known short variant information, on >67,000 experimentally supported target sites and lncRNA expression profiles in different cellular compartments are catered to users. Interactive visualization plots, portraying correlations of miRNA-lncRNA pairs, as well as lncRNA expression profiles in a wide range of cell types and tissues, are presented for the first time through a dedicated page. LncBase v3.0 constitutes a valuable asset for ncRNA research, providing new insights to the understanding of the still widely unexplored lncRNA functions.


Subject(s)
Computational Biology/methods , Databases, Nucleic Acid , MicroRNAs/genetics , RNA Interference , RNA, Untranslated/genetics , Software , Gene Expression Regulation , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Sequence Analysis, RNA
10.
Nucleic Acids Res ; 46(D1): D239-D245, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29156006

ABSTRACT

DIANA-TarBase v8 (http://www.microrna.gr/tarbase) is a reference database devoted to the indexing of experimentally supported microRNA (miRNA) targets. Its eighth version is the first database indexing >1 million entries, corresponding to ∼670 000 unique miRNA-target pairs. The interactions are supported by >33 experimental methodologies, applied to ∼600 cell types/tissues under ∼451 experimental conditions. It integrates information on cell-type specific miRNA-gene regulation, while hundreds of thousands of miRNA-binding locations are reported. TarBase is coming of age, with more than a decade of continuous support in the non-coding RNA field. A new module has been implemented that enables the browsing of interactions through different filtering combinations. It permits easy retrieval of positive and negative miRNA targets per species, methodology, cell type and tissue. An incorporated ranking system is utilized for the display of interactions based on the robustness of their supporting methodologies. Statistics, pie-charts and interactive bar-plots depicting the database content are available through a dedicated result page. An intuitive interface is introduced, providing a user-friendly application with flexible options to different queries.


Subject(s)
Databases, Nucleic Acid , Epistasis, Genetic , MicroRNAs/genetics , MicroRNAs/metabolism , Animals , High-Throughput Nucleotide Sequencing , Humans , Oligonucleotide Array Sequence Analysis , Sequence Analysis, RNA , User-Computer Interface
11.
Proc Natl Acad Sci U S A ; 114(18): E3679-E3688, 2017 05 02.
Article in English | MEDLINE | ID: mdl-28416701

ABSTRACT

α-Synuclein (αSyn) is the major gene linked to sporadic Parkinson's disease (PD), whereas the G209A (p.A53T) αSyn mutation causes a familial form of PD characterized by early onset and a generally severe phenotype, including nonmotor manifestations. Here we generated de novo induced pluripotent stem cells (iPSCs) from patients harboring the p.A53T mutation and developed a robust model that captures PD pathogenic processes under basal conditions. iPSC-derived mutant neurons displayed novel disease-relevant phenotypes, including protein aggregation, compromised neuritic outgrowth, and contorted or fragmented axons with swollen varicosities containing αSyn and Tau. The identified neuropathological features closely resembled those in brains of p.A53T patients. Small molecules targeting αSyn reverted the degenerative phenotype under both basal and induced stress conditions, indicating a treatment strategy for PD and other synucleinopathies. Furthermore, mutant neurons showed disrupted synaptic connectivity and widespread transcriptional alterations in genes involved in synaptic signaling, a number of which have been previously linked to mental disorders, raising intriguing implications for potentially converging disease mechanisms.


Subject(s)
Axons/metabolism , Induced Pluripotent Stem Cells/metabolism , Models, Biological , Mutation, Missense , Parkinson Disease/metabolism , Polyneuropathies/metabolism , Synaptic Transmission , alpha-Synuclein/metabolism , Amino Acid Substitution , Axons/pathology , Humans , Induced Pluripotent Stem Cells/pathology , Parkinson Disease/genetics , Parkinson Disease/pathology , Polyneuropathies/genetics , Polyneuropathies/pathology , alpha-Synuclein/genetics
12.
Nucleic Acids Res ; 45(D1): D128-D134, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27794554

ABSTRACT

RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. The website has been subject to continuous improvements focusing on text and sequence similarity searches as well as genome browsing functionality. All RNAcentral data is provided for free and is available for browsing, bulk downloads, and programmatic access at http://rnacentral.org/.


Subject(s)
Databases, Nucleic Acid , RNA, Untranslated/chemistry , Animals , Genomics , Humans , Nucleotides/chemistry , Sequence Analysis, RNA , Species Specificity
13.
Nucleic Acids Res ; 44(D1): D190-5, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26586797

ABSTRACT

microRNAs (miRNAs) are small non-coding RNAs that actively fine-tune gene expression. The accurate characterization of the mechanisms underlying miRNA transcription regulation will further expand our knowledge regarding their implication in homeostatic and pathobiological networks. Aim of DIANA-miRGen v3.0 (http://www.microrna.gr/mirgen) is to provide for the first time accurate cell-line-specific miRNA gene transcription start sites (TSSs), coupled with genome-wide maps of transcription factor (TF) binding sites in order to unveil the mechanisms of miRNA transcription regulation. To this end, more than 7.3 billion RNA-, ChIP- and DNase-Seq next generation sequencing reads were analyzed/assembled and combined with state-of-the-art miRNA TSS prediction and TF binding site identification algorithms. The new database schema and web interface facilitates user interaction, provides advanced queries and innate connection with other DIANA resources for miRNA target identification and pathway analysis. The database currently supports 276 miRNA TSSs that correspond to 428 precursors and >19M binding sites of 202 TFs on a genome-wide scale in nine cell-lines and six tissues of Homo sapiens and Mus musculus.


Subject(s)
Databases, Nucleic Acid , MicroRNAs/genetics , Promoter Regions, Genetic , Animals , Binding Sites , Cell Line , Gene Expression Regulation , Humans , Mice , Transcription Factors/metabolism , Transcription Initiation Site
14.
Nucleic Acids Res ; 44(D1): D231-8, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26612864

ABSTRACT

microRNAs (miRNAs) are short non-coding RNAs (ncRNAs) that act as post-transcriptional regulators of coding gene expression. Long non-coding RNAs (lncRNAs) have been recently reported to interact with miRNAs. The sponge-like function of lncRNAs introduces an extra layer of complexity in the miRNA interactome. DIANA-LncBase v1 provided a database of experimentally supported and in silico predicted miRNA Recognition Elements (MREs) on lncRNAs. The second version of LncBase (www.microrna.gr/LncBase) presents an extensive collection of miRNA:lncRNA interactions. The significantly enhanced database includes more than 70 000 low and high-throughput, (in)direct miRNA:lncRNA experimentally supported interactions, derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries. The new experimental module presents a 14-fold increase compared to the previous release. LncBase v2 hosts in silico predicted miRNA targets on lncRNAs, identified with the DIANA-microT algorithm. The relevant module provides millions of predicted miRNA binding sites, accompanied with detailed metadata and MRE conservation metrics. LncBase v2 caters information regarding cell type specific miRNA:lncRNA regulation and enables users to easily identify interactions in 66 different cell types, spanning 36 tissues for human and mouse. Database entries are also supported by accurate lncRNA expression information, derived from the analysis of more than 6 billion RNA-Seq reads.


Subject(s)
Databases, Nucleic Acid , MicroRNAs/metabolism , RNA, Long Noncoding/metabolism , Abstracting and Indexing , Animals , Binding Sites , Humans , Mice , MicroRNAs/chemistry , RNA, Long Noncoding/chemistry
15.
Nucleic Acids Res ; 44(W1): W128-34, 2016 07 08.
Article in English | MEDLINE | ID: mdl-27207881

ABSTRACT

Differential expression analysis (DEA) is one of the main instruments utilized for revealing molecular mechanisms in pathological and physiological conditions. DIANA-mirExTra v2.0 (http://www.microrna.gr/mirextrav2) performs a combined DEA of mRNAs and microRNAs (miRNAs) to uncover miRNAs and transcription factors (TFs) playing important regulatory roles between two investigated states. The web server uses as input miRNA/RNA-Seq read count data sets that can be uploaded for analysis. Users can combine their data with 350 small-RNA-Seq and 65 RNA-Seq in-house analyzed libraries which are provided by DIANA-mirExTra v2.0.The web server utilizes miRNA:mRNA, TF:mRNA and TF:miRNA interactions derived from extensive experimental data sets. More than 450 000 miRNA interactions and 2 000 000 TF binding sites from specific or high-throughput techniques have been incorporated, while accurate miRNA TSS annotation is obtained from microTSS experimental/in silico framework. These comprehensive data sets enable users to perform analyses based solely on experimentally supported information and to uncover central regulators within sequencing data: miRNAs controlling mRNAs and TFs regulating mRNA or miRNA expression. The server also supports predicted miRNA:gene interactions from DIANA-microT-CDS for 4 species (human, mouse, nematode and fruit fly). DIANA-mirExTra v2.0 has an intuitive user interface and is freely available to all users without any login requirement.


Subject(s)
Caenorhabditis elegans/genetics , Drosophila melanogaster/genetics , MicroRNAs/genetics , RNA, Messenger/genetics , Software , Transcription Factors/genetics , Transcription, Genetic , Animals , Binding Sites , Caenorhabditis elegans/metabolism , Drosophila melanogaster/metabolism , Gene Expression Profiling , Gene Expression Regulation , Humans , Internet , Mice , MicroRNAs/metabolism , Molecular Sequence Annotation , Protein Binding , RNA, Messenger/metabolism , Sequence Analysis, RNA , Signal Transduction , Transcription Factors/metabolism
16.
Nucleic Acids Res ; 43(W1): W460-6, 2015 Jul 01.
Article in English | MEDLINE | ID: mdl-25977294

ABSTRACT

The functional characterization of miRNAs is still an open challenge. Here, we present DIANA-miRPath v3.0 (http://www.microrna.gr/miRPathv3) an online software suite dedicated to the assessment of miRNA regulatory roles and the identification of controlled pathways. The new miRPath web server renders possible the functional annotation of one or more miRNAs using standard (hypergeometric distributions), unbiased empirical distributions and/or meta-analysis statistics. DIANA-miRPath v3.0 database and functionality have been significantly extended to support all analyses for KEGG molecular pathways, as well as multiple slices of Gene Ontology (GO) in seven species (Homo sapiens, Mus musculus, Rattus norvegicus, Drosophila melanogaster, Caenorhabditis elegans, Gallus gallus and Danio rerio). Importantly, more than 600 000 experimentally supported miRNA targets from DIANA-TarBase v7.0 have been incorporated into the new schema. Users of DIANA-miRPath v3.0 can harness this wealth of information and substitute or combine the available in silico predicted targets from DIANA-microT-CDS and/or TargetScan v6.2 with high quality experimentally supported interactions. A unique feature of DIANA-miRPath v3.0 is its redesigned Reverse Search module, which enables users to identify and visualize miRNAs significantly controlling selected pathways or belonging to specific GO categories based on in silico or experimental data. DIANA-miRPath v3.0 is freely available to all users without any login requirement.


Subject(s)
MicroRNAs/metabolism , Software , Algorithms , Animals , Computer Simulation , Humans , Internet , Mice , MicroRNAs/genetics , MicroRNAs/physiology , Molecular Sequence Annotation , Rats
17.
Nucleic Acids Res ; 43(Database issue): D153-9, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25416803

ABSTRACT

microRNAs (miRNAs) are short non-coding RNA species, which act as potent gene expression regulators. Accurate identification of miRNA targets is crucial to understanding their function. Currently, hundreds of thousands of miRNA:gene interactions have been experimentally identified. However, this wealth of information is fragmented and hidden in thousands of manuscripts and raw next-generation sequencing data sets. DIANA-TarBase was initially released in 2006 and it was the first database aiming to catalog published experimentally validated miRNA:gene interactions. DIANA-TarBase v7.0 (http://www.microrna.gr/tarbase) aims to provide for the first time hundreds of thousands of high-quality manually curated experimentally validated miRNA:gene interactions, enhanced with detailed meta-data. DIANA-TarBase v7.0 enables users to easily identify positive or negative experimental results, the utilized experimental methodology, experimental conditions including cell/tissue type and treatment. The new interface provides also advanced information ranging from the binding site location, as identified experimentally as well as in silico, to the primer sequences used for cloning experiments. More than half a million miRNA:gene interactions have been curated from published experiments on 356 different cell types from 24 species, corresponding to 9- to 250-fold more entries than any other relevant database. DIANA-TarBase v7.0 is freely available.


Subject(s)
Databases, Nucleic Acid , MicroRNAs/metabolism , RNA, Messenger/metabolism , Abstracting and Indexing , Binding Sites , Data Mining , Internet , User-Computer Interface
18.
Nucleic Acids Res ; 41(Database issue): D239-45, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23193281

ABSTRACT

Recently, the attention of the research community has been focused on long non-coding RNAs (lncRNAs) and their physiological/pathological implications. As the number of experiments increase in a rapid rate and transcriptional units are better annotated, databases indexing lncRNA properties and function gradually become essential tools to this process. Aim of DIANA-LncBase (www.microrna.gr/LncBase) is to reinforce researchers' attempts and unravel microRNA (miRNA)-lncRNA putative functional interactions. This study provides, for the first time, a comprehensive annotation of miRNA targets on lncRNAs. DIANA-LncBase hosts transcriptome-wide experimentally verified and computationally predicted miRNA recognition elements (MREs) on human and mouse lncRNAs. The analysis performed includes an integration of most of the available lncRNA resources, relevant high-throughput HITS-CLIP and PAR-CLIP experimental data as well as state-of-the-art in silico target predictions. The experimentally supported entries available in DIANA-LncBase correspond to >5000 interactions, while the computationally predicted interactions exceed 10 million. DIANA-LncBase hosts detailed information for each miRNA-lncRNA pair, such as external links, graphic plots of transcripts' genomic location, representation of the binding sites, lncRNA tissue expression as well as MREs conservation and prediction scores.


Subject(s)
Databases, Nucleic Acid , MicroRNAs/chemistry , MicroRNAs/metabolism , RNA, Long Noncoding/chemistry , RNA, Long Noncoding/metabolism , Animals , Binding Sites , Computational Biology , High-Throughput Nucleotide Sequencing , Humans , Internet , Mice , Sequence Analysis, RNA
19.
Mol Cancer ; 13: 28, 2014 Feb 11.
Article in English | MEDLINE | ID: mdl-24517586

ABSTRACT

BACKGROUND: In mammalians, HIF is a master regulator of hypoxia gene expression through direct binding to DNA, while its role in microRNA expression regulation, critical in the hypoxia response, is not elucidated genome wide. Our aim is to investigate in depth the regulation of microRNA expression by hypoxia in the breast cancer cell line MCF-7, establish the relationship between microRNA expression and HIF binding sites, pri-miRNA transcription and microRNA processing gene expression. METHODS: MCF-7 cells were incubated at 1% Oxygen for 16, 32 and 48 h. SiRNA against HIF-1α and HIF-2α were performed as previously published. MicroRNA and mRNA expression were assessed using microRNA microarrays, small RNA sequencing, gene expression microarrays and Real time PCR. The Kraken pipeline was applied for microRNA-seq analysis along with Bioconductor packages. Microarray data was analysed using Limma (Bioconductor), ChIP-seq data were analysed using Gene Set Enrichment Analysis and multiple testing correction applied in all analyses. RESULTS: Hypoxia time course microRNA sequencing data analysis identified 41 microRNAs significantly up- and 28 down-regulated, including hsa-miR-4521, hsa-miR-145-3p and hsa-miR-222-5p reported in conjunction with hypoxia for the first time. Integration of HIF-1α and HIF-2α ChIP-seq data with expression data showed overall association between binding sites and microRNA up-regulation, with hsa-miR-210-3p and microRNAs of miR-27a/23a/24-2 and miR-30b/30d clusters as predominant examples. Moreover the expression of hsa-miR-27a-3p and hsa-miR-24-3p was found positively associated to a hypoxia gene signature in breast cancer. Gene expression analysis showed no full coordination between pri-miRNA and microRNA expression, pointing towards additional levels of regulation. Several transcripts involved in microRNA processing were found regulated by hypoxia, of which DICER (down-regulated) and AGO4 (up-regulated) were HIF dependent. DICER expression was found inversely correlated to hypoxia in breast cancer. CONCLUSIONS: Integrated analysis of microRNA, mRNA and ChIP-seq data in a model cell line supports the hypothesis that microRNA expression under hypoxia is regulated at transcriptional and post-transcriptional level, with the presence of HIF binding sites at microRNA genomic loci associated with up-regulation. The identification of hypoxia and HIF regulated microRNAs relevant for breast cancer is important for our understanding of disease development and design of therapeutic interventions.


Subject(s)
Breast Neoplasms/genetics , Gene Expression Regulation, Neoplastic/genetics , Hypoxia-Inducible Factor 1/genetics , MicroRNAs/analysis , RNA, Messenger/analysis , Breast Neoplasms/metabolism , Cell Hypoxia/genetics , Humans , Hypoxia-Inducible Factor 1/metabolism , MCF-7 Cells , Oligonucleotide Array Sequence Analysis , Protein Binding , Real-Time Polymerase Chain Reaction , Transcriptome
SELECTION OF CITATIONS
SEARCH DETAIL