Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27
Filter
Add more filters










Publication year range
1.
PLoS One ; 19(5): e0295971, 2024.
Article in English | MEDLINE | ID: mdl-38709794

ABSTRACT

The human genome is pervasively transcribed and produces a wide variety of long non-coding RNAs (lncRNAs), constituting the majority of transcripts across human cell types. Some specific nuclear lncRNAs have been shown to be important regulatory components acting locally. As RNA-chromatin interaction and Hi-C chromatin conformation data showed that chromatin interactions of nuclear lncRNAs are determined by the local chromatin 3D conformation, we used Hi-C data to identify potential target genes of lncRNAs. RNA-protein interaction data suggested that nuclear lncRNAs act as scaffolds to recruit regulatory proteins to target promoters and enhancers. Nuclear lncRNAs may therefore play a role in directing regulatory factors to locations spatially close to the lncRNA gene. We provide the analysis results through an interactive visualization web portal at https://fantom.gsc.riken.jp/zenbu/reports/#F6_3D_lncRNA.


Subject(s)
Chromatin , RNA, Long Noncoding , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , Chromatin/metabolism , Chromatin/genetics , Humans , Molecular Sequence Annotation , Cell Nucleus/metabolism , Cell Nucleus/genetics , Genome, Human , Promoter Regions, Genetic
3.
BMC Genom Data ; 22(1): 33, 2021 09 14.
Article in English | MEDLINE | ID: mdl-34521352

ABSTRACT

BACKGROUND: The lymphatic and the blood vasculature are closely related systems that collaborate to ensure the organism's physiological function. Despite their common developmental origin, they present distinct functional fates in adulthood that rely on robust lineage-specific regulatory programs. The recent technological boost in sequencing approaches unveiled long noncoding RNAs (lncRNAs) as prominent regulatory players of various gene expression levels in a cell-type-specific manner. RESULTS: To investigate the potential roles of lncRNAs in vascular biology, we performed antisense oligonucleotide (ASO) knockdowns of lncRNA candidates specifically expressed either in human lymphatic or blood vascular endothelial cells (LECs or BECs) followed by Cap Analysis of Gene Expression (CAGE-Seq). Here, we describe the quality control steps adopted in our analysis pipeline before determining the knockdown effects of three ASOs per lncRNA target on the LEC or BEC transcriptomes. In this regard, we especially observed that the choice of negative control ASOs can dramatically impact the conclusions drawn from the analysis depending on the cellular background. CONCLUSION: In conclusion, the comparison of negative control ASO effects on the targeted cell type transcriptomes highlights the essential need to select a proper control set of multiple negative control ASO based on the investigated cell types.


Subject(s)
Gene Knockdown Techniques/methods , Oligonucleotides, Antisense/genetics , Organ Specificity/genetics , RNA, Long Noncoding/genetics , Adult , Endothelial Cells/metabolism , Gene Knockdown Techniques/standards , Humans , Lymphatic System/cytology , Lymphatic System/metabolism , Oligonucleotides, Antisense/standards , Transcriptome
4.
Nat Commun ; 12(1): 3297, 2021 06 02.
Article in English | MEDLINE | ID: mdl-34078885

ABSTRACT

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism.


Subject(s)
Microsatellite Repeats , Neural Networks, Computer , Neurodegenerative Diseases/genetics , Transcription Initiation Site , Transcription Initiation, Genetic , A549 Cells , Animals , Base Sequence , Computational Biology/methods , Deep Learning , Enhancer Elements, Genetic , Genome, Human , High-Throughput Nucleotide Sequencing , Humans , Mice , Neurodegenerative Diseases/diagnosis , Neurodegenerative Diseases/metabolism , Polymorphism, Genetic , Promoter Regions, Genetic
5.
Nat Commun ; 12(1): 925, 2021 02 10.
Article in English | MEDLINE | ID: mdl-33568674

ABSTRACT

Recent studies have revealed the importance of long noncoding RNAs (lncRNAs) as tissue-specific regulators of gene expression. There is ample evidence that distinct types of vasculature undergo tight transcriptional control to preserve their structure, identity, and functions. We determine a comprehensive map of lineage-specific lncRNAs in human dermal lymphatic and blood vascular endothelial cells (LECs and BECs), combining RNA-Seq and CAGE-Seq. Subsequent antisense oligonucleotide-knockdown transcriptomic profiling of two LEC- and two BEC-specific lncRNAs identifies LETR1 as a critical gatekeeper of the global LEC transcriptome. Deep RNA-DNA, RNA-protein interaction studies, and phenotype rescue analyses reveal that LETR1 is a nuclear trans-acting lncRNA modulating, via key epigenetic factors, the expression of essential target genes, including KLF4 and SEMA3C, governing the growth and migratory ability of LECs. Together, our study provides several lines of evidence supporting the intriguing concept that every cell type expresses precise lncRNA signatures to control lineage-specific regulatory programs.


Subject(s)
Endothelial Cells/cytology , Kruppel-Like Transcription Factors/metabolism , Semaphorins/metabolism , Cell Movement , Cell Proliferation , Endothelial Cells/metabolism , Gene Expression Regulation , Humans , Kruppel-Like Factor 4 , Kruppel-Like Transcription Factors/genetics , RNA, Long Noncoding , Semaphorins/genetics
6.
Genome Res ; 30(7): 951-961, 2020 07.
Article in English | MEDLINE | ID: mdl-32718981

ABSTRACT

Gene expression profiles in homologous tissues have been observed to be different between species, which may be due to differences between species in the gene expression program in each cell type, but may also reflect differences in cell type composition of each tissue in different species. Here, we compare expression profiles in matching primary cells in human, mouse, rat, dog, and chicken using Cap Analysis Gene Expression (CAGE) and short RNA (sRNA) sequencing data from FANTOM5. While we find that expression profiles of orthologous genes in different species are highly correlated across cell types, in each cell type many genes were differentially expressed between species. Expression of genes with products involved in transcription, RNA processing, and transcriptional regulation was more likely to be conserved, while expression of genes encoding proteins involved in intercellular communication was more likely to have diverged during evolution. Conservation of expression correlated positively with the evolutionary age of genes, suggesting that divergence in expression levels of genes critical for cell function was restricted during evolution. Motif activity analysis showed that both promoters and enhancers are activated by the same transcription factors in different species. An analysis of expression levels of mature miRNAs and of primary miRNAs identified by CAGE revealed that evolutionary old miRNAs are more likely to have conserved expression patterns than young miRNAs. We conclude that key aspects of the regulatory network are conserved, while differential expression of genes involved in cell-to-cell communication may contribute greatly to phenotypic differences between species.


Subject(s)
Evolution, Molecular , Transcriptome , Animals , Chickens/genetics , Dogs , Gene Expression Profiling , Gene Regulatory Networks , Humans , Mice , MicroRNAs/metabolism , Nucleotide Motifs , Principal Component Analysis , Promoter Regions, Genetic , Rats , Species Specificity , Transcription Factors/metabolism
7.
Genome Res ; 30(7): 1073-1081, 2020 07.
Article in English | MEDLINE | ID: mdl-32079618

ABSTRACT

Long noncoding RNAs (lncRNAs) have emerged as key coordinators of biological and cellular processes. Characterizing lncRNA expression across cells and tissues is key to understanding their role in determining phenotypes, including human diseases. We present here FC-R2, a comprehensive expression atlas across a broadly defined human transcriptome, inclusive of over 109,000 coding and noncoding genes, as described in the FANTOM CAGE-Associated Transcriptome (FANTOM-CAT) study. This atlas greatly extends the gene annotation used in the original recount2 resource. We demonstrate the utility of the FC-R2 atlas by reproducing key findings from published large studies and by generating new results across normal and diseased human samples. In particular, we (a) identify tissue-specific transcription profiles for distinct classes of coding and noncoding genes, (b) perform differential expression analysis across thirteen cancer types, identifying novel noncoding genes potentially involved in tumor pathogenesis and progression, and (c) confirm the prognostic value for several enhancer lncRNAs expression in cancer. Our resource is instrumental for the systematic molecular characterization of lncRNA by the FANTOM6 Consortium. In conclusion, comprised of over 70,000 samples, the FC-R2 atlas will empower other researchers to investigate functions and biological roles of both known coding genes and novel lncRNAs.


Subject(s)
Transcriptome , Databases, Genetic , Enhancer Elements, Genetic , Gene Expression Profiling , Genome, Human , Humans , Neoplasms/genetics , Organ Specificity , Prognosis , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , RNA, Messenger/metabolism
8.
Nature ; 543(7644): 199-204, 2017 03 09.
Article in English | MEDLINE | ID: mdl-28241135

ABSTRACT

Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.


Subject(s)
Databases, Genetic , RNA, Long Noncoding/chemistry , RNA, Long Noncoding/genetics , Transcriptome/genetics , Cells, Cultured , Conserved Sequence/genetics , Datasets as Topic , Enhancer Elements, Genetic/genetics , Epigenesis, Genetic , Gene Expression Profiling , Gene Expression Regulation , Genome, Human/genetics , Genome-Wide Association Study , Genomics , Humans , Internet , Molecular Sequence Annotation , Organ Specificity/genetics , Polymorphism, Single Nucleotide , Promoter Regions, Genetic/genetics , Quantitative Trait Loci/genetics , RNA Stability , RNA, Messenger/genetics
9.
Article in English | MEDLINE | ID: mdl-28025337

ABSTRACT

Genomics consortia have produced large datasets profiling the expression of genes, micro-RNAs, enhancers and more across human tissues or cells. There is a need for intuitive tools to select subsets of such data that is the most relevant for specific studies. To this end, we present SlideBase, a web tool which offers a new way of selecting genes, promoters, enhancers and microRNAs that are preferentially expressed/used in a specified set of cells/tissues, based on the use of interactive sliders. With the help of sliders, SlideBase enables users to define custom expression thresholds for individual cell types/tissues, producing sets of genes, enhancers etc. which satisfy these constraints. Changes in slider settings result in simultaneous changes in the selected sets, updated in real time. SlideBase is linked to major databases from genomics consortia, including FANTOM, GTEx, The Human Protein Atlas and BioGPS.Database URL: http://slidebase.binf.ku.dk.


Subject(s)
Databases, Genetic , Enhancer Elements, Genetic , Genome, Human , MicroRNAs/genetics , Proteins/genetics , Sequence Analysis, DNA/methods , Human Genome Project , Humans
10.
Nucleic Acids Res ; 44(7): 3233-52, 2016 Apr 20.
Article in English | MEDLINE | ID: mdl-27001520

ABSTRACT

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs.


Subject(s)
RNA, Long Noncoding/genetics , Cell Nucleus/genetics , Embryonic Development/genetics , Gene Expression Regulation , Humans , Insulator Elements , Molecular Sequence Annotation , Promoter Regions, Genetic , RNA, Long Noncoding/classification , RNA, Long Noncoding/metabolism , Retroviridae/genetics , Systems Biology , Terminal Repeat Sequences , Transcription Factors/metabolism
11.
PLoS One ; 11(2): e0146950, 2016.
Article in English | MEDLINE | ID: mdl-26863543

ABSTRACT

Analytical PCR experiments preferably use internal probes for monitoring the amplification reaction and specific detection of the amplicon. Such internal probes have to be designed in close context with the amplification primers, and may require additional considerations for the detection of genetic variations. Here we describe Edesign, a new online and stand-alone tool for designing sets of PCR primers together with an internal probe for conducting quantitative real-time PCR (qPCR) and genotypic experiments. Edesign can be used for selecting standard DNA oligonucleotides like for instance TaqMan probes, but has been further extended with new functions and enhanced design features for Eprobes. Eprobes, with their single thiazole orange-labelled nucleotide, allow for highly sensitive genotypic assays because of their higher DNA binding affinity as compared to standard DNA oligonucleotides. Using new thermodynamic parameters, Edesign considers unique features of Eprobes during primer and probe design for establishing qPCR experiments and genotyping by melting curve analysis. Additional functions in Edesign allow probe design for effective discrimination between wild-type sequences and genetic variations either using standard DNA oligonucleotides or Eprobes. Edesign can be freely accessed online at http://www.dnaform.com/edesign2/, and the source code is available for download.


Subject(s)
DNA Primers/genetics , Genotyping Techniques/methods , Real-Time Polymerase Chain Reaction/methods , Base Pair Mismatch , Oligonucleotide Probes/genetics , Taq Polymerase/metabolism
12.
Nucleic Acids Res ; 43(14): 6969-82, 2015 Aug 18.
Article in English | MEDLINE | ID: mdl-26117544

ABSTRACT

Classically or alternatively activated macrophages (M1 and M2, respectively) play distinct and important roles for microbiocidal activity, regulation of inflammation and tissue homeostasis. Despite this, their transcriptional regulatory dynamics are poorly understood. Using promoter-level expression profiling by non-biased deepCAGE we have studied the transcriptional dynamics of classically and alternatively activated macrophages. Transcription factor (TF) binding motif activity analysis revealed four motifs, NFKB1_REL_RELA, IRF1,2, IRF7 and TBP that are commonly activated but have distinct activity dynamics in M1 and M2 activation. We observe matching changes in the expression profiles of the corresponding TFs and show that only a restricted set of TFs change expression. There is an overall drastic and transient up-regulation in M1 and a weaker and more sustainable up-regulation in M2. Novel TFs, such as Thap6, Maff, (M1) and Hivep1, Nfil3, Prdm1, (M2) among others, were suggested to be involved in the activation processes. Additionally, 52 (M1) and 67 (M2) novel differentially expressed genes and, for the first time, several differentially expressed long non-coding RNA (lncRNA) transcriptome markers were identified. In conclusion, the finding of novel motifs, TFs and protein-coding and lncRNA genes is an important step forward to fully understand the transcriptional machinery of macrophage activation.


Subject(s)
Gene Expression Regulation , Macrophage Activation/genetics , Macrophages/metabolism , Transcriptome , Animals , Cells, Cultured , DNA/chemistry , Gene Expression Profiling , High-Throughput Nucleotide Sequencing , Interferon-gamma/pharmacology , Interleukin-13/pharmacology , Interleukin-4/pharmacology , Macrophages/drug effects , Male , Mice, Inbred BALB C , Nucleotide Motifs , Promoter Regions, Genetic , Sequence Analysis, DNA , Transcription Factors/metabolism
13.
J Immunol ; 194(12): 6035-44, 2015 Jun 15.
Article in English | MEDLINE | ID: mdl-25957166

ABSTRACT

Basic leucine zipper transcription factor Batf2 is poorly described, whereas Batf and Batf3 have been shown to play essential roles in dendritic cell, T cell, and B cell development and regulation. Batf2 was drastically induced in IFN-γ-activated classical macrophages (M1) compared with unstimulated or IL-4-activated alternative macrophages (M2). Batf2 knockdown experiments from IFN-γ-activated macrophages and subsequent expression profiling demonstrated important roles for regulation of immune responses, inducing inflammatory and host-protective genes Tnf, Ccl5, and Nos2. Mycobacterium tuberculosis (Beijing strain HN878)-infected macrophages further induced Batf2 and augmented host-protective Batf2-dependent genes, particularly in M1, whose mechanism was suggested to be mediated through both TLR2 and TLR4 by LPS and heat-killed HN878 (HKTB) stimulation experiments. Irf1 binding motif was enriched in the promoters of Batf2-regulated genes. Coimmunoprecipitation study demonstrated Batf2 association with Irf1. Furthermore, Irf1 knockdown showed downregulation of IFN-γ- or LPS/HKTB-activated host-protective genes Tnf, Ccl5, Il12b, and Nos2. Conclusively, Batf2 is an activation marker gene for M1 involved in gene regulation of IFN-γ-activated classical macrophages, as well as LPS/HKTB-induced macrophage stimulation, possibly by Batf2/Irf1 gene induction. Taken together, these results underline the role of Batf2/Irf1 in inducing inflammatory responses in M. tuberculosis infection.


Subject(s)
Basic-Leucine Zipper Transcription Factors/genetics , Interferon Regulatory Factor-1/genetics , Macrophages/immunology , Macrophages/metabolism , Mycobacterium Infections/genetics , Mycobacterium Infections/immunology , Mycobacterium/immunology , Animals , Basic-Leucine Zipper Transcription Factors/metabolism , Cluster Analysis , Disease Models, Animal , Gene Expression , Gene Expression Profiling , Gene Expression Regulation/drug effects , Gene Knockdown Techniques , Interferon Regulatory Factor-1/metabolism , Interferon-gamma/pharmacology , Lipopolysaccharides/immunology , Macrophage Activation/immunology , Male , Mice , Mycobacterium Infections/metabolism , Nitric Oxide Synthase Type II/genetics , Nitric Oxide Synthase Type II/metabolism , Protein Binding , Tumor Necrosis Factors/genetics , Tumor Necrosis Factors/metabolism
14.
J Allergy Clin Immunol ; 136(3): 638-48, 2015 Sep.
Article in English | MEDLINE | ID: mdl-25863981

ABSTRACT

BACKGROUND: Children with problematic severe asthma have poor disease control despite high doses of inhaled corticosteroids and additional therapy, leading to personal suffering, early deterioration of lung function, and significant consumption of health care resources. If no exacerbating factors, such as smoking or allergies, are found after extensive investigation, these children are given a diagnosis of therapy-resistant (or therapy-refractory) asthma (SA). OBJECTIVE: We sought to deepen our understanding of childhood SA by analyzing gene expression and modeling the underlying regulatory transcription factor networks in peripheral blood leukocytes. METHODS: Gene expression was analyzed by using Cap Analysis of Gene Expression in children with SA (n = 13), children with controlled persistent asthma (n = 15), and age-matched healthy control subjects (n = 9). Cap Analysis of Gene Expression sequencing detects the transcription start sites of known and novel mRNAs and noncoding RNAs. RESULTS: Sample groups could be separated by hierarchical clustering on 1305 differentially expressed transcription start sites, including 816 known genes and several novel transcripts. Ten of 13 tested novel transcripts were validated by means of RT-PCR and Sanger sequencing. Expression of RAR-related orphan receptor A (RORA), which has been linked to asthma in genome-wide association studies, was significantly upregulated in patients with SA. Gene network modeling revealed decreased glucocorticoid receptor signaling and increased activity of the mitogen-activated protein kinase and Jun kinase cascades in patients with SA. CONCLUSION: Circulating leukocytes from children with controlled asthma and those with SA have distinct gene expression profiles, demonstrating the possible development of specific molecular biomarkers and supporting the need for novel therapeutic approaches.


Subject(s)
Asthma/drug therapy , Asthma/genetics , Drug Resistance/genetics , Glucocorticoids/therapeutic use , RNA, Messenger/genetics , Transcriptome , Adolescent , Asthma/pathology , Case-Control Studies , Child , Child, Preschool , Female , Gene Expression Profiling , Genome-Wide Association Study , Humans , JNK Mitogen-Activated Protein Kinases/genetics , Male , Nuclear Receptor Subfamily 1, Group F, Member 1/genetics , Receptors, Glucocorticoid/genetics , Severity of Illness Index
15.
Proc Natl Acad Sci U S A ; 111(31): 11467-72, 2014 Aug 05.
Article in English | MEDLINE | ID: mdl-25049417

ABSTRACT

Next-generation sequencing experiments have shown that microRNAs (miRNAs) are expressed in many different isoforms (isomiRs), whose biological relevance is often unclear. We found that mature miR-21, the most widely researched miRNA because of its importance in human disease, is produced in two prevalent isomiR forms that differ by 1 nt at their 3' end, and moreover that the 3' end of miR-21 is posttranscriptionally adenylated by the noncanonical poly(A) polymerase PAPD5. PAPD5 knockdown caused an increase in the miR-21 expression level, suggesting that PAPD5-mediated adenylation of miR-21 leads to its degradation. Exoribonuclease knockdown experiments followed by small-RNA sequencing suggested that PARN degrades miR-21 in the 3'-to-5' direction. In accordance with this model, microarray expression profiling demonstrated that PAPD5 knockdown results in a down-regulation of miR-21 target mRNAs. We found that disruption of the miR-21 adenylation and degradation pathway is a general feature in tumors across a wide range of tissues, as evidenced by data from The Cancer Genome Atlas, as well as in the noncancerous proliferative disease psoriasis. We conclude that PAPD5 and PARN mediate degradation of oncogenic miRNA miR-21 through a tailing and trimming process, and that this pathway is disrupted in cancer and other proliferative diseases.


Subject(s)
Adenine/metabolism , MicroRNAs/metabolism , Neoplasms/genetics , RNA Nucleotidyltransferases/metabolism , RNA Stability , Base Sequence , Cytosine/metabolism , Exoribonucleases/metabolism , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Gene Knockdown Techniques , High-Throughput Nucleotide Sequencing , Humans , MCF-7 Cells , MicroRNAs/chemistry , MicroRNAs/genetics , Models, Biological , Molecular Sequence Data , Neoplasms/pathology , Nucleic Acid Conformation , Protein Isoforms/chemistry , Protein Isoforms/genetics , Protein Isoforms/metabolism , Ribonuclease III/metabolism
16.
Nature ; 507(7493): 462-70, 2014 Mar 27.
Article in English | MEDLINE | ID: mdl-24670764

ABSTRACT

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.


Subject(s)
Atlases as Topic , Molecular Sequence Annotation , Promoter Regions, Genetic/genetics , Transcriptome/genetics , Animals , Cell Line , Cells, Cultured , Cluster Analysis , Conserved Sequence/genetics , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Genes, Essential/genetics , Genome/genetics , Humans , Mice , Open Reading Frames/genetics , Organ Specificity , RNA, Messenger/analysis , RNA, Messenger/genetics , Transcription Factors/metabolism , Transcription Initiation Site , Transcription, Genetic/genetics
17.
Biochemistry ; 51(31): 6056-67, 2012 Aug 07.
Article in English | MEDLINE | ID: mdl-22765348

ABSTRACT

Nucleic acid oligonucleotides are widely used in hybridization experiments for specific detection of complementary nucleic acid sequences. For design and application of oligonucleotides, an understanding of their thermodynamic properties is essential. Recently, exciton-controlled hybridization-sensitive fluorescent oligonucleotides (ECHOs) were developed as uniquely labeled DNA oligomers containing commonly one thymidine having two covalently linked thiazole orange dye moieties. The fluorescent signal of an ECHO is strictly hybridization-controlled, where the dye moieties have to intercalate into double-stranded DNA for signal generation. Here we analyzed the hybridization thermodynamics of ECHO/DNA duplexes, and thermodynamic parameters were obtained from melting curves of 64 ECHO/DNA duplexes measured by ultraviolet absorbance and fluorescence. Both methods demonstrated a substantial increase in duplex stability (ΔΔG°(37) ~ -2.6 ± 0.7 kcal mol(-1)) compared to that of DNA/DNA duplexes of the same sequence. With the exception of T·G mismatches, this increased stability was mostly unaffected by other mismatches in the position opposite the labeled nucleotide. A nearest neighbor model was constructed for predicting thermodynamic parameters for duplex stability. Evaluation of the nearest neighbor parameters by cross validation tests showed higher predictive reliability for the fluorescence-based than the absorbance-based parameters. Using our experimental data, a tool for predicting the thermodynamics of formation of ECHO/DNA duplexes was developed that is freely available at http://genome.gsc.riken.jp/echo/thermodynamics/. It provides reliable thermodynamic data for using the unique features of ECHOs in fluorescence-based experiments.


Subject(s)
Benzothiazoles/chemistry , DNA/chemistry , Quinolines/chemistry , Thymidine/chemistry , Base Pair Mismatch , Base Sequence , DNA/genetics , Drug Design , Fluorescent Dyes/chemistry , Fluorescent Dyes/metabolism , Models, Molecular , Nucleic Acid Conformation , Nucleic Acid Denaturation , Nucleic Acid Hybridization , Oligodeoxyribonucleotides/chemistry , Oligodeoxyribonucleotides/genetics , Thermodynamics , Transition Temperature
18.
Nucleic Acids Res ; 39(9): e59, 2011 May.
Article in English | MEDLINE | ID: mdl-21310714

ABSTRACT

The application of isothermal amplification technologies is rapidly expanding and currently covers different areas such as infectious disease, genetic disorder and drug dosage adjustment. Meanwhile, many of such technologies have complex reaction processes and often require a fine-tuned primer set where existing primer design tools are not sufficient. We have developed a primer selection system for one important primer, the turn-back primer (TP), which is commonly used in loop-mediated amplification (LAMP) and smart amplification process (SmartAmp). We chose 78 parameters related to the primer and target sequence, and explored their relationship to amplification speed using experimental data for 1344 primer combinations. We employed the least absolute shrinkage and selection operator (LASSO) method for parameter selection and estimation of their numerical coefficients. We subsequently evaluated our prediction model using additional independent experiments and compared to the LAMP primer design tool, Primer Explorer version4 (PE4). The evaluation showed that our approach yields a superior primer design in isothermal amplification and is robust against variations in the experimental setup. Our LASSO regression analysis revealed that availability of the 3'- and 5'-end of the primer are particularly important factors for efficient isothermal amplification. Our computer script is freely available at: http://gerg.gsc.riken.jp/TP_optimization/.


Subject(s)
DNA Primers/chemistry , Nucleic Acid Amplification Techniques , Humans , Software , Temperature
19.
Curr Biol ; 20(17): R735-45, 2010 Sep 14.
Article in English | MEDLINE | ID: mdl-20833318

ABSTRACT

Genome sequencing of multiple species makes it possible to understand the main principles behind the evolution of developmental regulatory networks. It is especially interesting to analyze the evolution of well-defined model systems in which conservation patterns can be directly correlated with the functional roles of various network components. Endospore formation (sporulation), extensively studied in Bacillus subtilis, is driven by such a model bacterial network of cellular development and differentiation. In this review, we analyze the evolution of the sporulation network in multiple endospore-forming bacteria. Importantly, the network evolution is not random but primarily follows the hierarchical organization and functional logic of the sporulation process. Specifically, the sporulation sigma factors and the master regulator of sporulation, Spo0A, are conserved in all considered spore-formers. The sequential activation of these global regulators is also strongly conserved. The feed-forward loops, which are likely used to fine-tune waves of gene expression within regulatory modules, show an intermediate level of conservation. These loops are less conserved than the sigma factors but significantly more than the structural sporulation genes, which form the lowest level in the functional and evolutionary hierarchy of the sporulation network. Interestingly, in spore-forming bacteria, gene regulation is more conserved than gene presence for sporulation genes, while the opposite is true for non-sporulation genes. The observed patterns suggest that, by understanding the functional organization of a developmental network in a model organism, it is possible to understand the logic behind the evolution of this network in multiple related species.


Subject(s)
Bacillus subtilis/physiology , Evolution, Molecular , Spores, Bacterial , Bacillus subtilis/classification , Bacillus subtilis/genetics , DNA Replication , Gene Regulatory Networks , Phylogeny , Transcription, Genetic
20.
Genome Res ; 20(10): 1398-410, 2010 Oct.
Article in English | MEDLINE | ID: mdl-20719920

ABSTRACT

Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after knockdown of nucleotidyltransferase enzymes. The PAPD4 nucleotidyltransferase adenylates a wide range of miRNA loci, but adenylation does not appear to affect miRNA stability on a genome-wide scale. Adenine addition appears to reduce effectiveness of miRNA targeting of mRNA transcripts while deep-sequencing of RNA bound to immunoprecipitated Argonaute (AGO) subfamily proteins EIF2C1-EIF2C3 revealed substantial reduction of adenine addition in miRNA associated with EIF2C2 and EIF2C3. Our findings show 3' addition events are widespread and conserved across animals, PAPD4 is a primary miRNA adenylating enzyme, and suggest a role for 3' adenine addition in modulating miRNA effectiveness, possibly through interfering with incorporation into the RNA-induced silencing complex (RISC), a regulatory role that would complement the role of miRNA uridylation in blocking DICER1 uptake.


Subject(s)
3' Untranslated Regions/genetics , Adenine/metabolism , MicroRNAs/metabolism , Nucleotidyltransferases/metabolism , Animals , Argonaute Proteins , Cell Line , Eukaryotic Initiation Factor-2/metabolism , Eukaryotic Initiation Factors/metabolism , Humans , Mice , MicroRNAs/chemistry , MicroRNAs/genetics , Monocytes , Nucleotidyltransferases/genetics , Polynucleotide Adenylyltransferase , RNA Stability , mRNA Cleavage and Polyadenylation Factors
SELECTION OF CITATIONS
SEARCH DETAIL