Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
Genome Res ; 30(7): 1073-1081, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32079618

RESUMO

Long noncoding RNAs (lncRNAs) have emerged as key coordinators of biological and cellular processes. Characterizing lncRNA expression across cells and tissues is key to understanding their role in determining phenotypes, including human diseases. We present here FC-R2, a comprehensive expression atlas across a broadly defined human transcriptome, inclusive of over 109,000 coding and noncoding genes, as described in the FANTOM CAGE-Associated Transcriptome (FANTOM-CAT) study. This atlas greatly extends the gene annotation used in the original recount2 resource. We demonstrate the utility of the FC-R2 atlas by reproducing key findings from published large studies and by generating new results across normal and diseased human samples. In particular, we (a) identify tissue-specific transcription profiles for distinct classes of coding and noncoding genes, (b) perform differential expression analysis across thirteen cancer types, identifying novel noncoding genes potentially involved in tumor pathogenesis and progression, and (c) confirm the prognostic value for several enhancer lncRNAs expression in cancer. Our resource is instrumental for the systematic molecular characterization of lncRNA by the FANTOM6 Consortium. In conclusion, comprised of over 70,000 samples, the FC-R2 atlas will empower other researchers to investigate functions and biological roles of both known coding genes and novel lncRNAs.


Assuntos
Transcriptoma , Bases de Dados Genéticas , Elementos Facilitadores Genéticos , Perfilação da Expressão Gênica , Genoma Humano , Humanos , Neoplasias/genética , Especificidade de Órgãos , Prognóstico , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , RNA Mensageiro/metabolismo
2.
Genome Res ; 30(7): 951-961, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32718981

RESUMO

Gene expression profiles in homologous tissues have been observed to be different between species, which may be due to differences between species in the gene expression program in each cell type, but may also reflect differences in cell type composition of each tissue in different species. Here, we compare expression profiles in matching primary cells in human, mouse, rat, dog, and chicken using Cap Analysis Gene Expression (CAGE) and short RNA (sRNA) sequencing data from FANTOM5. While we find that expression profiles of orthologous genes in different species are highly correlated across cell types, in each cell type many genes were differentially expressed between species. Expression of genes with products involved in transcription, RNA processing, and transcriptional regulation was more likely to be conserved, while expression of genes encoding proteins involved in intercellular communication was more likely to have diverged during evolution. Conservation of expression correlated positively with the evolutionary age of genes, suggesting that divergence in expression levels of genes critical for cell function was restricted during evolution. Motif activity analysis showed that both promoters and enhancers are activated by the same transcription factors in different species. An analysis of expression levels of mature miRNAs and of primary miRNAs identified by CAGE revealed that evolutionary old miRNAs are more likely to have conserved expression patterns than young miRNAs. We conclude that key aspects of the regulatory network are conserved, while differential expression of genes involved in cell-to-cell communication may contribute greatly to phenotypic differences between species.


Assuntos
Evolução Molecular , Transcriptoma , Animais , Galinhas/genética , Cães , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Camundongos , MicroRNAs/metabolismo , Motivos de Nucleotídeos , Análise de Componente Principal , Regiões Promotoras Genéticas , Ratos , Especificidade da Espécie , Fatores de Transcrição/metabolismo
3.
Nature ; 543(7644): 199-204, 2017 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-28241135

RESUMO

Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.


Assuntos
Bases de Dados Genéticas , RNA Longo não Codificante/química , RNA Longo não Codificante/genética , Transcriptoma/genética , Células Cultivadas , Sequência Conservada/genética , Conjuntos de Dados como Assunto , Elementos Facilitadores Genéticos/genética , Epigênese Genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Genômica , Humanos , Internet , Anotação de Sequência Molecular , Especificidade de Órgãos/genética , Polimorfismo de Nucleotídeo Único , Regiões Promotoras Genéticas/genética , Locos de Características Quantitativas/genética , Estabilidade de RNA , RNA Mensageiro/genética
4.
Nature ; 507(7493): 462-70, 2014 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-24670764

RESUMO

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.


Assuntos
Atlas como Assunto , Anotação de Sequência Molecular , Regiões Promotoras Genéticas/genética , Transcriptoma/genética , Animais , Linhagem Celular , Células Cultivadas , Análise por Conglomerados , Sequência Conservada/genética , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Genes Essenciais/genética , Genoma/genética , Humanos , Camundongos , Fases de Leitura Aberta/genética , Especificidade de Órgãos , RNA Mensageiro/análise , RNA Mensageiro/genética , Fatores de Transcrição/metabolismo , Sítio de Iniciação de Transcrição , Transcrição Gênica/genética
5.
Nucleic Acids Res ; 44(7): 3233-52, 2016 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-27001520

RESUMO

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs.


Assuntos
RNA Longo não Codificante/genética , Núcleo Celular/genética , Desenvolvimento Embrionário/genética , Regulação da Expressão Gênica , Humanos , Elementos Isolantes , Anotação de Sequência Molecular , Regiões Promotoras Genéticas , RNA Longo não Codificante/classificação , RNA Longo não Codificante/metabolismo , Retroviridae/genética , Biologia de Sistemas , Sequências Repetidas Terminais , Fatores de Transcrição/metabolismo
6.
J Immunol ; 194(12): 6035-44, 2015 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-25957166

RESUMO

Basic leucine zipper transcription factor Batf2 is poorly described, whereas Batf and Batf3 have been shown to play essential roles in dendritic cell, T cell, and B cell development and regulation. Batf2 was drastically induced in IFN-γ-activated classical macrophages (M1) compared with unstimulated or IL-4-activated alternative macrophages (M2). Batf2 knockdown experiments from IFN-γ-activated macrophages and subsequent expression profiling demonstrated important roles for regulation of immune responses, inducing inflammatory and host-protective genes Tnf, Ccl5, and Nos2. Mycobacterium tuberculosis (Beijing strain HN878)-infected macrophages further induced Batf2 and augmented host-protective Batf2-dependent genes, particularly in M1, whose mechanism was suggested to be mediated through both TLR2 and TLR4 by LPS and heat-killed HN878 (HKTB) stimulation experiments. Irf1 binding motif was enriched in the promoters of Batf2-regulated genes. Coimmunoprecipitation study demonstrated Batf2 association with Irf1. Furthermore, Irf1 knockdown showed downregulation of IFN-γ- or LPS/HKTB-activated host-protective genes Tnf, Ccl5, Il12b, and Nos2. Conclusively, Batf2 is an activation marker gene for M1 involved in gene regulation of IFN-γ-activated classical macrophages, as well as LPS/HKTB-induced macrophage stimulation, possibly by Batf2/Irf1 gene induction. Taken together, these results underline the role of Batf2/Irf1 in inducing inflammatory responses in M. tuberculosis infection.


Assuntos
Fatores de Transcrição de Zíper de Leucina Básica/genética , Fator Regulador 1 de Interferon/genética , Macrófagos/imunologia , Macrófagos/metabolismo , Infecções por Mycobacterium/genética , Infecções por Mycobacterium/imunologia , Mycobacterium/imunologia , Animais , Fatores de Transcrição de Zíper de Leucina Básica/metabolismo , Análise por Conglomerados , Modelos Animais de Doenças , Expressão Gênica , Perfilação da Expressão Gênica , Regulação da Expressão Gênica/efeitos dos fármacos , Técnicas de Silenciamento de Genes , Fator Regulador 1 de Interferon/metabolismo , Interferon gama/farmacologia , Lipopolissacarídeos/imunologia , Ativação de Macrófagos/imunologia , Masculino , Camundongos , Infecções por Mycobacterium/metabolismo , Óxido Nítrico Sintase Tipo II/genética , Óxido Nítrico Sintase Tipo II/metabolismo , Ligação Proteica , Fatores de Necrose Tumoral/genética , Fatores de Necrose Tumoral/metabolismo
7.
Nucleic Acids Res ; 43(14): 6969-82, 2015 Aug 18.
Artigo em Inglês | MEDLINE | ID: mdl-26117544

RESUMO

Classically or alternatively activated macrophages (M1 and M2, respectively) play distinct and important roles for microbiocidal activity, regulation of inflammation and tissue homeostasis. Despite this, their transcriptional regulatory dynamics are poorly understood. Using promoter-level expression profiling by non-biased deepCAGE we have studied the transcriptional dynamics of classically and alternatively activated macrophages. Transcription factor (TF) binding motif activity analysis revealed four motifs, NFKB1_REL_RELA, IRF1,2, IRF7 and TBP that are commonly activated but have distinct activity dynamics in M1 and M2 activation. We observe matching changes in the expression profiles of the corresponding TFs and show that only a restricted set of TFs change expression. There is an overall drastic and transient up-regulation in M1 and a weaker and more sustainable up-regulation in M2. Novel TFs, such as Thap6, Maff, (M1) and Hivep1, Nfil3, Prdm1, (M2) among others, were suggested to be involved in the activation processes. Additionally, 52 (M1) and 67 (M2) novel differentially expressed genes and, for the first time, several differentially expressed long non-coding RNA (lncRNA) transcriptome markers were identified. In conclusion, the finding of novel motifs, TFs and protein-coding and lncRNA genes is an important step forward to fully understand the transcriptional machinery of macrophage activation.


Assuntos
Regulação da Expressão Gênica , Ativação de Macrófagos/genética , Macrófagos/metabolismo , Transcriptoma , Animais , Células Cultivadas , DNA/química , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Interferon gama/farmacologia , Interleucina-13/farmacologia , Interleucina-4/farmacologia , Macrófagos/efeitos dos fármacos , Masculino , Camundongos Endogâmicos BALB C , Motivos de Nucleotídeos , Regiões Promotoras Genéticas , Análise de Sequência de DNA , Fatores de Transcrição/metabolismo
8.
Proc Natl Acad Sci U S A ; 111(31): 11467-72, 2014 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-25049417

RESUMO

Next-generation sequencing experiments have shown that microRNAs (miRNAs) are expressed in many different isoforms (isomiRs), whose biological relevance is often unclear. We found that mature miR-21, the most widely researched miRNA because of its importance in human disease, is produced in two prevalent isomiR forms that differ by 1 nt at their 3' end, and moreover that the 3' end of miR-21 is posttranscriptionally adenylated by the noncanonical poly(A) polymerase PAPD5. PAPD5 knockdown caused an increase in the miR-21 expression level, suggesting that PAPD5-mediated adenylation of miR-21 leads to its degradation. Exoribonuclease knockdown experiments followed by small-RNA sequencing suggested that PARN degrades miR-21 in the 3'-to-5' direction. In accordance with this model, microarray expression profiling demonstrated that PAPD5 knockdown results in a down-regulation of miR-21 target mRNAs. We found that disruption of the miR-21 adenylation and degradation pathway is a general feature in tumors across a wide range of tissues, as evidenced by data from The Cancer Genome Atlas, as well as in the noncancerous proliferative disease psoriasis. We conclude that PAPD5 and PARN mediate degradation of oncogenic miRNA miR-21 through a tailing and trimming process, and that this pathway is disrupted in cancer and other proliferative diseases.


Assuntos
Adenina/metabolismo , MicroRNAs/metabolismo , Neoplasias/genética , RNA Nucleotidiltransferases/metabolismo , Estabilidade de RNA , Sequência de Bases , Citosina/metabolismo , Exorribonucleases/metabolismo , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Técnicas de Silenciamento de Genes , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Células MCF-7 , MicroRNAs/química , MicroRNAs/genética , Modelos Biológicos , Dados de Sequência Molecular , Neoplasias/patologia , Conformação de Ácido Nucleico , Isoformas de Proteínas/química , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Ribonuclease III/metabolismo
9.
J Allergy Clin Immunol ; 136(3): 638-48, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25863981

RESUMO

BACKGROUND: Children with problematic severe asthma have poor disease control despite high doses of inhaled corticosteroids and additional therapy, leading to personal suffering, early deterioration of lung function, and significant consumption of health care resources. If no exacerbating factors, such as smoking or allergies, are found after extensive investigation, these children are given a diagnosis of therapy-resistant (or therapy-refractory) asthma (SA). OBJECTIVE: We sought to deepen our understanding of childhood SA by analyzing gene expression and modeling the underlying regulatory transcription factor networks in peripheral blood leukocytes. METHODS: Gene expression was analyzed by using Cap Analysis of Gene Expression in children with SA (n = 13), children with controlled persistent asthma (n = 15), and age-matched healthy control subjects (n = 9). Cap Analysis of Gene Expression sequencing detects the transcription start sites of known and novel mRNAs and noncoding RNAs. RESULTS: Sample groups could be separated by hierarchical clustering on 1305 differentially expressed transcription start sites, including 816 known genes and several novel transcripts. Ten of 13 tested novel transcripts were validated by means of RT-PCR and Sanger sequencing. Expression of RAR-related orphan receptor A (RORA), which has been linked to asthma in genome-wide association studies, was significantly upregulated in patients with SA. Gene network modeling revealed decreased glucocorticoid receptor signaling and increased activity of the mitogen-activated protein kinase and Jun kinase cascades in patients with SA. CONCLUSION: Circulating leukocytes from children with controlled asthma and those with SA have distinct gene expression profiles, demonstrating the possible development of specific molecular biomarkers and supporting the need for novel therapeutic approaches.


Assuntos
Asma/tratamento farmacológico , Asma/genética , Resistência a Medicamentos/genética , Glucocorticoides/uso terapêutico , RNA Mensageiro/genética , Transcriptoma , Adolescente , Asma/patologia , Estudos de Casos e Controles , Criança , Pré-Escolar , Feminino , Perfilação da Expressão Gênica , Estudo de Associação Genômica Ampla , Humanos , Proteínas Quinases JNK Ativadas por Mitógeno/genética , Masculino , Membro 1 do Grupo F da Subfamília 1 de Receptores Nucleares/genética , Receptores de Glucocorticoides/genética , Índice de Gravidade de Doença
10.
PLoS One ; 19(5): e0295971, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38709794

RESUMO

The human genome is pervasively transcribed and produces a wide variety of long non-coding RNAs (lncRNAs), constituting the majority of transcripts across human cell types. Some specific nuclear lncRNAs have been shown to be important regulatory components acting locally. As RNA-chromatin interaction and Hi-C chromatin conformation data showed that chromatin interactions of nuclear lncRNAs are determined by the local chromatin 3D conformation, we used Hi-C data to identify potential target genes of lncRNAs. RNA-protein interaction data suggested that nuclear lncRNAs act as scaffolds to recruit regulatory proteins to target promoters and enhancers. Nuclear lncRNAs may therefore play a role in directing regulatory factors to locations spatially close to the lncRNA gene. We provide the analysis results through an interactive visualization web portal at https://fantom.gsc.riken.jp/zenbu/reports/#F6_3D_lncRNA.


Assuntos
Cromatina , RNA Longo não Codificante , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Cromatina/metabolismo , Cromatina/genética , Humanos , Anotação de Sequência Molecular , Núcleo Celular/metabolismo , Núcleo Celular/genética , Genoma Humano , Regiões Promotoras Genéticas
11.
Genome Res ; 20(2): 257-64, 2010 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-20051556

RESUMO

MicroRNAs (miRNAs) are short (20-23 nt) RNAs that are sequence-specific mediators of transcriptional and post-transcriptional regulation of gene expression. Modern high-throughput technologies enable deep sequencing of such RNA species on an unprecedented scale. We find that the analysis of small RNA deep-sequencing libraries can be affected by cross-mapping, in which RNA sequences originating from one locus are inadvertently mapped to another. Similar to cross-hybridization on microarrays, cross-mapping is prevalent among miRNAs, as they tend to occur in families, are similar or derived from repeat or structural RNAs, or are post-transcriptionally modified. Here, we develop a strategy to correct for cross-mapping, and apply it to the analysis of RNA editing in mature miRNAs. In contrast to previous reports, our analysis suggests that RNA editing in mature miRNAs is rare in animals.


Assuntos
Biblioteca Gênica , MicroRNAs/genética , Edição de RNA/genética , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos , Animais , Sequência de Bases , Ensaios de Triagem em Larga Escala , Humanos , Camundongos , MicroRNAs/metabolismo
12.
Genome Res ; 20(10): 1398-410, 2010 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-20719920

RESUMO

Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after knockdown of nucleotidyltransferase enzymes. The PAPD4 nucleotidyltransferase adenylates a wide range of miRNA loci, but adenylation does not appear to affect miRNA stability on a genome-wide scale. Adenine addition appears to reduce effectiveness of miRNA targeting of mRNA transcripts while deep-sequencing of RNA bound to immunoprecipitated Argonaute (AGO) subfamily proteins EIF2C1-EIF2C3 revealed substantial reduction of adenine addition in miRNA associated with EIF2C2 and EIF2C3. Our findings show 3' addition events are widespread and conserved across animals, PAPD4 is a primary miRNA adenylating enzyme, and suggest a role for 3' adenine addition in modulating miRNA effectiveness, possibly through interfering with incorporation into the RNA-induced silencing complex (RISC), a regulatory role that would complement the role of miRNA uridylation in blocking DICER1 uptake.


Assuntos
Regiões 3' não Traduzidas/genética , Adenina/metabolismo , MicroRNAs/metabolismo , Nucleotidiltransferases/metabolismo , Animais , Proteínas Argonautas , Linhagem Celular , Fator de Iniciação 2 em Eucariotos/metabolismo , Fatores de Iniciação em Eucariotos/metabolismo , Humanos , Camundongos , MicroRNAs/química , MicroRNAs/genética , Monócitos , Nucleotidiltransferases/genética , Polinucleotídeo Adenililtransferase , Estabilidade de RNA , Fatores de Poliadenilação e Clivagem de mRNA
13.
Nucleic Acids Res ; 39(9): e59, 2011 May.
Artigo em Inglês | MEDLINE | ID: mdl-21310714

RESUMO

The application of isothermal amplification technologies is rapidly expanding and currently covers different areas such as infectious disease, genetic disorder and drug dosage adjustment. Meanwhile, many of such technologies have complex reaction processes and often require a fine-tuned primer set where existing primer design tools are not sufficient. We have developed a primer selection system for one important primer, the turn-back primer (TP), which is commonly used in loop-mediated amplification (LAMP) and smart amplification process (SmartAmp). We chose 78 parameters related to the primer and target sequence, and explored their relationship to amplification speed using experimental data for 1344 primer combinations. We employed the least absolute shrinkage and selection operator (LASSO) method for parameter selection and estimation of their numerical coefficients. We subsequently evaluated our prediction model using additional independent experiments and compared to the LAMP primer design tool, Primer Explorer version4 (PE4). The evaluation showed that our approach yields a superior primer design in isothermal amplification and is robust against variations in the experimental setup. Our LASSO regression analysis revealed that availability of the 3'- and 5'-end of the primer are particularly important factors for efficient isothermal amplification. Our computer script is freely available at: http://gerg.gsc.riken.jp/TP_optimization/.


Assuntos
Primers do DNA/química , Técnicas de Amplificação de Ácido Nucleico , Humanos , Software , Temperatura
14.
Biochemistry ; 51(31): 6056-67, 2012 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-22765348

RESUMO

Nucleic acid oligonucleotides are widely used in hybridization experiments for specific detection of complementary nucleic acid sequences. For design and application of oligonucleotides, an understanding of their thermodynamic properties is essential. Recently, exciton-controlled hybridization-sensitive fluorescent oligonucleotides (ECHOs) were developed as uniquely labeled DNA oligomers containing commonly one thymidine having two covalently linked thiazole orange dye moieties. The fluorescent signal of an ECHO is strictly hybridization-controlled, where the dye moieties have to intercalate into double-stranded DNA for signal generation. Here we analyzed the hybridization thermodynamics of ECHO/DNA duplexes, and thermodynamic parameters were obtained from melting curves of 64 ECHO/DNA duplexes measured by ultraviolet absorbance and fluorescence. Both methods demonstrated a substantial increase in duplex stability (ΔΔG°(37) ~ -2.6 ± 0.7 kcal mol(-1)) compared to that of DNA/DNA duplexes of the same sequence. With the exception of T·G mismatches, this increased stability was mostly unaffected by other mismatches in the position opposite the labeled nucleotide. A nearest neighbor model was constructed for predicting thermodynamic parameters for duplex stability. Evaluation of the nearest neighbor parameters by cross validation tests showed higher predictive reliability for the fluorescence-based than the absorbance-based parameters. Using our experimental data, a tool for predicting the thermodynamics of formation of ECHO/DNA duplexes was developed that is freely available at http://genome.gsc.riken.jp/echo/thermodynamics/. It provides reliable thermodynamic data for using the unique features of ECHOs in fluorescence-based experiments.


Assuntos
Benzotiazóis/química , DNA/química , Quinolinas/química , Timidina/química , Pareamento Incorreto de Bases , Sequência de Bases , DNA/genética , Desenho de Fármacos , Corantes Fluorescentes/química , Corantes Fluorescentes/metabolismo , Modelos Moleculares , Conformação de Ácido Nucleico , Desnaturação de Ácido Nucleico , Hibridização de Ácido Nucleico , Oligodesoxirribonucleotídeos/química , Oligodesoxirribonucleotídeos/genética , Termodinâmica , Temperatura de Transição
15.
BMC Genom Data ; 22(1): 33, 2021 09 14.
Artigo em Inglês | MEDLINE | ID: mdl-34521352

RESUMO

BACKGROUND: The lymphatic and the blood vasculature are closely related systems that collaborate to ensure the organism's physiological function. Despite their common developmental origin, they present distinct functional fates in adulthood that rely on robust lineage-specific regulatory programs. The recent technological boost in sequencing approaches unveiled long noncoding RNAs (lncRNAs) as prominent regulatory players of various gene expression levels in a cell-type-specific manner. RESULTS: To investigate the potential roles of lncRNAs in vascular biology, we performed antisense oligonucleotide (ASO) knockdowns of lncRNA candidates specifically expressed either in human lymphatic or blood vascular endothelial cells (LECs or BECs) followed by Cap Analysis of Gene Expression (CAGE-Seq). Here, we describe the quality control steps adopted in our analysis pipeline before determining the knockdown effects of three ASOs per lncRNA target on the LEC or BEC transcriptomes. In this regard, we especially observed that the choice of negative control ASOs can dramatically impact the conclusions drawn from the analysis depending on the cellular background. CONCLUSION: In conclusion, the comparison of negative control ASO effects on the targeted cell type transcriptomes highlights the essential need to select a proper control set of multiple negative control ASO based on the investigated cell types.


Assuntos
Técnicas de Silenciamento de Genes/métodos , Oligonucleotídeos Antissenso/genética , Especificidade de Órgãos/genética , RNA Longo não Codificante/genética , Adulto , Células Endoteliais/metabolismo , Técnicas de Silenciamento de Genes/normas , Humanos , Sistema Linfático/citologia , Sistema Linfático/metabolismo , Oligonucleotídeos Antissenso/normas , Transcriptoma
16.
Nat Commun ; 12(1): 925, 2021 02 10.
Artigo em Inglês | MEDLINE | ID: mdl-33568674

RESUMO

Recent studies have revealed the importance of long noncoding RNAs (lncRNAs) as tissue-specific regulators of gene expression. There is ample evidence that distinct types of vasculature undergo tight transcriptional control to preserve their structure, identity, and functions. We determine a comprehensive map of lineage-specific lncRNAs in human dermal lymphatic and blood vascular endothelial cells (LECs and BECs), combining RNA-Seq and CAGE-Seq. Subsequent antisense oligonucleotide-knockdown transcriptomic profiling of two LEC- and two BEC-specific lncRNAs identifies LETR1 as a critical gatekeeper of the global LEC transcriptome. Deep RNA-DNA, RNA-protein interaction studies, and phenotype rescue analyses reveal that LETR1 is a nuclear trans-acting lncRNA modulating, via key epigenetic factors, the expression of essential target genes, including KLF4 and SEMA3C, governing the growth and migratory ability of LECs. Together, our study provides several lines of evidence supporting the intriguing concept that every cell type expresses precise lncRNA signatures to control lineage-specific regulatory programs.


Assuntos
Células Endoteliais/citologia , Fatores de Transcrição Kruppel-Like/metabolismo , Semaforinas/metabolismo , Movimento Celular , Proliferação de Células , Células Endoteliais/metabolismo , Regulação da Expressão Gênica , Humanos , Fator 4 Semelhante a Kruppel , Fatores de Transcrição Kruppel-Like/genética , RNA Longo não Codificante , Semaforinas/genética
17.
Nat Commun ; 12(1): 3297, 2021 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-34078885

RESUMO

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism.


Assuntos
Repetições de Microssatélites , Redes Neurais de Computação , Doenças Neurodegenerativas/genética , Sítio de Iniciação de Transcrição , Iniciação da Transcrição Genética , Células A549 , Animais , Sequência de Bases , Biologia Computacional/métodos , Aprendizado Profundo , Elementos Facilitadores Genéticos , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Camundongos , Doenças Neurodegenerativas/diagnóstico , Doenças Neurodegenerativas/metabolismo , Polimorfismo Genético , Regiões Promotoras Genéticas
18.
Bioinformatics ; 25(19): 2613-4, 2009 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-19605420

RESUMO

UNLABELLED: Multi-mapping sequence tags are a significant impediment to short-read sequencing platforms. These tags are routinely omitted from further analysis, leading to experimental bias and reduced coverage. Here, we present MuMRescueLite, a low-resource requirement version of the MuMRescue software that has been used by several next generation sequencing projects to probabilistically reincorporate multi-mapping tags into mapped short read data. AVAILABILITY AND IMPLEMENTATION: MuMRescueLite is written in Python; executables and documentation are available from http://genome.gsc.riken.jp/osc/english/software/.


Assuntos
Biologia Computacional/métodos , Análise de Sequência de DNA/métodos , Software
19.
Bioinformatics ; 25(11): 1422-3, 2009 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-19304878

RESUMO

SUMMARY: The Biopython project is a mature open source international collaboration of volunteer developers, providing Python libraries for a wide range of bioinformatics problems. Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macro molecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning. AVAILABILITY: Biopython is freely available, with documentation and source code at (www.biopython.org) under the Biopython license.


Assuntos
Biologia Computacional/métodos , Software , Bases de Dados Factuais , Internet , Linguagens de Programação
20.
BMC Bioinformatics ; 8: 47, 2007 Feb 08.
Artigo em Inglês | MEDLINE | ID: mdl-17286872

RESUMO

BACKGROUND: Computational prediction methods are currently used to identify genes in prokaryote genomes. However, identification of the correct translation initiation sites remains a difficult task. Accurate translation initiation sites (TISs) are important not only for the annotation of unknown proteins but also for the prediction of operons, promoters, and small non-coding RNA genes, as this typically makes use of the intergenic distance. A further problem is that most existing methods are optimized for Escherichia coli data sets; applying these methods to newly sequenced bacterial genomes may not result in an equivalent level of accuracy. RESULTS: Based on a biological representation of the translation process, we applied Bayesian statistics to create a score function for predicting translation initiation sites. In contrast to existing programs, our combination of methods uses supervised learning to optimally use the set of known translation initiation sites. We combined the Ribosome Binding Site (RBS) sequence, the distance between the translation initiation site and the RBS sequence, the base composition of the start codon, the nucleotide composition (A-rich sequences) following start codons, and the expected distribution of the protein length in a Bayesian scoring function. To further increase the prediction accuracy, we also took into account the operon orientation. The outcome of the procedure achieved a prediction accuracy of 93.2% in 858 E. coli genes from the EcoGene data set and 92.7% accuracy in a data set of 1243 Bacillus subtilis 'non-y' genes. We confirmed the performance in the GC-rich Gamma-Proteobacteria Herminiimonas arsenicoxydans, Pseudomonas aeruginosa, and Burkholderia pseudomallei K96243. CONCLUSION: Hon-yaku, being based on a careful choice of elements important in translation, improved the prediction accuracy in B. subtilis data sets and other bacteria except for E. coli. We believe that most remaining mispredictions are due to atypical ribosomal binding sequences used in specific translation control processes, or likely errors in the training data sets.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , DNA Bacteriano/genética , Iniciação Traducional da Cadeia Peptídica/genética , Biossíntese de Proteínas/genética , Análise de Sequência de DNA/métodos , Sítio de Iniciação de Transcrição , Teorema de Bayes , Biomimética/métodos , Bases de Dados Genéticas , Reconhecimento Automatizado de Padrão/métodos , Software , Biologia de Sistemas/métodos
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa