Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Genome Res ; 30(7): 951-961, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32718981

RESUMEN

Gene expression profiles in homologous tissues have been observed to be different between species, which may be due to differences between species in the gene expression program in each cell type, but may also reflect differences in cell type composition of each tissue in different species. Here, we compare expression profiles in matching primary cells in human, mouse, rat, dog, and chicken using Cap Analysis Gene Expression (CAGE) and short RNA (sRNA) sequencing data from FANTOM5. While we find that expression profiles of orthologous genes in different species are highly correlated across cell types, in each cell type many genes were differentially expressed between species. Expression of genes with products involved in transcription, RNA processing, and transcriptional regulation was more likely to be conserved, while expression of genes encoding proteins involved in intercellular communication was more likely to have diverged during evolution. Conservation of expression correlated positively with the evolutionary age of genes, suggesting that divergence in expression levels of genes critical for cell function was restricted during evolution. Motif activity analysis showed that both promoters and enhancers are activated by the same transcription factors in different species. An analysis of expression levels of mature miRNAs and of primary miRNAs identified by CAGE revealed that evolutionary old miRNAs are more likely to have conserved expression patterns than young miRNAs. We conclude that key aspects of the regulatory network are conserved, while differential expression of genes involved in cell-to-cell communication may contribute greatly to phenotypic differences between species.


Asunto(s)
Evolución Molecular , Transcriptoma , Animales , Pollos/genética , Perros , Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Humanos , Ratones , MicroARNs/metabolismo , Motivos de Nucleótidos , Análisis de Componente Principal , Regiones Promotoras Genéticas , Ratas , Especificidad de la Especie , Factores de Transcripción/metabolismo
2.
Genome Res ; 30(7): 1073-1081, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32079618

RESUMEN

Long noncoding RNAs (lncRNAs) have emerged as key coordinators of biological and cellular processes. Characterizing lncRNA expression across cells and tissues is key to understanding their role in determining phenotypes, including human diseases. We present here FC-R2, a comprehensive expression atlas across a broadly defined human transcriptome, inclusive of over 109,000 coding and noncoding genes, as described in the FANTOM CAGE-Associated Transcriptome (FANTOM-CAT) study. This atlas greatly extends the gene annotation used in the original recount2 resource. We demonstrate the utility of the FC-R2 atlas by reproducing key findings from published large studies and by generating new results across normal and diseased human samples. In particular, we (a) identify tissue-specific transcription profiles for distinct classes of coding and noncoding genes, (b) perform differential expression analysis across thirteen cancer types, identifying novel noncoding genes potentially involved in tumor pathogenesis and progression, and (c) confirm the prognostic value for several enhancer lncRNAs expression in cancer. Our resource is instrumental for the systematic molecular characterization of lncRNA by the FANTOM6 Consortium. In conclusion, comprised of over 70,000 samples, the FC-R2 atlas will empower other researchers to investigate functions and biological roles of both known coding genes and novel lncRNAs.


Asunto(s)
Transcriptoma , Bases de Datos Genéticas , Elementos de Facilitación Genéticos , Perfilación de la Expresión Génica , Genoma Humano , Humanos , Neoplasias/genética , Especificidad de Órganos , Pronóstico , ARN Largo no Codificante/genética , ARN Largo no Codificante/metabolismo , ARN Mensajero/metabolismo
3.
Nature ; 543(7644): 199-204, 2017 03 09.
Artículo en Inglés | MEDLINE | ID: mdl-28241135

RESUMEN

Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.


Asunto(s)
Bases de Datos Genéticas , ARN Largo no Codificante/química , ARN Largo no Codificante/genética , Transcriptoma/genética , Células Cultivadas , Secuencia Conservada/genética , Conjuntos de Datos como Asunto , Elementos de Facilitación Genéticos/genética , Epigénesis Genética , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Genoma Humano/genética , Estudio de Asociación del Genoma Completo , Genómica , Humanos , Internet , Anotación de Secuencia Molecular , Especificidad de Órganos/genética , Polimorfismo de Nucleótido Simple , Regiones Promotoras Genéticas/genética , Sitios de Carácter Cuantitativo/genética , Estabilidad del ARN , ARN Mensajero/genética
4.
Nature ; 507(7493): 462-70, 2014 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-24670764

RESUMEN

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.


Asunto(s)
Atlas como Asunto , Anotación de Secuencia Molecular , Regiones Promotoras Genéticas/genética , Transcriptoma/genética , Animales , Línea Celular , Células Cultivadas , Análisis por Conglomerados , Secuencia Conservada/genética , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Genes Esenciales/genética , Genoma/genética , Humanos , Ratones , Sistemas de Lectura Abierta/genética , Especificidad de Órganos , ARN Mensajero/análisis , ARN Mensajero/genética , Factores de Transcripción/metabolismo , Sitio de Iniciación de la Transcripción , Transcripción Genética/genética
5.
Nucleic Acids Res ; 44(7): 3233-52, 2016 Apr 20.
Artículo en Inglés | MEDLINE | ID: mdl-27001520

RESUMEN

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs.


Asunto(s)
ARN Largo no Codificante/genética , Núcleo Celular/genética , Desarrollo Embrionario/genética , Regulación de la Expresión Génica , Humanos , Elementos Aisladores , Anotación de Secuencia Molecular , Regiones Promotoras Genéticas , ARN Largo no Codificante/clasificación , ARN Largo no Codificante/metabolismo , Retroviridae/genética , Biología de Sistemas , Secuencias Repetidas Terminales , Factores de Transcripción/metabolismo
6.
J Immunol ; 194(12): 6035-44, 2015 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-25957166

RESUMEN

Basic leucine zipper transcription factor Batf2 is poorly described, whereas Batf and Batf3 have been shown to play essential roles in dendritic cell, T cell, and B cell development and regulation. Batf2 was drastically induced in IFN-γ-activated classical macrophages (M1) compared with unstimulated or IL-4-activated alternative macrophages (M2). Batf2 knockdown experiments from IFN-γ-activated macrophages and subsequent expression profiling demonstrated important roles for regulation of immune responses, inducing inflammatory and host-protective genes Tnf, Ccl5, and Nos2. Mycobacterium tuberculosis (Beijing strain HN878)-infected macrophages further induced Batf2 and augmented host-protective Batf2-dependent genes, particularly in M1, whose mechanism was suggested to be mediated through both TLR2 and TLR4 by LPS and heat-killed HN878 (HKTB) stimulation experiments. Irf1 binding motif was enriched in the promoters of Batf2-regulated genes. Coimmunoprecipitation study demonstrated Batf2 association with Irf1. Furthermore, Irf1 knockdown showed downregulation of IFN-γ- or LPS/HKTB-activated host-protective genes Tnf, Ccl5, Il12b, and Nos2. Conclusively, Batf2 is an activation marker gene for M1 involved in gene regulation of IFN-γ-activated classical macrophages, as well as LPS/HKTB-induced macrophage stimulation, possibly by Batf2/Irf1 gene induction. Taken together, these results underline the role of Batf2/Irf1 in inducing inflammatory responses in M. tuberculosis infection.


Asunto(s)
Factores de Transcripción con Cremalleras de Leucina de Carácter Básico/genética , Factor 1 Regulador del Interferón/genética , Macrófagos/inmunología , Macrófagos/metabolismo , Infecciones por Mycobacterium/genética , Infecciones por Mycobacterium/inmunología , Mycobacterium/inmunología , Animales , Factores de Transcripción con Cremalleras de Leucina de Carácter Básico/metabolismo , Análisis por Conglomerados , Modelos Animales de Enfermedad , Expresión Génica , Perfilación de la Expresión Génica , Regulación de la Expresión Génica/efectos de los fármacos , Técnicas de Silenciamiento del Gen , Factor 1 Regulador del Interferón/metabolismo , Interferón gamma/farmacología , Lipopolisacáridos/inmunología , Activación de Macrófagos/inmunología , Masculino , Ratones , Infecciones por Mycobacterium/metabolismo , Óxido Nítrico Sintasa de Tipo II/genética , Óxido Nítrico Sintasa de Tipo II/metabolismo , Unión Proteica , Factores de Necrosis Tumoral/genética , Factores de Necrosis Tumoral/metabolismo
7.
Nucleic Acids Res ; 43(14): 6969-82, 2015 Aug 18.
Artículo en Inglés | MEDLINE | ID: mdl-26117544

RESUMEN

Classically or alternatively activated macrophages (M1 and M2, respectively) play distinct and important roles for microbiocidal activity, regulation of inflammation and tissue homeostasis. Despite this, their transcriptional regulatory dynamics are poorly understood. Using promoter-level expression profiling by non-biased deepCAGE we have studied the transcriptional dynamics of classically and alternatively activated macrophages. Transcription factor (TF) binding motif activity analysis revealed four motifs, NFKB1_REL_RELA, IRF1,2, IRF7 and TBP that are commonly activated but have distinct activity dynamics in M1 and M2 activation. We observe matching changes in the expression profiles of the corresponding TFs and show that only a restricted set of TFs change expression. There is an overall drastic and transient up-regulation in M1 and a weaker and more sustainable up-regulation in M2. Novel TFs, such as Thap6, Maff, (M1) and Hivep1, Nfil3, Prdm1, (M2) among others, were suggested to be involved in the activation processes. Additionally, 52 (M1) and 67 (M2) novel differentially expressed genes and, for the first time, several differentially expressed long non-coding RNA (lncRNA) transcriptome markers were identified. In conclusion, the finding of novel motifs, TFs and protein-coding and lncRNA genes is an important step forward to fully understand the transcriptional machinery of macrophage activation.


Asunto(s)
Regulación de la Expresión Génica , Activación de Macrófagos/genética , Macrófagos/metabolismo , Transcriptoma , Animales , Células Cultivadas , ADN/química , Perfilación de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Interferón gamma/farmacología , Interleucina-13/farmacología , Interleucina-4/farmacología , Macrófagos/efectos de los fármacos , Masculino , Ratones Endogámicos BALB C , Motivos de Nucleótidos , Regiones Promotoras Genéticas , Análisis de Secuencia de ADN , Factores de Transcripción/metabolismo
8.
Proc Natl Acad Sci U S A ; 111(31): 11467-72, 2014 Aug 05.
Artículo en Inglés | MEDLINE | ID: mdl-25049417

RESUMEN

Next-generation sequencing experiments have shown that microRNAs (miRNAs) are expressed in many different isoforms (isomiRs), whose biological relevance is often unclear. We found that mature miR-21, the most widely researched miRNA because of its importance in human disease, is produced in two prevalent isomiR forms that differ by 1 nt at their 3' end, and moreover that the 3' end of miR-21 is posttranscriptionally adenylated by the noncanonical poly(A) polymerase PAPD5. PAPD5 knockdown caused an increase in the miR-21 expression level, suggesting that PAPD5-mediated adenylation of miR-21 leads to its degradation. Exoribonuclease knockdown experiments followed by small-RNA sequencing suggested that PARN degrades miR-21 in the 3'-to-5' direction. In accordance with this model, microarray expression profiling demonstrated that PAPD5 knockdown results in a down-regulation of miR-21 target mRNAs. We found that disruption of the miR-21 adenylation and degradation pathway is a general feature in tumors across a wide range of tissues, as evidenced by data from The Cancer Genome Atlas, as well as in the noncancerous proliferative disease psoriasis. We conclude that PAPD5 and PARN mediate degradation of oncogenic miRNA miR-21 through a tailing and trimming process, and that this pathway is disrupted in cancer and other proliferative diseases.


Asunto(s)
Adenina/metabolismo , MicroARNs/metabolismo , Neoplasias/genética , ARN Nucleotidiltransferasas/metabolismo , Estabilidad del ARN , Secuencia de Bases , Citosina/metabolismo , Exorribonucleasas/metabolismo , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Técnicas de Silenciamiento del Gen , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Células MCF-7 , MicroARNs/química , MicroARNs/genética , Modelos Biológicos , Datos de Secuencia Molecular , Neoplasias/patología , Conformación de Ácido Nucleico , Isoformas de Proteínas/química , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Ribonucleasa III/metabolismo
9.
J Allergy Clin Immunol ; 136(3): 638-48, 2015 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-25863981

RESUMEN

BACKGROUND: Children with problematic severe asthma have poor disease control despite high doses of inhaled corticosteroids and additional therapy, leading to personal suffering, early deterioration of lung function, and significant consumption of health care resources. If no exacerbating factors, such as smoking or allergies, are found after extensive investigation, these children are given a diagnosis of therapy-resistant (or therapy-refractory) asthma (SA). OBJECTIVE: We sought to deepen our understanding of childhood SA by analyzing gene expression and modeling the underlying regulatory transcription factor networks in peripheral blood leukocytes. METHODS: Gene expression was analyzed by using Cap Analysis of Gene Expression in children with SA (n = 13), children with controlled persistent asthma (n = 15), and age-matched healthy control subjects (n = 9). Cap Analysis of Gene Expression sequencing detects the transcription start sites of known and novel mRNAs and noncoding RNAs. RESULTS: Sample groups could be separated by hierarchical clustering on 1305 differentially expressed transcription start sites, including 816 known genes and several novel transcripts. Ten of 13 tested novel transcripts were validated by means of RT-PCR and Sanger sequencing. Expression of RAR-related orphan receptor A (RORA), which has been linked to asthma in genome-wide association studies, was significantly upregulated in patients with SA. Gene network modeling revealed decreased glucocorticoid receptor signaling and increased activity of the mitogen-activated protein kinase and Jun kinase cascades in patients with SA. CONCLUSION: Circulating leukocytes from children with controlled asthma and those with SA have distinct gene expression profiles, demonstrating the possible development of specific molecular biomarkers and supporting the need for novel therapeutic approaches.


Asunto(s)
Asma/tratamiento farmacológico , Asma/genética , Resistencia a Medicamentos/genética , Glucocorticoides/uso terapéutico , ARN Mensajero/genética , Transcriptoma , Adolescente , Asma/patología , Estudios de Casos y Controles , Niño , Preescolar , Femenino , Perfilación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Humanos , Proteínas Quinasas JNK Activadas por Mitógenos/genética , Masculino , Miembro 1 del Grupo F de la Subfamilia 1 de Receptores Nucleares/genética , Receptores de Glucocorticoides/genética , Índice de Severidad de la Enfermedad
10.
PLoS One ; 19(5): e0295971, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38709794

RESUMEN

The human genome is pervasively transcribed and produces a wide variety of long non-coding RNAs (lncRNAs), constituting the majority of transcripts across human cell types. Some specific nuclear lncRNAs have been shown to be important regulatory components acting locally. As RNA-chromatin interaction and Hi-C chromatin conformation data showed that chromatin interactions of nuclear lncRNAs are determined by the local chromatin 3D conformation, we used Hi-C data to identify potential target genes of lncRNAs. RNA-protein interaction data suggested that nuclear lncRNAs act as scaffolds to recruit regulatory proteins to target promoters and enhancers. Nuclear lncRNAs may therefore play a role in directing regulatory factors to locations spatially close to the lncRNA gene. We provide the analysis results through an interactive visualization web portal at https://fantom.gsc.riken.jp/zenbu/reports/#F6_3D_lncRNA.


Asunto(s)
Cromatina , ARN Largo no Codificante , ARN Largo no Codificante/genética , ARN Largo no Codificante/metabolismo , Cromatina/metabolismo , Cromatina/genética , Humanos , Anotación de Secuencia Molecular , Núcleo Celular/metabolismo , Núcleo Celular/genética , Genoma Humano , Regiones Promotoras Genéticas
11.
Genome Res ; 20(10): 1398-410, 2010 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-20719920

RESUMEN

Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after knockdown of nucleotidyltransferase enzymes. The PAPD4 nucleotidyltransferase adenylates a wide range of miRNA loci, but adenylation does not appear to affect miRNA stability on a genome-wide scale. Adenine addition appears to reduce effectiveness of miRNA targeting of mRNA transcripts while deep-sequencing of RNA bound to immunoprecipitated Argonaute (AGO) subfamily proteins EIF2C1-EIF2C3 revealed substantial reduction of adenine addition in miRNA associated with EIF2C2 and EIF2C3. Our findings show 3' addition events are widespread and conserved across animals, PAPD4 is a primary miRNA adenylating enzyme, and suggest a role for 3' adenine addition in modulating miRNA effectiveness, possibly through interfering with incorporation into the RNA-induced silencing complex (RISC), a regulatory role that would complement the role of miRNA uridylation in blocking DICER1 uptake.


Asunto(s)
Regiones no Traducidas 3'/genética , Adenina/metabolismo , MicroARNs/metabolismo , Nucleotidiltransferasas/metabolismo , Animales , Proteínas Argonautas , Línea Celular , Factor 2 Eucariótico de Iniciación/metabolismo , Factores Eucarióticos de Iniciación/metabolismo , Humanos , Ratones , MicroARNs/química , MicroARNs/genética , Monocitos , Nucleotidiltransferasas/genética , Polinucleotido Adenililtransferasa , Estabilidad del ARN , Factores de Escisión y Poliadenilación de ARNm
12.
Genome Res ; 20(2): 257-64, 2010 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-20051556

RESUMEN

MicroRNAs (miRNAs) are short (20-23 nt) RNAs that are sequence-specific mediators of transcriptional and post-transcriptional regulation of gene expression. Modern high-throughput technologies enable deep sequencing of such RNA species on an unprecedented scale. We find that the analysis of small RNA deep-sequencing libraries can be affected by cross-mapping, in which RNA sequences originating from one locus are inadvertently mapped to another. Similar to cross-hybridization on microarrays, cross-mapping is prevalent among miRNAs, as they tend to occur in families, are similar or derived from repeat or structural RNAs, or are post-transcriptionally modified. Here, we develop a strategy to correct for cross-mapping, and apply it to the analysis of RNA editing in mature miRNAs. In contrast to previous reports, our analysis suggests that RNA editing in mature miRNAs is rare in animals.


Asunto(s)
Biblioteca de Genes , MicroARNs/genética , Edición de ARN/genética , Alineación de Secuencia/métodos , Análisis de Secuencia de ARN/métodos , Animales , Secuencia de Bases , Ensayos Analíticos de Alto Rendimiento , Humanos , Ratones , MicroARNs/metabolismo
13.
Nucleic Acids Res ; 39(9): e59, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21310714

RESUMEN

The application of isothermal amplification technologies is rapidly expanding and currently covers different areas such as infectious disease, genetic disorder and drug dosage adjustment. Meanwhile, many of such technologies have complex reaction processes and often require a fine-tuned primer set where existing primer design tools are not sufficient. We have developed a primer selection system for one important primer, the turn-back primer (TP), which is commonly used in loop-mediated amplification (LAMP) and smart amplification process (SmartAmp). We chose 78 parameters related to the primer and target sequence, and explored their relationship to amplification speed using experimental data for 1344 primer combinations. We employed the least absolute shrinkage and selection operator (LASSO) method for parameter selection and estimation of their numerical coefficients. We subsequently evaluated our prediction model using additional independent experiments and compared to the LAMP primer design tool, Primer Explorer version4 (PE4). The evaluation showed that our approach yields a superior primer design in isothermal amplification and is robust against variations in the experimental setup. Our LASSO regression analysis revealed that availability of the 3'- and 5'-end of the primer are particularly important factors for efficient isothermal amplification. Our computer script is freely available at: http://gerg.gsc.riken.jp/TP_optimization/.


Asunto(s)
Cartilla de ADN/química , Técnicas de Amplificación de Ácido Nucleico , Humanos , Programas Informáticos , Temperatura
14.
Biochemistry ; 51(31): 6056-67, 2012 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-22765348

RESUMEN

Nucleic acid oligonucleotides are widely used in hybridization experiments for specific detection of complementary nucleic acid sequences. For design and application of oligonucleotides, an understanding of their thermodynamic properties is essential. Recently, exciton-controlled hybridization-sensitive fluorescent oligonucleotides (ECHOs) were developed as uniquely labeled DNA oligomers containing commonly one thymidine having two covalently linked thiazole orange dye moieties. The fluorescent signal of an ECHO is strictly hybridization-controlled, where the dye moieties have to intercalate into double-stranded DNA for signal generation. Here we analyzed the hybridization thermodynamics of ECHO/DNA duplexes, and thermodynamic parameters were obtained from melting curves of 64 ECHO/DNA duplexes measured by ultraviolet absorbance and fluorescence. Both methods demonstrated a substantial increase in duplex stability (ΔΔG°(37) ~ -2.6 ± 0.7 kcal mol(-1)) compared to that of DNA/DNA duplexes of the same sequence. With the exception of T·G mismatches, this increased stability was mostly unaffected by other mismatches in the position opposite the labeled nucleotide. A nearest neighbor model was constructed for predicting thermodynamic parameters for duplex stability. Evaluation of the nearest neighbor parameters by cross validation tests showed higher predictive reliability for the fluorescence-based than the absorbance-based parameters. Using our experimental data, a tool for predicting the thermodynamics of formation of ECHO/DNA duplexes was developed that is freely available at http://genome.gsc.riken.jp/echo/thermodynamics/. It provides reliable thermodynamic data for using the unique features of ECHOs in fluorescence-based experiments.


Asunto(s)
Benzotiazoles/química , ADN/química , Quinolinas/química , Timidina/química , Disparidad de Par Base , Secuencia de Bases , ADN/genética , Diseño de Fármacos , Colorantes Fluorescentes/química , Colorantes Fluorescentes/metabolismo , Modelos Moleculares , Conformación de Ácido Nucleico , Desnaturalización de Ácido Nucleico , Hibridación de Ácido Nucleico , Oligodesoxirribonucleótidos/química , Oligodesoxirribonucleótidos/genética , Termodinámica , Temperatura de Transición
15.
BMC Genom Data ; 22(1): 33, 2021 09 14.
Artículo en Inglés | MEDLINE | ID: mdl-34521352

RESUMEN

BACKGROUND: The lymphatic and the blood vasculature are closely related systems that collaborate to ensure the organism's physiological function. Despite their common developmental origin, they present distinct functional fates in adulthood that rely on robust lineage-specific regulatory programs. The recent technological boost in sequencing approaches unveiled long noncoding RNAs (lncRNAs) as prominent regulatory players of various gene expression levels in a cell-type-specific manner. RESULTS: To investigate the potential roles of lncRNAs in vascular biology, we performed antisense oligonucleotide (ASO) knockdowns of lncRNA candidates specifically expressed either in human lymphatic or blood vascular endothelial cells (LECs or BECs) followed by Cap Analysis of Gene Expression (CAGE-Seq). Here, we describe the quality control steps adopted in our analysis pipeline before determining the knockdown effects of three ASOs per lncRNA target on the LEC or BEC transcriptomes. In this regard, we especially observed that the choice of negative control ASOs can dramatically impact the conclusions drawn from the analysis depending on the cellular background. CONCLUSION: In conclusion, the comparison of negative control ASO effects on the targeted cell type transcriptomes highlights the essential need to select a proper control set of multiple negative control ASO based on the investigated cell types.


Asunto(s)
Técnicas de Silenciamiento del Gen/métodos , Oligonucleótidos Antisentido/genética , Especificidad de Órganos/genética , ARN Largo no Codificante/genética , Adulto , Células Endoteliales/metabolismo , Técnicas de Silenciamiento del Gen/normas , Humanos , Sistema Linfático/citología , Sistema Linfático/metabolismo , Oligonucleótidos Antisentido/normas , Transcriptoma
16.
Nat Commun ; 12(1): 925, 2021 02 10.
Artículo en Inglés | MEDLINE | ID: mdl-33568674

RESUMEN

Recent studies have revealed the importance of long noncoding RNAs (lncRNAs) as tissue-specific regulators of gene expression. There is ample evidence that distinct types of vasculature undergo tight transcriptional control to preserve their structure, identity, and functions. We determine a comprehensive map of lineage-specific lncRNAs in human dermal lymphatic and blood vascular endothelial cells (LECs and BECs), combining RNA-Seq and CAGE-Seq. Subsequent antisense oligonucleotide-knockdown transcriptomic profiling of two LEC- and two BEC-specific lncRNAs identifies LETR1 as a critical gatekeeper of the global LEC transcriptome. Deep RNA-DNA, RNA-protein interaction studies, and phenotype rescue analyses reveal that LETR1 is a nuclear trans-acting lncRNA modulating, via key epigenetic factors, the expression of essential target genes, including KLF4 and SEMA3C, governing the growth and migratory ability of LECs. Together, our study provides several lines of evidence supporting the intriguing concept that every cell type expresses precise lncRNA signatures to control lineage-specific regulatory programs.


Asunto(s)
Células Endoteliales/citología , Factores de Transcripción de Tipo Kruppel/metabolismo , Semaforinas/metabolismo , Movimiento Celular , Proliferación Celular , Células Endoteliales/metabolismo , Regulación de la Expresión Génica , Humanos , Factor 4 Similar a Kruppel , Factores de Transcripción de Tipo Kruppel/genética , ARN Largo no Codificante , Semaforinas/genética
17.
Nat Commun ; 12(1): 3297, 2021 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-34078885

RESUMEN

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism.


Asunto(s)
Repeticiones de Microsatélite , Redes Neurales de la Computación , Enfermedades Neurodegenerativas/genética , Sitio de Iniciación de la Transcripción , Iniciación de la Transcripción Genética , Células A549 , Animales , Secuencia de Bases , Biología Computacional/métodos , Aprendizaje Profundo , Elementos de Facilitación Genéticos , Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Ratones , Enfermedades Neurodegenerativas/diagnóstico , Enfermedades Neurodegenerativas/metabolismo , Polimorfismo Genético , Regiones Promotoras Genéticas
18.
Bioinformatics ; 25(19): 2613-4, 2009 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-19605420

RESUMEN

UNLABELLED: Multi-mapping sequence tags are a significant impediment to short-read sequencing platforms. These tags are routinely omitted from further analysis, leading to experimental bias and reduced coverage. Here, we present MuMRescueLite, a low-resource requirement version of the MuMRescue software that has been used by several next generation sequencing projects to probabilistically reincorporate multi-mapping tags into mapped short read data. AVAILABILITY AND IMPLEMENTATION: MuMRescueLite is written in Python; executables and documentation are available from http://genome.gsc.riken.jp/osc/english/software/.


Asunto(s)
Biología Computacional/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos
19.
Bioinformatics ; 25(11): 1422-3, 2009 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-19304878

RESUMEN

SUMMARY: The Biopython project is a mature open source international collaboration of volunteer developers, providing Python libraries for a wide range of bioinformatics problems. Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macro molecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning. AVAILABILITY: Biopython is freely available, with documentation and source code at (www.biopython.org) under the Biopython license.


Asunto(s)
Biología Computacional/métodos , Programas Informáticos , Bases de Datos Factuales , Internet , Lenguajes de Programación
20.
BMC Bioinformatics ; 8: 47, 2007 Feb 08.
Artículo en Inglés | MEDLINE | ID: mdl-17286872

RESUMEN

BACKGROUND: Computational prediction methods are currently used to identify genes in prokaryote genomes. However, identification of the correct translation initiation sites remains a difficult task. Accurate translation initiation sites (TISs) are important not only for the annotation of unknown proteins but also for the prediction of operons, promoters, and small non-coding RNA genes, as this typically makes use of the intergenic distance. A further problem is that most existing methods are optimized for Escherichia coli data sets; applying these methods to newly sequenced bacterial genomes may not result in an equivalent level of accuracy. RESULTS: Based on a biological representation of the translation process, we applied Bayesian statistics to create a score function for predicting translation initiation sites. In contrast to existing programs, our combination of methods uses supervised learning to optimally use the set of known translation initiation sites. We combined the Ribosome Binding Site (RBS) sequence, the distance between the translation initiation site and the RBS sequence, the base composition of the start codon, the nucleotide composition (A-rich sequences) following start codons, and the expected distribution of the protein length in a Bayesian scoring function. To further increase the prediction accuracy, we also took into account the operon orientation. The outcome of the procedure achieved a prediction accuracy of 93.2% in 858 E. coli genes from the EcoGene data set and 92.7% accuracy in a data set of 1243 Bacillus subtilis 'non-y' genes. We confirmed the performance in the GC-rich Gamma-Proteobacteria Herminiimonas arsenicoxydans, Pseudomonas aeruginosa, and Burkholderia pseudomallei K96243. CONCLUSION: Hon-yaku, being based on a careful choice of elements important in translation, improved the prediction accuracy in B. subtilis data sets and other bacteria except for E. coli. We believe that most remaining mispredictions are due to atypical ribosomal binding sequences used in specific translation control processes, or likely errors in the training data sets.


Asunto(s)
Algoritmos , Mapeo Cromosómico/métodos , ADN Bacteriano/genética , Iniciación de la Cadena Peptídica Traduccional/genética , Biosíntesis de Proteínas/genética , Análisis de Secuencia de ADN/métodos , Sitio de Iniciación de la Transcripción , Teorema de Bayes , Biomimética/métodos , Bases de Datos Genéticas , Reconocimiento de Normas Patrones Automatizadas/métodos , Programas Informáticos , Biología de Sistemas/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA