Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 116
Filtrar
Mais filtros

País/Região como assunto
Intervalo de ano de publicação
1.
Genome Res ; 2022 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-35961773

RESUMO

In eukaryotes, capped RNAs include long transcripts such as messenger RNAs and long noncoding RNAs, as well as shorter transcripts such as spliceosomal RNAs, small nucleolar RNAs, and enhancer RNAs. Long capped transcripts can be profiled using cap analysis gene expression (CAGE) sequencing and other methods. Here, we describe a sequencing library preparation protocol for short capped RNAs, apply it to a differentiation time course of the human cell line THP-1, and systematically compare the landscape of short capped RNAs to that of long capped RNAs. Transcription initiation peaks associated with genes in the sense direction have a strong preference to produce either long or short capped RNAs, with one out of six peaks detected in the short capped RNA libraries only. Gene-associated short capped RNAs have highly specific 3' ends, typically overlapping splice sites. Enhancers also preferentially generate either short or long capped RNAs, with 10% of enhancers observed in the short capped RNA libraries only. Enhancers producing either short or long capped RNAs show enrichment for GWAS-associated disease SNPs. We conclude that deep sequencing of short capped RNAs reveals new families of noncoding RNAs and elucidates the diversity of transcripts generated at known and novel promoters and enhancers.

2.
Jpn J Clin Oncol ; 54(1): 38-46, 2024 Jan 07.
Artigo em Inglês | MEDLINE | ID: mdl-37815156

RESUMO

OBJECTIVE: Endometrial cancer is the most common gynaecological cancer, and most patients are identified during early disease stages. Noninvasive evaluation of lymph node metastasis likely will improve the quality of clinical treatment, for example, by omitting unnecessary lymphadenectomy. METHODS: The study population comprised 611 patients with endometrial cancer who underwent lymphadenectomy at four types of institutions, comprising seven hospitals in total. We systematically assessed the association of 18 preoperative clinical variables with postoperative lymph node metastasis. We then constructed statistical models for preoperative lymph node metastasis prediction and assessed their performance with a previously proposed system, in which the score was determined by counting the number of high-risk variables among the four predefined ones. RESULTS: Of the preoperative 18 variables evaluated, 10 were significantly associated with postoperative lymph node metastasis. A logistic regression model achieved an area under the curve of 0.85 in predicting lymph node metastasis; this value is significantly higher than that from the previous system (area under the curve, 0.74). When we set the false-negative rate to ~1%, the new predictive model increased the rate of true negatives to 21%, compared with 6.8% from the previous one. We also provide a spreadsheet-based tool for further evaluation of its ability to predict lymph node metastasis in endometrial cancer. CONCLUSIONS: Our new lymph node metastasis prediction method, which was based solely on preoperative clinical variables, performed significantly better than the previous method. Although additional evaluation is necessary for its clinical use, our noninvasive system may help improve the clinical treatment of endometrial cancer, complementing minimally invasive sentinel lymph node biopsy.


Assuntos
Neoplasias do Endométrio , Biópsia de Linfonodo Sentinela , Feminino , Humanos , Metástase Linfática/patologia , Excisão de Linfonodo , Neoplasias do Endométrio/cirurgia , Neoplasias do Endométrio/patologia , Modelos Estatísticos , Linfonodos/cirurgia , Linfonodos/patologia
3.
Genome Res ; 30(7): 951-961, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32718981

RESUMO

Gene expression profiles in homologous tissues have been observed to be different between species, which may be due to differences between species in the gene expression program in each cell type, but may also reflect differences in cell type composition of each tissue in different species. Here, we compare expression profiles in matching primary cells in human, mouse, rat, dog, and chicken using Cap Analysis Gene Expression (CAGE) and short RNA (sRNA) sequencing data from FANTOM5. While we find that expression profiles of orthologous genes in different species are highly correlated across cell types, in each cell type many genes were differentially expressed between species. Expression of genes with products involved in transcription, RNA processing, and transcriptional regulation was more likely to be conserved, while expression of genes encoding proteins involved in intercellular communication was more likely to have diverged during evolution. Conservation of expression correlated positively with the evolutionary age of genes, suggesting that divergence in expression levels of genes critical for cell function was restricted during evolution. Motif activity analysis showed that both promoters and enhancers are activated by the same transcription factors in different species. An analysis of expression levels of mature miRNAs and of primary miRNAs identified by CAGE revealed that evolutionary old miRNAs are more likely to have conserved expression patterns than young miRNAs. We conclude that key aspects of the regulatory network are conserved, while differential expression of genes involved in cell-to-cell communication may contribute greatly to phenotypic differences between species.


Assuntos
Evolução Molecular , Transcriptoma , Animais , Galinhas/genética , Cães , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Camundongos , MicroRNAs/metabolismo , Motivos de Nucleotídeos , Análise de Componente Principal , Regiões Promotoras Genéticas , Ratos , Especificidade da Espécie , Fatores de Transcrição/metabolismo
4.
Genome Res ; 30(7): 1073-1081, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32079618

RESUMO

Long noncoding RNAs (lncRNAs) have emerged as key coordinators of biological and cellular processes. Characterizing lncRNA expression across cells and tissues is key to understanding their role in determining phenotypes, including human diseases. We present here FC-R2, a comprehensive expression atlas across a broadly defined human transcriptome, inclusive of over 109,000 coding and noncoding genes, as described in the FANTOM CAGE-Associated Transcriptome (FANTOM-CAT) study. This atlas greatly extends the gene annotation used in the original recount2 resource. We demonstrate the utility of the FC-R2 atlas by reproducing key findings from published large studies and by generating new results across normal and diseased human samples. In particular, we (a) identify tissue-specific transcription profiles for distinct classes of coding and noncoding genes, (b) perform differential expression analysis across thirteen cancer types, identifying novel noncoding genes potentially involved in tumor pathogenesis and progression, and (c) confirm the prognostic value for several enhancer lncRNAs expression in cancer. Our resource is instrumental for the systematic molecular characterization of lncRNA by the FANTOM6 Consortium. In conclusion, comprised of over 70,000 samples, the FC-R2 atlas will empower other researchers to investigate functions and biological roles of both known coding genes and novel lncRNAs.


Assuntos
Transcriptoma , Bases de Dados Genéticas , Elementos Facilitadores Genéticos , Perfilação da Expressão Gênica , Genoma Humano , Humanos , Neoplasias/genética , Especificidade de Órgãos , Prognóstico , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , RNA Mensageiro/metabolismo
5.
Genome Res ; 30(7): 1060-1072, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32718982

RESUMO

Long noncoding RNAs (lncRNAs) constitute the majority of transcripts in the mammalian genomes, and yet, their functions remain largely unknown. As part of the FANTOM6 project, we systematically knocked down the expression of 285 lncRNAs in human dermal fibroblasts and quantified cellular growth, morphological changes, and transcriptomic responses using Capped Analysis of Gene Expression (CAGE). Antisense oligonucleotides targeting the same lncRNAs exhibited global concordance, and the molecular phenotype, measured by CAGE, recapitulated the observed cellular phenotypes while providing additional insights on the affected genes and pathways. Here, we disseminate the largest-to-date lncRNA knockdown data set with molecular phenotyping (over 1000 CAGE deep-sequencing libraries) for further exploration and highlight functional roles for ZNF213-AS1 and lnc-KHDC3L-2.


Assuntos
RNA Longo não Codificante/fisiologia , Processos de Crescimento Celular/genética , Movimento Celular/genética , Fibroblastos/citologia , Fibroblastos/metabolismo , Humanos , Canais de Potássio KCNQ/metabolismo , Anotação de Sequência Molecular , Oligonucleotídeos Antissenso , RNA Longo não Codificante/antagonistas & inibidores , RNA Longo não Codificante/metabolismo , RNA Interferente Pequeno
6.
Nature ; 543(7644): 199-204, 2017 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-28241135

RESUMO

Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.


Assuntos
Bases de Dados Genéticas , RNA Longo não Codificante/química , RNA Longo não Codificante/genética , Transcriptoma/genética , Células Cultivadas , Sequência Conservada/genética , Conjuntos de Dados como Assunto , Elementos Facilitadores Genéticos/genética , Epigênese Genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Genômica , Humanos , Internet , Anotação de Sequência Molecular , Especificidade de Órgãos/genética , Polimorfismo de Nucleotídeo Único , Regiões Promotoras Genéticas/genética , Locos de Características Quantitativas/genética , Estabilidade de RNA , RNA Mensageiro/genética
7.
Cancer Sci ; 112(2): 884-892, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-33280191

RESUMO

Discrimination of Philadelphia-negative myeloproliferative neoplasms (Ph-MPNs) from reactive hypercytosis and myelofibrosis requires a constellation of testing including driver mutation analysis and bone marrow biopsies. We searched for a biomarker that can more easily distinguish Ph-MPNs from reactive hypercytosis and myelofibrosis by using RNA-seq analysis utilizing platelet-rich plasma (PRP)-derived RNAs from patients with essential thrombocythemia (ET) and reactive thrombocytosis, and CREB3L1 was found to have an extremely high impact in discriminating the two disorders. To validate and further explore the result, expression levels of CREB3L1 in PRP were quantified by reverse-transcription quantitative PCR and compared among patients with ET, other Ph-MPNs, chronic myeloid leukemia (CML), and reactive hypercytosis and myelofibrosis. A CREB3L1 expression cutoff value determined based on PRP of 18 healthy volunteers accurately discriminated 150 driver mutation-positive Ph-MPNs from other entities (71 reactive hypercytosis and myelofibrosis, 6 CML, and 18 healthy volunteers) and showed both sensitivity and specificity of 1.0000. Importantly, CREB3L1 expression levels were significantly higher in ET compared with reactive thrombocytosis (P < .0001), and polycythemia vera compared with reactive erythrocytosis (P < .0001). Pathology-affirmed triple-negative ET (TN-ET) patients were divided into a high- and low-CREB3L1-expression group, and some patients in the low-expression group achieved a spontaneous remission during the clinical course. In conclusion, CREB3L1 analysis has the potential to single-handedly discriminate driver mutation-positive Ph-MPNs from reactive hypercytosis and myelofibrosis, and also may identify a subgroup within TN-ET showing distinct clinical features including spontaneous remission.


Assuntos
Biomarcadores Tumorais/sangue , Proteína de Ligação ao Elemento de Resposta ao AMP Cíclico/sangue , Transtornos Mieloproliferativos/diagnóstico , Proteínas do Tecido Nervoso/sangue , Diagnóstico Diferencial , Humanos , Leucemia Mielogênica Crônica BCR-ABL Positiva/sangue , Leucemia Mielogênica Crônica BCR-ABL Positiva/diagnóstico , Transtornos Mieloproliferativos/sangue
8.
PLoS Biol ; 15(9): e2002887, 2017 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-28873399

RESUMO

Cap Analysis of Gene Expression (CAGE) in combination with single-molecule sequencing technology allows precision mapping of transcription start sites (TSSs) and genome-wide capture of promoter activities in differentiated and steady state cell populations. Much less is known about whether TSS profiling can characterize diverse and non-steady state cell populations, such as the approximately 400 transitory and heterogeneous cell types that arise during ontogeny of vertebrate animals. To gain such insight, we used the chick model and performed CAGE-based TSS analysis on embryonic samples covering the full 3-week developmental period. In total, 31,863 robust TSS peaks (>1 tag per million [TPM]) were mapped to the latest chicken genome assembly, of which 34% to 46% were active in any given developmental stage. ZENBU, a web-based, open-source platform, was used for interactive data exploration. TSSs of genes critical for lineage differentiation could be precisely mapped and their activities tracked throughout development, suggesting that non-steady state and heterogeneous cell populations are amenable to CAGE-based transcriptional analysis. Our study also uncovered a large set of extremely stable housekeeping TSSs and many novel stage-specific ones. We furthermore demonstrated that TSS mapping could expedite motif-based promoter analysis for regulatory modules associated with stage-specific and housekeeping genes. Finally, using Brachyury as an example, we provide evidence that precise TSS mapping in combination with Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-on technology enables us, for the first time, to efficiently target endogenous avian genes for transcriptional activation. Taken together, our results represent the first report of genome-wide TSS mapping in birds and the first systematic developmental TSS analysis in any amniote species (birds and mammals). By facilitating promoter-based molecular analysis and genetic manipulation, our work also underscores the value of avian models in unravelling the complex regulatory mechanism of cell lineage specification during amniote development.


Assuntos
Desenvolvimento Embrionário , Estudo de Associação Genômica Ampla , Sítio de Iniciação de Transcrição , Animais , Evolução Biológica , Embrião de Galinha , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas
9.
Nature ; 507(7493): 462-70, 2014 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-24670764

RESUMO

Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.


Assuntos
Atlas como Assunto , Anotação de Sequência Molecular , Regiões Promotoras Genéticas/genética , Transcriptoma/genética , Animais , Linhagem Celular , Células Cultivadas , Análise por Conglomerados , Sequência Conservada/genética , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Genes Essenciais/genética , Genoma/genética , Humanos , Camundongos , Fases de Leitura Aberta/genética , Especificidade de Órgãos , RNA Mensageiro/análise , RNA Mensageiro/genética , Fatores de Transcrição/metabolismo , Sítio de Iniciação de Transcrição , Transcrição Gênica/genética
10.
Nature ; 507(7493): 455-461, 2014 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-24670763

RESUMO

Enhancers control the correct temporal and cell-type-specific activation of gene expression in multicellular eukaryotes. Knowing their properties, regulatory activity and targets is crucial to understand the regulation of differentiation and homeostasis. Here we use the FANTOM5 panel of samples, covering the majority of human tissues and cell types, to produce an atlas of active, in vivo-transcribed enhancers. We show that enhancers share properties with CpG-poor messenger RNA promoters but produce bidirectional, exosome-sensitive, relatively short unspliced RNAs, the generation of which is strongly related to enhancer activity. The atlas is used to compare regulatory programs between different cells at unprecedented depth, to identify disease-associated regulatory single nucleotide polymorphisms, and to classify cell-type-specific and ubiquitous enhancers. We further explore the utility of enhancer redundancy, which explains gene expression strength rather than expression patterns. The online FANTOM5 enhancer atlas represents a unique resource for studies on cell-type-specific enhancers and gene regulation.


Assuntos
Atlas como Assunto , Elementos Facilitadores Genéticos/genética , Regulação da Expressão Gênica/genética , Anotação de Sequência Molecular , Especificidade de Órgãos , Linhagem Celular , Células Cultivadas , Análise por Conglomerados , Predisposição Genética para Doença/genética , Células HeLa , Humanos , Polimorfismo de Nucleotídeo Único/genética , Regiões Promotoras Genéticas/genética , RNA Mensageiro/biossíntese , RNA Mensageiro/genética , Sítio de Iniciação de Transcrição , Iniciação da Transcrição Genética
11.
Nucleic Acids Res ; 46(22): 11898-11909, 2018 12 14.
Artigo em Inglês | MEDLINE | ID: mdl-30407537

RESUMO

MicroRNAs (miRNAs) modulate the post-transcriptional regulation of target genes and are related to biology of complex human traits, but genetic landscape of miRNAs remains largely unknown. Given the strikingly tissue-specific miRNA expression profiles, we here expand a previous method to quantitatively evaluate enrichment of genome-wide association study (GWAS) signals on miRNA-target gene networks (MIGWAS) to further estimate tissue-specific enrichment. Our approach integrates tissue-specific expression profiles of miRNAs (∼1800 miRNAs in 179 cells) with GWAS to test whether polygenic signals enrich in miRNA-target gene networks and whether they fall within specific tissues. We applied MIGWAS to 49 GWASs (nTotal = 3 520 246), and successfully identified biologically relevant tissues. Further, MIGWAS could point miRNAs as candidate biomarkers of the trait. As an illustrative example, we performed differentially expressed miRNA analysis between rheumatoid arthritis (RA) patients and healthy controls (n = 63). We identified novel biomarker miRNAs (e.g. hsa-miR-762) by integrating differentially expressed miRNAs with MIGWAS results for RA, as well as novel associated loci with significant genetic risk (rs56656810 at MIR762 at 16q11; n = 91 482, P = 3.6 × 10-8). Our result highlighted that miRNA-target gene network contributes to human disease genetics in a cell type-specific manner, which could yield an efficient screening of miRNAs as promising biomarkers.


Assuntos
Artrite Reumatoide/genética , Asma/genética , Colite Ulcerativa/genética , Redes Reguladoras de Genes , Genoma Humano , Doença de Graves/genética , MicroRNAs/genética , Algoritmos , Artrite Reumatoide/imunologia , Artrite Reumatoide/patologia , Asma/imunologia , Asma/patologia , Biomarcadores/metabolismo , Estudos de Casos e Controles , Colite Ulcerativa/imunologia , Colite Ulcerativa/patologia , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Loci Gênicos , Estudo de Associação Genômica Ampla , Doença de Graves/imunologia , Doença de Graves/patologia , Humanos , MicroRNAs/classificação , MicroRNAs/metabolismo , Herança Multifatorial/genética , Herança Multifatorial/imunologia , Especificidade de Órgãos , Transdução de Sinais
12.
PLoS Genet ; 13(3): e1006641, 2017 03.
Artigo em Inglês | MEDLINE | ID: mdl-28263993

RESUMO

The FANTOM5 consortium utilised cap analysis of gene expression (CAGE) to provide an unprecedented insight into transcriptional regulation in human cells and tissues. In the current study, we have used CAGE-based transcriptional profiling on an extended dense time course of the response of human monocyte-derived macrophages grown in macrophage colony-stimulating factor (CSF1) to bacterial lipopolysaccharide (LPS). We propose that this system provides a model for the differentiation and adaptation of monocytes entering the intestinal lamina propria. The response to LPS is shown to be a cascade of successive waves of transient gene expression extending over at least 48 hours, with hundreds of positive and negative regulatory loops. Promoter analysis using motif activity response analysis (MARA) identified some of the transcription factors likely to be responsible for the temporal profile of transcriptional activation. Each LPS-inducible locus was associated with multiple inducible enhancers, and in each case, transient eRNA transcription at multiple sites detected by CAGE preceded the appearance of promoter-associated transcripts. LPS-inducible long non-coding RNAs were commonly associated with clusters of inducible enhancers. We used these data to re-examine the hundreds of loci associated with susceptibility to inflammatory bowel disease (IBD) in genome-wide association studies. Loci associated with IBD were strongly and specifically (relative to rheumatoid arthritis and unrelated traits) enriched for promoters that were regulated in monocyte differentiation or activation. Amongst previously-identified IBD susceptibility loci, the vast majority contained at least one promoter that was regulated in CSF1-dependent monocyte-macrophage transitions and/or in response to LPS. On this basis, we concluded that IBD loci are strongly-enriched for monocyte-specific genes, and identified at least 134 additional candidate genes associated with IBD susceptibility from reanalysis of published GWA studies. We propose that dysregulation of monocyte adaptation to the environment of the gastrointestinal mucosa is the key process leading to inflammatory bowel disease.


Assuntos
Doenças Inflamatórias Intestinais/genética , Macrófagos/citologia , Monócitos/citologia , Transcriptoma , Motivos de Aminoácidos , Diferenciação Celular , Citocinas/metabolismo , Regulação da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genômica , Humanos , Inflamação , Doenças Inflamatórias Intestinais/etiologia , Mucosa Intestinal/metabolismo , Ligantes , Lipopolissacarídeos/farmacologia , Fator Estimulador de Colônias de Macrófagos/farmacologia , Família Multigênica , Regiões Promotoras Genéticas , Fatores de Tempo , Fatores de Transcrição/metabolismo , Transcrição Gênica , Ativação Transcricional
13.
BMC Genomics ; 20(1): 718, 2019 Sep 18.
Artigo em Inglês | MEDLINE | ID: mdl-31533632

RESUMO

BACKGROUND: The work of the FANTOM5 Consortium has brought forth a new level of understanding of the regulation of gene transcription and the cellular processes involved in creating diversity of cell types. In this study, we extended the analysis of the FANTOM5 Cap Analysis of Gene Expression (CAGE) transcriptome data to focus on understanding the genetic regulators involved in mouse cerebellar development. RESULTS: We used the HeliScopeCAGE library sequencing on cerebellar samples over 8 embryonic and 4 early postnatal times. This study showcases temporal expression pattern changes during cerebellar development. Through a bioinformatics analysis that focused on transcription factors, their promoters and binding sites, we identified genes that appear as strong candidates for involvement in cerebellar development. We selected several candidate transcriptional regulators for validation experiments including qRT-PCR and shRNA transcript knockdown. We observed marked and reproducible developmental defects in Atf4, Rfx3, and Scrt2 knockdown embryos, which support the role of these genes in cerebellar development. CONCLUSIONS: The successful identification of these novel gene regulators in cerebellar development demonstrates that the FANTOM5 cerebellum time series is a high-quality transcriptome database for functional investigation of gene regulatory networks in cerebellar development.


Assuntos
Cerebelo/crescimento & desenvolvimento , Perfilação da Expressão Gênica , Motivos de Nucleotídeos/genética , Transcrição Gênica/genética , Fator 4 Ativador da Transcrição/deficiência , Fator 4 Ativador da Transcrição/genética , Fator 4 Ativador da Transcrição/metabolismo , Animais , Cerebelo/embriologia , Cerebelo/metabolismo , Regulação da Expressão Gênica no Desenvolvimento , Técnicas de Silenciamento de Genes , Camundongos , Camundongos Endogâmicos C57BL , Regiões Promotoras Genéticas/genética , Fatores de Transcrição de Fator Regulador X/deficiência , Fatores de Transcrição de Fator Regulador X/genética , Fatores de Transcrição de Fator Regulador X/metabolismo , Fatores de Transcrição/deficiência , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
14.
PLoS Comput Biol ; 14(3): e1005934, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29494619

RESUMO

Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn's disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits.


Assuntos
Predisposição Genética para Doença/genética , Genômica/métodos , Regiões Promotoras Genéticas/genética , Doença de Crohn/genética , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Humanos , Transcriptoma/genética
15.
BMC Genomics ; 19(1): 39, 2018 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-29325522

RESUMO

CORRECTION: The authors of the original article [1] would like to recognize the critical contribution of core members of the FANTOM5 Consortium, who played the critical role of HeliScopeCAGE sequencing experiments, quality control of tag reads and processing of the raw sequencing data.

16.
J Cell Sci ; 129(13): 2573-85, 2016 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-27199372

RESUMO

Lymphangiogenesis plays a crucial role during development, in cancer metastasis and in inflammation. Activation of VEGFR-3 (also known as FLT4) by VEGF-C is one of the main drivers of lymphangiogenesis, but the transcriptional events downstream of VEGFR-3 activation are largely unknown. Recently, we identified a wave of immediate early transcription factors that are upregulated in human lymphatic endothelial cells (LECs) within the first 30 to 80 min after VEGFR-3 activation. Expression of these transcription factors must be regulated by additional pre-existing transcription factors that are rapidly activated by VEGFR-3 signaling. Using transcription factor activity analysis, we identified the homeobox transcription factor HOXD10 to be specifically activated at early time points after VEGFR-3 stimulation, and to regulate expression of immediate early transcription factors, including NR4A1. Gain- and loss-of-function studies revealed that HOXD10 is involved in LECs migration and formation of cord-like structures. Furthermore, HOXD10 regulates expression of VE-cadherin, claudin-5 and NOS3 (also known as e-NOS), and promotes lymphatic endothelial permeability. Taken together, these results reveal an important and unanticipated role of HOXD10 in the regulation of VEGFR-3 signaling in lymphatic endothelial cells, and in the control of lymphangiogenesis and permeability.


Assuntos
Proteínas de Homeodomínio/genética , Neoplasias/genética , Membro 1 do Grupo A da Subfamília 4 de Receptores Nucleares/genética , Fatores de Transcrição/genética , Fator C de Crescimento do Endotélio Vascular/genética , Receptor 3 de Fatores de Crescimento do Endotélio Vascular/genética , Linhagem Celular , Permeabilidade da Membrana Celular/genética , Movimento Celular/genética , Células Endoteliais/metabolismo , Células Endoteliais/patologia , Regulação Neoplásica da Expressão Gênica , Humanos , Linfangiogênese/genética , Metástase Neoplásica , Neoplasias/patologia , Transdução de Sinais , Fator C de Crescimento do Endotélio Vascular/biossíntese , Receptor 3 de Fatores de Crescimento do Endotélio Vascular/biossíntese
17.
Genome Res ; 25(10): 1546-57, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26228054

RESUMO

Promoters are central to the regulation of gene expression. Changes in gene regulation are thought to underlie much of the adaptive diversification between species and phenotypic variation within populations. In contrast to earlier work emphasizing the importance of enhancer evolution and subtle sequence changes at promoters, we show that dramatic changes such as the complete gain and loss (collectively, turnover) of functional promoters are common. Using quantitative measures of transcription initiation in both humans and mice across 52 matched tissues, we discriminate promoter sequence gains from losses and resolve the lineage of changes. We also identify expression divergence and functional turnover between orthologous promoters, finding only the latter is associated with local sequence changes. Promoter turnover has occurred at the majority (>56%) of protein-coding genes since humans and mice diverged. Tissue-restricted promoters are the most evolutionarily volatile where retrotransposition is an important, but not the sole, source of innovation. There is considerable heterogeneity of turnover rates between promoters in different tissues, but the consistency of these in both lineages suggests that the same biological systems are similarly inclined to transcriptional rewiring. The genes affected by promoter turnover show evidence of adaptive evolution. In mice, promoters are primarily lost through deletion of the promoter containing sequence, whereas in humans, many promoters appear to be gradually decaying with weak transcriptional output and relaxed selective constraint. Our results suggest that promoter gain and loss is an important process in the evolutionary rewiring of gene regulation and may be a significant source of phenotypic diversification.


Assuntos
Evolução Molecular , Regiões Promotoras Genéticas , Animais , Sequência de Bases , Sequência Conservada , DNA , Elementos de DNA Transponíveis , Humanos , Camundongos , Mutagênese Insercional , Deleção de Sequência , Especificidade da Espécie
18.
Cerebellum ; 17(3): 308-325, 2018 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-29307116

RESUMO

Laser-capture microdissection was used to isolate external germinal layer tissue from three developmental periods of mouse cerebellar development: embryonic days 13, 15, and 18. The cerebellar granule cell-enriched mRNA library was generated with next-generation sequencing using the Helicos technology. Our objective was to discover transcriptional regulators that could be important for the development of cerebellar granule cells-the most numerous neuron in the central nervous system. Through differential expression analysis, we have identified 82 differentially expressed transcription factors (TFs) from a total of 1311 differentially expressed genes. In addition, with TF-binding sequence analysis, we have identified 46 TF candidates that could be key regulators responsible for the variation in the granule cell transcriptome between developmental stages. Altogether, we identified 125 potential TFs (82 from differential expression analysis, 46 from motif analysis with 3 overlaps in the two sets). From this gene set, 37 TFs are considered novel due to the lack of previous knowledge about their roles in cerebellar development. The results from transcriptome-wide analyses were validated with existing online databases, qRT-PCR, and in situ hybridization. This study provides an initial insight into the TFs of cerebellar granule cells that might be important for development and provide valuable information for further functional studies on these transcriptional regulators.


Assuntos
Cerebelo/embriologia , Cerebelo/metabolismo , Neurônios/metabolismo , Fatores de Transcrição/metabolismo , Animais , Simulação por Computador , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Hibridização In Situ , Microdissecção e Captura a Laser , Camundongos Endogâmicos C57BL , Reação em Cadeia da Polimerase em Tempo Real , Transcriptoma
19.
Nucleic Acids Res ; 44(7): 3233-52, 2016 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-27001520

RESUMO

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs.


Assuntos
RNA Longo não Codificante/genética , Núcleo Celular/genética , Desenvolvimento Embrionário/genética , Regulação da Expressão Gênica , Humanos , Elementos Isolantes , Anotação de Sequência Molecular , Regiões Promotoras Genéticas , RNA Longo não Codificante/classificação , RNA Longo não Codificante/metabolismo , Retroviridae/genética , Biologia de Sistemas , Sequências Repetidas Terminais , Fatores de Transcrição/metabolismo
20.
Genome Res ; 24(4): 708-17, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24676093

RESUMO

CAGE (cap analysis gene expression) and RNA-seq are two major technologies used to identify transcript abundances as well as structures. They measure expression by sequencing from either the 5' end of capped molecules (CAGE) or tags randomly distributed along the length of a transcript (RNA-seq). Library protocols for clonally amplified (Illumina, SOLiD, 454 Life Sciences [Roche], Ion Torrent), second-generation sequencing platforms typically employ PCR preamplification prior to clonal amplification, while third-generation, single-molecule sequencers can sequence unamplified libraries. Although these transcriptome profiling platforms have been demonstrated to be individually reproducible, no systematic comparison has been carried out between them. Here we compare CAGE, using both second- and third-generation sequencers, and RNA-seq, using a second-generation sequencer based on a panel of RNA mixtures from two human cell lines to examine power in the discrimination of biological states, detection of differentially expressed genes, linearity of measurements, and quantification reproducibility. We found that the quantified levels of gene expression are largely comparable across platforms and conclude that CAGE and RNA-seq are complementary technologies that can be used to improve incomplete gene models. We also found systematic bias in the second- and third-generation platforms, which is likely due to steps such as linker ligation, cleavage by restriction enzymes, and PCR amplification. This study provides a perspective on the performance of these platforms, which will be a baseline in the design of further experiments to tackle complex transcriptomes uncovered in a wide range of cell types.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/genética , Transcriptoma/genética , Perfilação da Expressão Gênica , Humanos , Análise de Sequência de RNA/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA