RESUMO
N6-methyladenosine (m6A) is the most abundant internal eukaryotic mRNA modification, and is involved in the regulation of various biological processes. Direct Nanopore sequencing of native RNA (dRNA-seq) emerged as a leading approach for its identification. Several software were published for m6A detection and there is a strong need for independent studies benchmarking their performance on data from different species, and against various reference datasets. Moreover, a computational workflow is needed to streamline the execution of tools whose installation and execution remains complicated. We developed NanOlympicsMod, a Nextflow pipeline exploiting containerized technology for comparing 14 tools for m6A detection on dRNA-seq data. NanOlympicsMod was tested on dRNA-seq data generated from in vitro (un)modified synthetic oligos. The m6A hits returned by each tool were compared to the m6A position known by design of the oligos. In addition, NanOlympicsMod was used on dRNA-seq datasets from wild-type and m6A-depleted yeast, mouse and human, and each tool's hits were compared to reference m6A sets generated by leading orthogonal methods. The performance of the tools markedly differed across datasets, and methods adopting different approaches showed different preferences in terms of precision and recall. Changing the stringency cut-offs allowed for tuning the precision-recall trade-off towards user preferences. Finally, we determined that precision and recall of tools are markedly influenced by sequencing depth, and that additional sequencing would likely reveal additional m6A sites. Thanks to the possibility of including novel tools, NanOlympicsMod will streamline the benchmarking of m6A detection tools on dRNA-seq data, improving future RNA modification characterization.
Assuntos
Adenina/análogos & derivados , Sequenciamento por Nanoporos , Nanoporos , Humanos , Animais , Camundongos , RNA/genética , Benchmarking , Análise de Sequência de RNA/métodosRESUMO
A comprehensive understanding of molecular changes during brain aging is essential to mitigate cognitive decline and delay neurodegenerative diseases. The interpretation of mRNA alterations during brain aging is influenced by the health and age of the animal cohorts studied. Here, we carefully consider these factors and provide an in-depth investigation of mRNA splicing and dynamics in the aging mouse brain, combining short- and long-read sequencing technologies with extensive bioinformatic analyses. Our findings encompass a spectrum of age-related changes, including differences in isoform usage, decreased mRNA dynamics and a module showing increased expression of neuronal genes. Notably, our results indicate a reduced abundance of mRNA isoforms leading to nonsense-mediated RNA decay and suggest a regulatory role for RNA-binding proteins, indicating that their regulation may be altered leading to the reshaping of the aged brain transcriptome. Collectively, our study highlights the importance of studying mRNA splicing events during brain aging.
Assuntos
Processamento Alternativo , Encéfalo , Splicing de RNA , Animais , Camundongos , Encéfalo/metabolismo , Perfilação da Expressão Gênica/métodos , Splicing de RNA/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Transcriptoma/genéticaRESUMO
Post-transcriptional repression of gene expression by miRNAs occurs through transcript destabilization or translation inhibition. mRNA decay is known to account for most miRNA-dependent repression. However, because transcript decay occurs co-translationally, whether target translation is a requirement for miRNA-dependent transcript destabilization remains unknown. To decouple these two molecular processes, we used cytosolic long noncoding RNAs (lncRNAs) as models for endogenous transcripts that are not translated. We show that, despite interacting with the miRNA-loaded RNA-induced silencing complex, the steady-state abundance and decay rates of these transcripts are minimally affected by miRNA loss. To further validate the apparent requirement of translation for miRNA-dependent decay, we fused two lncRNA candidates to the 3'-end of a protein-coding gene reporter and found this results in their miRNA-dependent destabilization. Further analysis revealed that the few natural lncRNAs whose levels are regulated by miRNAs in mESCs tend to associate with translating ribosomes, and possibly represent misannotated micropeptides, further substantiating the necessity of target translation for miRNA-dependent transcript decay. In summary, our analyses suggest that translation is required for miRNA-dependent transcript destabilization, and demonstrate that the levels of coding and noncoding transcripts are differently affected by miRNAs.
Assuntos
MicroRNAs/genética , RNA Longo não Codificante/genética , RNA Mensageiro/química , RNA Mensageiro/metabolismo , Animais , Fusão Gênica Artificial , Linhagem Celular , Regulação da Expressão Gênica , Genes Reporter , Sequenciamento de Nucleotídeos em Larga Escala , Camundongos , Células-Tronco Embrionárias Murinas/citologia , Células-Tronco Embrionárias Murinas/metabolismo , Biossíntese de Proteínas , Estabilidade de RNA , Ribossomos/metabolismo , Análise de Sequência de RNARESUMO
The quantification of the kinetic rates of RNA synthesis, processing, and degradation are largely based on the integrative analysis of total and nascent transcription, the latter being quantified through RNA metabolic labeling. We developed INSPEcT-, a computational method based on the mathematical modeling of premature and mature RNA expression that is able to quantify kinetic rates from steady-state or time course total RNA-seq data without requiring any information on nascent transcripts. Our approach outperforms available solutions, closely recapitulates the kinetic rates obtained through RNA metabolic labeling, improves the ability to detect changes in transcript half-lives, reduces the cost and complexity of the experiments, and can be adopted to study experimental conditions in which nascent transcription cannot be readily profiled. Finally, we applied INSPEcT- to the characterization of post-transcriptional regulation landscapes in dozens of physiological and disease conditions. This approach was included in the INSPEcT Bioconductor package, which can now unveil RNA dynamics from steady-state or time course data, with or without the profiling of nascent RNA.
Assuntos
RNA-Seq , RNA/metabolismo , Biologia Computacional/métodos , Doença/genética , Expressão Gênica , Genoma , Humanos , Cinética , RNA/biossíntese , Processamento Pós-Transcricional do RNA , RNA-Seq/métodos , TiouridinaRESUMO
Despite gene expression programs being notoriously complex, RNA abundance is usually assumed as a proxy for transcriptional activity. Recently developed approaches, able to disentangle transcriptional and post-transcriptional regulatory processes, have revealed a more complex scenario. It is now possible to work out how synthesis, processing and degradation kinetic rates collectively determine the abundance of each gene's RNA. It has become clear that the same transcriptional output can correspond to different combinations of the kinetic rates. This underscores the fact that markedly different modes of gene expression regulation exist, each with profound effects on a gene's ability to modulate its own expression. This review describes the development of the experimental and computational approaches, including RNA metabolic labeling and mathematical modeling, that have been disclosing the mechanisms underlying complex transcriptional programs. Current limitations and future perspectives in the field are also discussed.
Assuntos
Modelos Genéticos , Processamento Pós-Transcricional do RNA , RNA/biossíntese , RNA/genética , Transcrição Gênica , Animais , HumanosRESUMO
MOTIVATION: Approaches such as chromatin immunoprecipitation followed by sequencing (ChIP-seq) represent the standard for the identification of binding sites of DNA-associated proteins, including transcription factors and histone marks. Public repositories of omics data contain a huge number of experimental ChIP-seq data, but their reuse and integrative analysis across multiple conditions remain a daunting task. RESULTS: We present the Combinatorial and Semantic Analysis of Functional Elements (CombSAFE), an efficient computational method able to integrate and take advantage of the valuable and numerous, but heterogeneous, ChIP-seq data publicly available in big data repositories. Leveraging natural language processing techniques, it integrates omics data samples with semantic annotations from selected biomedical ontologies; then, using hidden Markov models, it identifies combinations of static and dynamic functional elements throughout the genome for the corresponding samples. CombSAFE allows analyzing the whole genome, by clustering patterns of regions with similar functional elements and through enrichment analyses to discover ontological terms significantly associated with them. Moreover, it allows comparing functional states of a specific genomic region to analyze their different behavior throughout the various semantic annotations. Such findings can provide novel insights by identifying unexpected combinations of functional elements in different biological conditions. AVAILABILITY AND IMPLEMENTATION: The Python implementation of the CombSAFE pipeline is freely available for non-commercial use at: https://github.com/DEIB-GECO/CombSAFE. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Genômica , Semântica , Análise de Sequência de DNA/métodos , Genômica/métodos , Sequenciamento de Cromatina por Imunoprecipitação , GenomaRESUMO
Upon recruitment to active enhancers and promoters, RNA polymerase II (Pol II) generates short non-coding transcripts of unclear function. The mechanisms that control the length and the amount of ncRNAs generated by cis-regulatory elements are largely unknown. Here, we show that the adaptor protein WDR82 and its associated complexes actively limit such non-coding transcription. WDR82 targets the SET1 H3K4 methyltransferases and the nuclear protein phosphatase 1 (PP1) complexes to the initiating Pol II. WDR82 and PP1 also interact with components of the transcriptional termination and RNA processing machineries. Depletion of WDR82, SET1, or the PP1 subunit required for its nuclear import caused distinct but overlapping transcription termination defects at highly expressed genes and active enhancers and promoters, thus enabling the increased synthesis of unusually long ncRNAs. These data indicate that transcription initiated from cis-regulatory elements is tightly coordinated with termination mechanisms that impose the synthesis of short RNAs.
Assuntos
Núcleo Celular/metabolismo , Elementos Facilitadores Genéticos/fisiologia , Regiões Promotoras Genéticas/fisiologia , RNA Polimerase II/metabolismo , RNA não Traduzido/biossíntese , Terminação da Transcrição Genética/fisiologia , Transporte Ativo do Núcleo Celular/fisiologia , Animais , Núcleo Celular/genética , Proteínas Cromossômicas não Histona/genética , Proteínas Cromossômicas não Histona/metabolismo , Histona-Lisina N-Metiltransferase/genética , Histona-Lisina N-Metiltransferase/metabolismo , Camundongos , Fosfoproteínas Fosfatases/genética , Fosfoproteínas Fosfatases/metabolismo , RNA Polimerase II/genética , RNA não Traduzido/genéticaRESUMO
The histone demethylase LSD1 is a key chromatin regulator that is often deregulated in cancer. Its ortholog, dLsd1 plays a crucial role in Drosophila oogenesis; however, our knowledge of dLsd1 function is insufficient to explain its role in the ovary. Here, we have performed genome-wide analysis of dLsd1 binding in the ovary, and we document that dLsd1 is preferentially associated to the transcription start site of developmental genes. We uncovered an unanticipated interplay between dLsd1 and the GATA transcription factor Serpent and we report an unexpected role for Serpent in oogenesis. Besides, our transcriptomic data show that reducing dLsd1 levels results in ectopic transposable elements (TE) expression correlated with changes in H3K4me2 and H3K9me2 at TE loci. In addition, our results suggest that dLsd1 is required for Piwi dependent TE silencing. Hence, we propose that dLsd1 plays crucial roles in establishing specific gene expression programs and in repressing transposons during oogenesis.
Assuntos
Elementos de DNA Transponíveis/genética , Proteínas de Drosophila/genética , Fatores de Transcrição GATA/genética , Oogênese/genética , Oxirredutases N-Desmetilantes/genética , Animais , Proteínas Argonautas/genética , Cromatina/genética , Drosophila melanogaster/genética , Drosophila melanogaster/crescimento & desenvolvimento , Feminino , Regulação da Expressão Gênica no Desenvolvimento/genética , Genes Controladores do Desenvolvimento/genética , Histonas/genética , Ovário/crescimento & desenvolvimento , Ovário/metabolismo , Sítio de Iniciação de TranscriçãoRESUMO
Satellite cells (SCs) are muscle stem cells that remain quiescent during homeostasis and are activated in response to acute muscle damage or in chronic degenerative conditions such as Duchenne Muscular Dystrophy. The activity of SCs is supported by specialized cells which either reside in the muscle or are recruited in regenerating skeletal muscles, such as for instance macrophages (MΦs). By using a dystrophic mouse model of transient MΦ depletion, we describe a shift in identity of muscle stem cells dependent on the crosstalk between MΦs and SCs. Indeed MΦ depletion determines adipogenic conversion of SCs and exhaustion of the SC pool leading to an exacerbated dystrophic phenotype. The reported data could also provide new insights into therapeutic approaches targeting inflammation in dystrophic muscles.
Assuntos
Diferenciação Celular/genética , Macrófagos/metabolismo , Distrofia Muscular de Duchenne/genética , Regeneração/genética , Animais , Linhagem da Célula/genética , Modelos Animais de Doenças , Distrofina/genética , Humanos , Macrófagos/patologia , Camundongos , Camundongos Endogâmicos mdx , Músculo Esquelético/metabolismo , Músculo Esquelético/patologia , Distrofia Muscular de Duchenne/metabolismo , Distrofia Muscular de Duchenne/patologia , Mioblastos/metabolismo , Células Satélites de Músculo Esquelético/metabolismo , Células Satélites de Músculo Esquelético/patologiaRESUMO
Upon activation, lymphocytes exit quiescence and undergo substantial increases in cell size, accompanied by activation of energy-producing and anabolic pathways, widespread chromatin decompaction, and elevated transcriptional activity. These changes depend upon prior induction of the Myc transcription factor, but how Myc controls them remains unclear. We addressed this issue by profiling the response to LPS stimulation in wild-type and c-myc-deleted primary mouse B-cells. Myc is rapidly induced, becomes detectable on virtually all active promoters and enhancers, but has no direct impact on global transcriptional activity. Instead, Myc contributes to the swift up- and down-regulation of several hundred genes, including many known regulators of the aforementioned cellular processes. Myc-activated promoters are enriched for E-box consensus motifs, bind Myc at the highest levels, and show enhanced RNA Polymerase II recruitment, the opposite being true at down-regulated loci. Remarkably, the Myc-dependent signature identified in activated B-cells is also enriched in Myc-driven B-cell lymphomas: hence, besides modulation of new cancer-specific programs, the oncogenic action of Myc may largely rely on sustained deregulation of its normal physiological targets.
Assuntos
Linfócitos B/metabolismo , Proteínas Proto-Oncogênicas c-myc/metabolismo , Animais , Ciclo Celular/genética , Ciclo Celular/fisiologia , Proliferação de Células/genética , Proliferação de Células/fisiologia , Imunoprecipitação da Cromatina , Feminino , Regulação Neoplásica da Expressão Gênica/genética , Sequenciamento de Nucleotídeos em Larga Escala , Immunoblotting , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Regiões Promotoras Genéticas/genética , Proteínas Proto-Oncogênicas c-myc/genética , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , Transcrição Gênica/genéticaRESUMO
The covalent modification of RNA molecules is a pervasive feature of all classes of RNAs and has fundamental roles in the regulation of several cellular processes. Mapping the location of RNA modifications transcriptome-wide is key to unveiling their role and dynamic behaviour, but technical limitations have often hampered these efforts. Nanopore direct RNA sequencing is a third-generation sequencing technology that allows the sequencing of native RNA molecules, thus providing a direct way to detect modifications at single-molecule resolution. Despite recent advances, the analysis of nanopore sequencing data for RNA modification detection is still a complex task that presents many challenges. Many works have addressed this task using different approaches, resulting in a large number of tools with different features and performances. Here we review the diverse approaches proposed so far and outline the principles underlying currently available algorithms.
Assuntos
Algoritmos , Biologia Computacional/métodos , Sequenciamento por Nanoporos/métodos , Processamento Pós-Transcricional do RNA , RNA/química , RNA/genética , Transcriptoma , Animais , Humanos , SoftwareRESUMO
Overexpression of the MYC transcription factor causes its widespread interaction with regulatory elements in the genome but leads to the up- and down-regulation of discrete sets of genes. The molecular determinants of these selective transcriptional responses remain elusive. Here, we present an integrated time-course analysis of transcription and mRNA dynamics following MYC activation in proliferating mouse fibroblasts, based on chromatin immunoprecipitation, metabolic labeling of newly synthesized RNA, extensive sequencing, and mathematical modeling. Transcriptional activation correlated with the highest increases in MYC binding at promoters. Repression followed a reciprocal scenario, with the lowest gains in MYC binding. Altogether, the relative abundance (henceforth, "share") of MYC at promoters was the strongest predictor of transcriptional responses in diverse cell types, predominating over MYC's association with the corepressor ZBTB17 (also known as MIZ1). MYC activation elicited immediate loading of RNA polymerase II (RNAPII) at activated promoters, followed by increases in pause-release, while repressed promoters showed opposite effects. Gains and losses in RNAPII loading were proportional to the changes in the MYC share, suggesting that repression by MYC may be partly indirect, owing to competition for limiting amounts of RNAPII. Secondary to the changes in RNAPII loading, the dynamics of elongation and pre-mRNA processing were also rapidly altered at MYC regulated genes, leading to the transient accumulation of partially or aberrantly processed mRNAs. Altogether, our results shed light on how overexpressed MYC alters the various phases of the RNAPII cycle and the resulting transcriptional response.
Assuntos
Regiões Promotoras Genéticas/fisiologia , Proteínas Proto-Oncogênicas c-myc/metabolismo , RNA Polimerase II/metabolismo , Precursores de RNA/biossíntese , Transcrição Gênica/fisiologia , Animais , Linhagem Celular Transformada , Camundongos , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Proteínas Inibidoras de STAT Ativados/genética , Proteínas Inibidoras de STAT Ativados/metabolismo , Proteínas Proto-Oncogênicas c-myc/genética , RNA Polimerase II/genética , Precursores de RNA/genética , Processamento Pós-Transcricional do RNA/fisiologia , Ubiquitina-Proteína LigasesRESUMO
The c-myc proto-oncogene product, Myc, is a transcription factor that binds thousands of genomic loci. Recent work suggested that rather than up- and downregulating selected groups of genes, Myc targets all active promoters and enhancers in the genome (a phenomenon termed 'invasion') and acts as a general amplifier of transcription. However, the available data did not readily discriminate between direct and indirect effects of Myc on RNA biogenesis. We addressed this issue with genome-wide chromatin immunoprecipitation and RNA expression profiles during B-cell lymphomagenesis in mice, in cultured B cells and fibroblasts. Consistent with long-standing observations, we detected general increases in total RNA or messenger RNA copies per cell (hereby termed 'amplification') when comparing actively proliferating cells with control quiescent cells: this was true whether cells were stimulated by mitogens (requiring endogenous Myc for a proliferative response) or by deregulated, oncogenic Myc activity. RNA amplification and promoter/enhancer invasion by Myc were separable phenomena that could occur without one another. Moreover, whether or not associated with RNA amplification, Myc drove the differential expression of distinct subsets of target genes. Hence, although having the potential to interact with all active or poised regulatory elements in the genome, Myc does not directly act as a global transcriptional amplifier. Instead, our results indicate that Myc activates and represses transcription of discrete gene sets, leading to changes in cellular state that can in turn feed back on global RNA production and turnover.
Assuntos
Proliferação de Células , Transformação Celular Neoplásica/genética , Regulação Neoplásica da Expressão Gênica , Linfoma de Células B/genética , Linfoma de Células B/patologia , Proteínas Proto-Oncogênicas c-myc/metabolismo , Transcrição Gênica , Animais , Linfócitos B/metabolismo , Linfócitos B/patologia , Transformação Celular Neoplásica/patologia , Cromatina/genética , Cromatina/metabolismo , Imunoprecipitação da Cromatina , Progressão da Doença , Regulação para Baixo/genética , Feminino , Fibroblastos/citologia , Fibroblastos/metabolismo , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica/genética , Genoma/genética , Linfoma de Células B/metabolismo , Masculino , Camundongos , Mitógenos/farmacologia , Regiões Promotoras Genéticas/genética , Proteínas Proto-Oncogênicas c-myc/genética , RNA Mensageiro/biossíntese , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Fatores de Transcrição/metabolismo , Transcrição Gênica/genética , Regulação para Cima/genéticaRESUMO
The regulation of miRNAs is critical to the definition of cell identity and behavior in normal physiology and disease. To date, the dynamics of miRNA degradation and the mechanisms involved in remain largely obscure, in particular, in higher organisms. Here, we developed a pulse-chase approach based on metabolic RNA labeling to calculate miRNA decay rates at genome-wide scale in mammalian cells. Our analysis revealed heterogeneous miRNA half-lives, with many species behaving as stable molecules (T1/2> 24 h), while others, including passenger miRNAs and a number (25/129) of guide miRNAs, are quickly turned over (T1/2= 4-14 h). Decay rates were coupled with other features, including genomic organization, transcription rates, structural heterogeneity (isomiRs), and target abundance, measured through quantitative experimental approaches. This comprehensive analysis highlighted functional mechanisms that mediate miRNA degradation, as well as the importance of decay dynamics in the regulation of the miRNA pool under both steady-state conditions and during cell transitions.
Assuntos
MicroRNAs/genética , Animais , Proteínas Argonautas/metabolismo , Fibroblastos , Regulação da Expressão Gênica , Estudo de Associação Genômica Ampla , Camundongos , MicroRNAs/metabolismo , Interferência de RNA , Estabilidade de RNA , Ribonuclease III/metabolismo , Fatores de Tempo , Transcrição GênicaRESUMO
Public repositories of large-scale biological data currently contain hundreds of thousands of experiments, including high-throughput sequencing and microarray data. The potential of using these resources to assemble data sets combining samples previously not associated is vastly unexplored. This requires the ability to associate samples with clear annotations and to relate experiments matched with different annotation terms. In this study, we illustrate the semantic annotation of Gene Expression Omnibus samples metadata using concepts from biomedical ontologies, focusing on the association of thousands of chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) samples with a given target, tissue and disease state. Next, we demonstrate the feasibility of quantitatively measuring the semantic similarity between different samples, with the aim of combining experiments associated with the same or similar semantic annotations, thus allowing the generation of large data sets without the need of additional experiments. We compared tools based on Unified Medical Language System with tools that use topic-specific ontologies, showing that the second approach outperforms the first both in the annotation process and in the computation of semantic similarity measures. Finally, we demonstrated the potential of this approach by identifying semantically homogeneous groups of ChIP-seq samples targeting the Myc transcription factor, and expanding this data set with semantically coherent epigenetic samples. The semantic information of these data sets proved to be coherent with the ChIP-seq signal and with the current knowledge about this transcription factor.
Assuntos
Genômica , Ontologias Biológicas , Imunoprecipitação da Cromatina , Humanos , SemânticaRESUMO
BACKGROUND: Covalent RNA modifications, such as N-6-methyladenosine (m6A), have been associated with various biological processes, but their role in cancer remains largely unexplored. m6A dynamics depends on specific enzymes whose deregulation may also impact in tumorigenesis. Herein, we assessed the differential abundance of m6A, its writer VIRMA and its reader YTHDF3, in testicular germ cell tumors (TGCTs), looking for clinicopathological correlates. METHODS: In silico analysis of TCGA data disclosed altered expression of VIRMA (52%) and YTHDF3 (48%), prompting subsequent validation. Formalin-fixed paraffin-embedded tissues from 122 TGCTs (2005-2016) were selected. RNA extraction, cDNA synthesis and real-time qPCR (Taqman assays) for VIRMA and YTHDF3 were performed, as well as immunohistochemistry for VIRMA, YTHDF3 and m6A, for staining intensity assessment. Associations between categorical variables were assessed using Chi square and Fisher's exact test. Distribution of continuous variables between groups was compared using the nonparametric Mann-Whitney and Kruskal-Wallis tests. Biomarker performance was assessed through receiver operating characteristics (ROC) curve construction and a cut-off was established by Youden's index method. Statistical significance was set at p < 0.05. RESULTS: In our cohort, VIRMA and YTHDF3 mRNA expression levels differed among TGCT subtypes, with Seminomas (SEs) depicting higher levels than Non-Seminomatous tumors (NSTs) (p < 0.01 for both). A positive correlation was found between VIRMA and YTHDF3 expression levels. VIRMA discriminated SEs from NSTs with AUC = 0.85 (Sensitivity 77.3%, Specificity 81.1%, PPV 71.6%, NPV 85.3%, Accuracy 79.7%). Immunohistochemistry paralleled transcript findings, as patients with strong m6A immunostaining intensity depicted significantly higher VIRMA mRNA expression levels and stronger VIRMA immunoexpression intensity (p < 0.001 and p < 0.01, respectively). CONCLUSION: Abundance of m6A and expression of VIRMA/YTHDF3 were different among TGCT subtypes, with higher levels in SEs, suggesting a contribution to SE phenotype maintenance. VIRMA and YTHDF3 might cooperate in m6A establishment in TGCTs, and their transcript levels accurately discriminate between SEs and NSTs, constituting novel candidate biomarkers for patient management.
Assuntos
Adenosina/análogos & derivados , Neoplasias Embrionárias de Células Germinativas/genética , Neoplasias Embrionárias de Células Germinativas/patologia , Proteínas de Ligação a RNA/genética , Seminoma/genética , Seminoma/patologia , Neoplasias Testiculares/genética , Neoplasias Testiculares/patologia , Adenosina/metabolismo , Adulto , Animais , Estudos de Coortes , Simulação por Computador , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Camundongos , Metástase Neoplásica , Fenótipo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Proteínas de Ligação a RNA/metabolismo , Reprodutibilidade dos Testes , Adulto JovemRESUMO
Natural epigenetic variation provides a source for the generation of phenotypic diversity, but to understand its contribution to such diversity, its interaction with genetic variation requires further investigation. Here we report population-wide DNA sequencing of genomes, transcriptomes and methylomes of wild Arabidopsis thaliana accessions. Single cytosine methylation polymorphisms are not linked to genotype. However, the rate of linkage disequilibrium decay amongst differentially methylated regions targeted by RNA-directed DNA methylation is similar to the rate for single nucleotide polymorphisms. Association analyses of these RNA-directed DNA methylation regions with genetic variants identified thousands of methylation quantitative trait loci, which revealed the population estimate of genetically dependent methylation variation. Analysis of invariably methylated transposons and genes across this population indicates that loci targeted by RNA-directed DNA methylation are epigenetically activated in pollen and seeds, which facilitates proper development of these structures.
Assuntos
Arabidopsis/genética , Epigênese Genética/genética , Variação Genética/genética , Genoma de Planta/genética , Metilação de DNA/genética , Elementos de DNA Transponíveis/genética , Epigenômica , Desequilíbrio de Ligação/genética , Pólen/genética , Polimorfismo Genético/genética , Locos de Características Quantitativas , RNA Mensageiro/análise , RNA Mensageiro/genética , RNA de Plantas/genética , Sementes/genéticaRESUMO
The genetic alphabet consists of the four letters: C, A, G, and T in DNA and C,A,G, and U in RNA. Triplets of these four letters jointly encode 20 different amino acids out of which proteins of all organisms are built. This system is universal and is found in all kingdoms of life. However, bases in DNA and RNA can be chemically modified. In DNA, around 10 different modifications are known, and those have been studied intensively over the past 20 years. Scientific studies on DNA modifications and proteins that recognize them gave rise to the large field of epigenetic and epigenomic research. The outcome of this intense research field is the discovery that development, ageing, and stem-cell dependent regeneration but also several diseases including cancer are largely controlled by the epigenetic state of cells. Consequently, this research has already led to the first FDA approved drugs that exploit the gained knowledge to combat disease. In recent years, the ~150 modifications found in RNA have come to the focus of intense research. Here we provide a perspective on necessary and expected developments in the fast expanding area of RNA modifications, termed epitranscriptomics.
Assuntos
DNA de Neoplasias , Epigênese Genética , Epigenômica/normas , Perfilação da Expressão Gênica/normas , Regulação Neoplásica da Expressão Gênica , Neoplasias , RNA Neoplásico , Transcriptoma , DNA de Neoplasias/genética , DNA de Neoplasias/metabolismo , Europa (Continente) , Perfilação da Expressão Gênica/métodos , Humanos , Neoplasias/genética , Neoplasias/metabolismo , RNA Neoplásico/genética , RNA Neoplásico/metabolismoRESUMO
Both diffusible factors acting in trans and chromatin components acting in cis are implicated in gene regulation, but the extent to which either process causally determines a cell's transcriptional identity is unclear. We recently used cell fusion to define a class of silent genes termed "cis-silenced" (or "occluded") genes, which remain silent even in the presence of trans-acting transcriptional activators. We further showed that occlusion of lineage-inappropriate genes plays a critical role in maintaining the transcriptional identities of somatic cells. Here, we present, for the first time, a comprehensive map of occluded genes in somatic cells. Specifically, we mapped occluded genes in mouse fibroblasts via fusion to a dozen different rat cell types followed by whole-transcriptome profiling. We found that occluded genes are highly prevalent and stable in somatic cells, representing a sizeable fraction of silent genes. Occluded genes are also highly enriched for important developmental regulators of alternative lineages, consistent with the role of occlusion in safeguarding cell identities. Alongside this map, we also present whole-genome maps of DNA methylation and eight other chromatin marks. These maps uncover a complex relationship between chromatin state and occlusion. Furthermore, we found that DNA methylation functions as the memory of occlusion in a subset of occluded genes, while histone deacetylation contributes to the implementation but not memory of occlusion. Our data suggest that the identities of individual cell types are defined largely by the occlusion status of their genomes. The comprehensive reference maps reported here provide the foundation for future studies aimed at understanding the role of occlusion in development and disease.
Assuntos
Regulação da Expressão Gênica , Inativação Gênica , Sequências Reguladoras de Ácido Nucleico , Transativadores/genética , Transcrição Gênica , Animais , Fusão Celular , Linhagem Celular , Cromatina/genética , Metilação de DNA/genética , Genoma , Histonas/genética , Histonas/metabolismo , Camundongos , RatosRESUMO
Induced pluripotent stem cells (iPSCs) offer immense potential for regenerative medicine and studies of disease and development. Somatic cell reprogramming involves epigenomic reconfiguration, conferring iPSCs with characteristics similar to embryonic stem (ES) cells. However, it remains unknown how complete the reestablishment of ES-cell-like DNA methylation patterns is throughout the genome. Here we report the first whole-genome profiles of DNA methylation at single-base resolution in five human iPSC lines, along with methylomes of ES cells, somatic cells, and differentiated iPSCs and ES cells. iPSCs show significant reprogramming variability, including somatic memory and aberrant reprogramming of DNA methylation. iPSCs share megabase-scale differentially methylated regions proximal to centromeres and telomeres that display incomplete reprogramming of non-CG methylation, and differences in CG methylation and histone modifications. Lastly, differentiation of iPSCs into trophoblast cells revealed that errors in reprogramming CG methylation are transmitted at a high frequency, providing an iPSC reprogramming signature that is maintained after differentiation.