Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 95
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nat Rev Mol Cell Biol ; 24(6): 430-447, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-36596869

RESUMEN

Genes specifying long non-coding RNAs (lncRNAs) occupy a large fraction of the genomes of complex organisms. The term 'lncRNAs' encompasses RNA polymerase I (Pol I), Pol II and Pol III transcribed RNAs, and RNAs from processed introns. The various functions of lncRNAs and their many isoforms and interleaved relationships with other genes make lncRNA classification and annotation difficult. Most lncRNAs evolve more rapidly than protein-coding sequences, are cell type specific and regulate many aspects of cell differentiation and development and other physiological processes. Many lncRNAs associate with chromatin-modifying complexes, are transcribed from enhancers and nucleate phase separation of nuclear condensates and domains, indicating an intimate link between lncRNA expression and the spatial control of gene expression during development. lncRNAs also have important roles in the cytoplasm and beyond, including in the regulation of translation, metabolism and signalling. lncRNAs often have a modular structure and are rich in repeats, which are increasingly being shown to be relevant to their function. In this Consensus Statement, we address the definition and nomenclature of lncRNAs and their conservation, expression, phenotypic visibility, structure and functions. We also discuss research challenges and provide recommendations to advance the understanding of the roles of lncRNAs in development, cell biology and disease.


Asunto(s)
ARN Largo no Codificante , ARN Largo no Codificante/genética , Núcleo Celular/genética , Cromatina/genética , Secuencias Reguladoras de Ácidos Nucleicos , ARN Polimerasa II/genética
2.
Nature ; 622(7981): 41-47, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37794265

RESUMEN

Scientists have been trying to identify every gene in the human genome since the initial draft was published in 2001. In the years since, much progress has been made in identifying protein-coding genes, currently estimated to number fewer than 20,000, with an ever-expanding number of distinct protein-coding isoforms. Here we review the status of the human gene catalogue and the efforts to complete it in recent years. Beside the ongoing annotation of protein-coding genes, their isoforms and pseudogenes, the invention of high-throughput RNA sequencing and other technological breakthroughs have led to a rapid growth in the number of reported non-coding RNA genes. For most of these non-coding RNAs, the functional relevance is currently unclear; we look at recent advances that offer paths forward to identifying their functions and towards eventually completing the human gene catalogue. Finally, we examine the need for a universal annotation standard that includes all medically significant genes and maintains their relationships with different reference genomes for the use of the human gene catalogue in clinical settings.


Asunto(s)
Genes , Genoma Humano , Anotación de Secuencia Molecular , Isoformas de Proteínas , Humanos , Genoma Humano/genética , Anotación de Secuencia Molecular/normas , Anotación de Secuencia Molecular/tendencias , Isoformas de Proteínas/genética , Proyecto Genoma Humano , Seudogenes , ARN/genética
3.
Cell ; 148(1-2): 84-98, 2012 Jan 20.
Artículo en Inglés | MEDLINE | ID: mdl-22265404

RESUMEN

Higher-order chromosomal organization for transcription regulation is poorly understood in eukaryotes. Using genome-wide Chromatin Interaction Analysis with Paired-End-Tag sequencing (ChIA-PET), we mapped long-range chromatin interactions associated with RNA polymerase II in human cells and uncovered widespread promoter-centered intragenic, extragenic, and intergenic interactions. These interactions further aggregated into higher-order clusters, wherein proximal and distal genes were engaged through promoter-promoter interactions. Most genes with promoter-promoter interactions were active and transcribed cooperatively, and some interacting promoters could influence each other implying combinatorial complexity of transcriptional controls. Comparative analyses of different cell lines showed that cell-specific chromatin interactions could provide structural frameworks for cell-specific transcription, and suggested significant enrichment of enhancer-promoter interactions for cell-specific functions. Furthermore, genetically-identified disease-associated noncoding elements were found to be spatially engaged with corresponding genes through long-range interactions. Overall, our study provides insights into transcription regulation by three-dimensional chromatin interactions for both housekeeping and cell-specific genes in human cells.


Asunto(s)
Cromatina/metabolismo , Regulación de la Expresión Génica , Regiones Promotoras Genéticas , ARN Polimerasa II/metabolismo , Transcripción Genética , Línea Celular Tumoral , Inmunoprecipitación de Cromatina , Elementos de Facilitación Genéticos , Estudio de Asociación del Genoma Completo , Humanos
4.
Nature ; 583(7818): 693-698, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32728248

RESUMEN

The Encylopedia of DNA Elements (ENCODE) Project launched in 2003 with the long-term goal of developing a comprehensive map of functional elements in the human genome. These included genes, biochemical regions associated with gene regulation (for example, transcription factor binding sites, open chromatin, and histone marks) and transcript isoforms. The marks serve as sites for candidate cis-regulatory elements (cCREs) that may serve functional roles in regulating gene expression1. The project has been extended to model organisms, particularly the mouse. In the third phase of ENCODE, nearly a million and more than 300,000 cCRE annotations have been generated for human and mouse, respectively, and these have provided a valuable resource for the scientific community.


Asunto(s)
Bases de Datos Genéticas , Genoma/genética , Genómica , Anotación de Secuencia Molecular , Animales , Sitios de Unión , Cromatina/genética , Cromatina/metabolismo , Metilación de ADN , Bases de Datos Genéticas/normas , Bases de Datos Genéticas/tendencias , Regulación de la Expresión Génica/genética , Genoma Humano/genética , Genómica/normas , Genómica/tendencias , Histonas/metabolismo , Humanos , Ratones , Anotación de Secuencia Molecular/normas , Control de Calidad , Secuencias Reguladoras de Ácidos Nucleicos/genética , Factores de Transcripción/metabolismo
5.
EMBO J ; 38(8)2019 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-30918008

RESUMEN

Long noncoding RNAs (lncRNAs) can regulate target gene expression by acting in cis (locally) or in trans (non-locally). Here, we performed genome-wide expression analysis of Toll-like receptor (TLR)-stimulated human macrophages to identify pairs of cis-acting lncRNAs and protein-coding genes involved in innate immunity. A total of 229 gene pairs were identified, many of which were commonly regulated by signaling through multiple TLRs and were involved in the cytokine responses to infection by group B Streptococcus We focused on elucidating the function of one lncRNA, named lnc-MARCKS or ROCKI (Regulator of Cytokines and Inflammation), which was induced by multiple TLR stimuli and acted as a master regulator of inflammatory responses. ROCKI interacted with APEX1 (apurinic/apyrimidinic endodeoxyribonuclease 1) to form a ribonucleoprotein complex at the MARCKS promoter. In turn, ROCKI-APEX1 recruited the histone deacetylase HDAC1, which removed the H3K27ac modification from the promoter, thus reducing MARCKS transcription and subsequent Ca2+ signaling and inflammatory gene expression. Finally, genetic variants affecting ROCKI expression were linked to a reduced risk of certain inflammatory and infectious disease in humans, including inflammatory bowel disease and tuberculosis. Collectively, these data highlight the importance of cis-acting lncRNAs in TLR signaling, innate immunity, and pathophysiological inflammation.


Asunto(s)
Regulación de la Expresión Génica , Inmunidad Innata/inmunología , Inflamación/inmunología , Macrófagos/inmunología , ARN Largo no Codificante/metabolismo , Infecciones Estreptocócicas/microbiología , Receptores Toll-Like/metabolismo , Células Cultivadas , Citocinas/metabolismo , ADN-(Sitio Apurínico o Apirimidínico) Liasa/genética , ADN-(Sitio Apurínico o Apirimidínico) Liasa/metabolismo , Genoma Humano , Histona Desacetilasa 1/genética , Histona Desacetilasa 1/metabolismo , Humanos , Inflamación/genética , Inflamación/microbiología , Macrófagos/metabolismo , Macrófagos/microbiología , Sustrato de la Proteína Quinasa C Rico en Alanina Miristoilada/genética , Sustrato de la Proteína Quinasa C Rico en Alanina Miristoilada/metabolismo , Regiones Promotoras Genéticas , ARN Largo no Codificante/genética , Infecciones Estreptocócicas/inmunología , Streptococcus agalactiae/aislamiento & purificación , Receptores Toll-Like/genética
6.
Genome Res ; 30(7): 1047-1059, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32759341

RESUMEN

We have produced RNA sequencing data for 53 primary cells from different locations in the human body. The clustering of these primary cells reveals that most cells in the human body share a few broad transcriptional programs, which define five major cell types: epithelial, endothelial, mesenchymal, neural, and blood cells. These act as basic components of many tissues and organs. Based on gene expression, these cell types redefine the basic histological types by which tissues have been traditionally classified. We identified genes whose expression is specific to these cell types, and from these genes, we estimated the contribution of the major cell types to the composition of human tissues. We found this cellular composition to be a characteristic signature of tissues and to reflect tissue morphological heterogeneity and histology. We identified changes in cellular composition in different tissues associated with age and sex, and found that departures from the normal cellular composition correlate with histological phenotypes associated with disease.


Asunto(s)
Transcripción Genética , Línea Celular , Células Endoteliales/metabolismo , Células Epiteliales/metabolismo , Femenino , Perfilación de la Expresión Génica , Ginecomastia/genética , Ginecomastia/metabolismo , Humanos , Masculino , Mesodermo/citología , Mesodermo/metabolismo , Neoplasias/genética , Especificidad de Órganos , Análisis de Secuencia de ARN
7.
Nat Rev Genet ; 18(7): 425-440, 2017 07.
Artículo en Inglés | MEDLINE | ID: mdl-28479595

RESUMEN

Cross-species comparisons of genomes, transcriptomes and gene regulation are now feasible at unprecedented resolution and throughput, enabling the comparison of human and mouse biology at the molecular level. Insights have been gained into the degree of conservation between human and mouse at the level of not only gene expression but also epigenetics and inter-individual variation. However, a number of limitations exist, including incomplete transcriptome characterization and difficulties in identifying orthologous phenotypes and cell types, which are beginning to be addressed by emerging technologies. Ultimately, these comparisons will help to identify the conditions under which the mouse is a suitable model of human physiology and disease, and optimize the use of animal models.


Asunto(s)
Modelos Animales de Enfermedad , Evolución Molecular , Regulación de la Expresión Génica , Transcriptoma , Animales , Secuencia Conservada , Genoma Humano , Humanos , Ratones , ARN Largo no Codificante/genética
9.
Genome Res ; 29(9): 1402-1414, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31413151

RESUMEN

Alu elements are one of the most successful families of transposons in the human genome. A portion of Alu elements is transcribed by RNA Pol III, whereas the remaining ones are part of Pol II transcripts. Because Alu elements are highly repetitive, it has been difficult to identify the Pol III-transcribed elements and quantify their expression levels. In this study, we generated high-resolution, long-genomic-span RAMPAGE data in 155 biosamples all with matching RNA-seq data and built an atlas of 17,249 Pol III-transcribed Alu elements. We further performed an integrative analysis on the ChIP-seq data of 10 histone marks and hundreds of transcription factors, whole-genome bisulfite sequencing data, ChIA-PET data, and functional data in several biosamples, and our results revealed that although the human-specific Alu elements are transcriptionally repressed, the older, expressed Alu elements may be exapted by the human host to function as cell-type-specific enhancers for their nearby protein-coding genes.


Asunto(s)
Elementos Alu , Análisis de Secuencia de ARN/métodos , Secuenciación Completa del Genoma/métodos , Biología Computacional/métodos , Elementos de Facilitación Genéticos , Evolución Molecular , Regulación de la Expresión Génica , Histonas/genética , Humanos , Anotación de Secuencia Molecular , ARN Polimerasa III/metabolismo , Sitio de Iniciación de la Transcripción
10.
Genome Res ; 29(11): 1900-1909, 2019 11.
Artículo en Inglés | MEDLINE | ID: mdl-31645363

RESUMEN

MicroRNAs (miRNAs) play a critical role as posttranscriptional regulators of gene expression. The ENCODE Project profiled the expression of miRNAs in an extensive set of organs during a time-course of mouse embryonic development and captured the expression dynamics of 785 miRNAs. We found distinct organ-specific and developmental stage-specific miRNA expression clusters, with an overall pattern of increasing organ-specific expression as embryonic development proceeds. Comparative analysis of conserved miRNAs in mouse and human revealed stronger clustering of expression patterns by organ type rather than by species. An analysis of messenger RNA expression clusters compared with miRNA expression clusters identifies the potential role of specific miRNA expression clusters in suppressing the expression of mRNAs specific to other developmental programs in the organ in which these miRNAs are expressed during embryonic development. Our results provide the most comprehensive time-course of miRNA expression as part of an integrated ENCODE reference data set for mouse embryonic development.


Asunto(s)
Desarrollo Embrionario/genética , MicroARNs/genética , Animales , Femenino , Regulación del Desarrollo de la Expresión Génica , Ratones , Embarazo , ARN Mensajero/genética
11.
Nucleic Acids Res ; 48(14): 8035-8049, 2020 08 20.
Artículo en Inglés | MEDLINE | ID: mdl-32609822

RESUMEN

Extracellular RNAs participate in intercellular communication, and are being studied as promising minimally invasive diagnostic markers. Several studies in recent years showed that tRNA halves and distinct Y RNA fragments are abundant in the extracellular space, including in biofluids. While their regulatory and diagnostic potential has gained a substantial amount of attention, the biogenesis of these extracellular RNA fragments remains largely unexplored. Here, we demonstrate that these fragments are produced by RNase 1, a highly active secreted nuclease. We use RNA sequencing to investigate the effect of a null mutation of RNase 1 on the levels of tRNA halves and Y RNA fragments in the extracellular environment of cultured human cells. We complement and extend our RNA sequencing results with northern blots, showing that tRNAs and Y RNAs in the non-vesicular extracellular compartment are released from cells as full-length precursors and are subsequently cleaved to distinct fragments. In support of these results, formation of tRNA halves is recapitulated by recombinant human RNase 1 in our in vitro assay. These findings assign a novel function for RNase 1, and position it as a strong candidate for generation of tRNA halves and Y RNA fragments in biofluids.


Asunto(s)
ARN de Transferencia/metabolismo , ARN no Traducido/metabolismo , Ribonucleasas/metabolismo , Humanos , Células K562 , Mutación , División del ARN , Procesamiento Postranscripcional del ARN , ARN de Transferencia/química , ARN no Traducido/química , RNA-Seq
12.
Nature ; 512(7515): 393-9, 2014 Aug 28.
Artículo en Inglés | MEDLINE | ID: mdl-24670639

RESUMEN

Animal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using poly(A)+ RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. Gonads express hundreds of previously unknown coding and long non-coding RNAs (lncRNAs), some of which are antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more complex than previously recognized, with this complexity arising from combinatorial usage of promoters, splice sites and polyadenylation sites.


Asunto(s)
Drosophila melanogaster/genética , Perfilación de la Expresión Génica , Transcriptoma/genética , Empalme Alternativo/genética , Animales , Drosophila melanogaster/anatomía & histología , Drosophila melanogaster/citología , Femenino , Masculino , Anotación de Secuencia Molecular , Tejido Nervioso/metabolismo , Especificidad de Órganos , Poli A/genética , Poliadenilación , Regiones Promotoras Genéticas/genética , ARN Largo no Codificante/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , Caracteres Sexuales , Estrés Fisiológico/genética
13.
Nucleic Acids Res ; 46(10): 5125-5138, 2018 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-29718481

RESUMEN

Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods' good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR's performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur.


Asunto(s)
Expresión Génica , Alineación de Secuencia/métodos , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Algoritmos , Cromosomas Humanos Y , Bases de Datos Genéticas , Femenino , Humanos , Masculino , Factores Sexuales
14.
RNA ; 21(11): 1966-79, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-26392588

RESUMEN

Extracellular vesicles (EVs) have been proposed as a means to promote intercellular communication. We show that when human primary cells are exposed to cancer cell EVs, rapid cell death of the primary cells is observed, while cancer cells treated with primary or cancer cell EVs do not display this response. The active agents that trigger cell death are 29- to 31-nucleotide (nt) or 22- to 23-nt processed fragments of an 83-nt primary transcript of the human RNY5 gene that are highly likely to be formed within the EVs. Primary cells treated with either cancer cell EVs, deproteinized total RNA from either primary or cancer cell EVs, or synthetic versions of 31- and 23-nt fragments trigger rapid cell death in a dose-dependent manner. The transfer of processed RNY5 fragments through EVs may reflect a novel strategy used by cancer cells toward the establishment of a favorable microenvironment for their proliferation and invasion.


Asunto(s)
Vesículas Extracelulares/metabolismo , Neoplasias/metabolismo , ARN/metabolismo , Comunicación Celular/fisiología , Muerte Celular/fisiología , Línea Celular Tumoral , Proliferación Celular/fisiología , Humanos , Células K562
15.
Nature ; 471(7339): 473-9, 2011 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-21179090

RESUMEN

Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development.


Asunto(s)
Drosophila melanogaster/crecimiento & desarrollo , Drosophila melanogaster/genética , Perfilación de la Expresión Génica , Regulación del Desarrollo de la Expresión Génica/genética , Transcripción Genética/genética , Empalme Alternativo/genética , Animales , Secuencia de Bases , Proteínas de Drosophila/genética , Drosophila melanogaster/embriología , Exones/genética , Femenino , Genes de Insecto/genética , Genoma de los Insectos/genética , Masculino , MicroARNs/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Isoformas de Proteínas/genética , Edición de ARN/genética , ARN Mensajero/análisis , ARN Mensajero/genética , ARN Pequeño no Traducido/análisis , ARN Pequeño no Traducido/genética , Análisis de Secuencia , Caracteres Sexuales
16.
Proc Natl Acad Sci U S A ; 111(48): 17224-9, 2014 Dec 02.
Artículo en Inglés | MEDLINE | ID: mdl-25413365

RESUMEN

Although the similarities between humans and mice are typically highlighted, morphologically and genetically, there are many differences. To better understand these two species on a molecular level, we performed a comparison of the expression profiles of 15 tissues by deep RNA sequencing and examined the similarities and differences in the transcriptome for both protein-coding and -noncoding transcripts. Although commonalities are evident in the expression of tissue-specific genes between the two species, the expression for many sets of genes was found to be more similar in different tissues within the same species than between species. These findings were further corroborated by associated epigenetic histone mark analyses. We also find that many noncoding transcripts are expressed at a low level and are not detectable at appreciable levels across individuals. Moreover, the majority lack obvious sequence homologs between species, even when we restrict our attention to those which are most highly reproducible across biological replicates. Overall, our results indicate that there is considerable RNA expression diversity between humans and mice, well beyond what was described previously, likely reflecting the fundamental physiological differences between these two organisms.


Asunto(s)
ADN Intergénico/genética , Perfilación de la Expresión Génica/métodos , Especificidad de Órganos/genética , Proteínas/genética , Animales , Epigenómica/métodos , Evolución Molecular , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Ratones Endogámicos C57BL , Análisis de Secuencia de ARN , Especificidad de la Especie , Transcriptoma/genética
17.
Proc Natl Acad Sci U S A ; 111(17): 6131-8, 2014 Apr 29.
Artículo en Inglés | MEDLINE | ID: mdl-24753594

RESUMEN

With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.


Asunto(s)
ADN/genética , Genoma Humano/genética , Evolución Biológica , Enfermedad/genética , Humanos , Secuencias Reguladoras de Ácidos Nucleicos/genética , Programas Informáticos
18.
Genome Res ; 23(1): 169-80, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22936248

RESUMEN

Many eukaryotic genes possess multiple alternative promoters with distinct expression specificities. Therefore, comprehensively annotating promoters and deciphering their individual regulatory dynamics is critical for gene expression profiling applications and for our understanding of regulatory complexity. We introduce RAMPAGE, a novel promoter activity profiling approach that combines extremely specific 5'-complete cDNA sequencing with an integrated data analysis workflow, to address the limitations of current techniques. RAMPAGE features a streamlined protocol for fast and easy generation of highly multiplexed sequencing libraries, offers very high transcription start site specificity, generates accurate and reproducible promoter expression measurements, and yields extensive transcript connectivity information through paired-end cDNA sequencing. We used RAMPAGE in a genome-wide study of promoter activity throughout 36 stages of the life cycle of Drosophila melanogaster, and describe here a comprehensive data set that represents the first available developmental time-course of promoter usage. We found that >40% of developmentally expressed genes have at least two promoters and that alternative promoters generally implement distinct regulatory programs. Transposable elements, long proposed to play a central role in the evolution of their host genomes through their ability to regulate gene expression, contribute at least 1300 promoters shaping the developmental transcriptome of D. melanogaster. Hundreds of these promoters drive the expression of annotated genes, and transposons often impart their own expression specificity upon the genes they regulate. These observations provide support for the theory that transposons may drive regulatory innovation through the distribution of stereotyped cis-regulatory modules throughout their host genomes.


Asunto(s)
Elementos Transponibles de ADN , Regulación del Desarrollo de la Expresión Génica , Regiones Promotoras Genéticas , Animales , Drosophila melanogaster/genética , Drosophila melanogaster/crecimiento & desarrollo , Biblioteca de Genes , Genes del Desarrollo , Genes de Insecto , Estadios del Ciclo de Vida/genética , Análisis de Secuencia de ADN/métodos , Sitio de Iniciación de la Transcripción , Transcripción Genética , Transcriptoma
19.
Genome Res ; 23(10): 1601-14, 2013 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-23811145

RESUMEN

Deep sequencing of mammalian DNA methylomes has uncovered a previously unpredicted number of discrete hypomethylated regions in intergenic space (iHMRs). Here, we combined whole-genome bisulfite sequencing data with extensive gene expression and chromatin-state data to define functional classes of iHMRs, and to reconstruct the dynamics of their establishment in a developmental setting. Comparing HMR profiles in embryonic stem and primary blood cells, we show that iHMRs mark an exclusive subset of active DNase hypersensitive sites (DHS), and that both developmentally constitutive and cell-type-specific iHMRs display chromatin states typical of distinct regulatory elements. We also observe that iHMR changes are more predictive of nearby gene activity than the promoter HMR itself, and that expression of noncoding RNAs within the iHMR accompanies full activation and complete demethylation of mature B cell enhancers. Conserved sequence features corresponding to iHMR transcript start sites, including a discernible TATA motif, suggest a conserved, functional role for transcription in these regions. Similarly, we explored both primate-specific and human population variation at iHMRs, finding that while enhancer iHMRs are more variable in sequence and methylation status than any other functional class, conservation of the TATA box is highly predictive of iHMR maintenance, reflecting the impact of sequence plasticity and transcriptional signals on iHMR establishment. Overall, our analysis allowed us to construct a three-step timeline in which (1) intergenic DHS are pre-established in the stem cell, (2) partial demethylation of blood-specific intergenic DHSs occurs in blood progenitors, and (3) complete iHMR formation and transcription coincide with enhancer activation in lymphoid-specified cells.


Asunto(s)
Cromatina/genética , Metilación de ADN , ADN Intergénico/química , ARN no Traducido/genética , Elementos Reguladores de la Transcripción , Animales , Linfocitos B/citología , Linfocitos B/fisiología , Diferenciación Celular , Línea Celular , Cromatina/metabolismo , Islas de CpG , Elementos de Facilitación Genéticos , Evolución Molecular , Femenino , Perfilación de la Expresión Génica , Células Madre Hematopoyéticas/citología , Células Madre Hematopoyéticas/fisiología , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Linfopoyesis , Pan troglodytes , Filogenia , Regiones Promotoras Genéticas , Análisis de Secuencia de ADN , Iniciación de la Transcripción Genética
20.
Genome Res ; 22(9): 1616-25, 2012 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-22955974

RESUMEN

Splicing remains an incompletely understood process. Recent findings suggest that chromatin structure participates in its regulation. Here, we analyze the RNA from subcellular fractions obtained through RNA-seq in the cell line K562. We show that in the human genome, splicing occurs predominantly during transcription. We introduce the coSI measure, based on RNA-seq reads mapping to exon junctions and borders, to assess the degree of splicing completion around internal exons. We show that, as expected, splicing is almost fully completed in cytosolic polyA+ RNA. In chromatin-associated RNA (which includes the RNA that is being transcribed), for 5.6% of exons, the removal of the surrounding introns is fully completed, compared with 0.3% of exons for which no intron-removal has occurred. The remaining exons exist as a mixture of spliced and fewer unspliced molecules, with a median coSI of 0.75. Thus, most RNAs undergo splicing while being transcribed: "co-transcriptional splicing." Consistent with co-transcriptional spliceosome assembly and splicing, we have found significant enrichment of spliceosomal snRNAs in chromatin-associated RNA compared with other cellular RNA fractions and other nonspliceosomal snRNAs. CoSI scores decrease along the gene, pointing to a "first transcribed, first spliced" rule, yet more downstream exons carry other characteristics, favoring rapid, co-transcriptional intron removal. Exons with low coSI values, that is, in the process of being spliced, are enriched with chromatin marks, consistent with a role for chromatin in splicing during transcription. For alternative exons and long noncoding RNAs, splicing tends to occur later, and the latter might remain unspliced in some cases.


Asunto(s)
Genoma Humano , Empalme del ARN , ARN Largo no Codificante/metabolismo , Transcripción Genética , Cromatina/metabolismo , Análisis por Conglomerados , Biología Computacional/métodos , Exones , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , ARN/genética , ARN/metabolismo , Análisis de Secuencia de ARN , Empalmosomas/genética , Empalmosomas/metabolismo , Fracciones Subcelulares/química
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA