RESUMO
Transposable element (TE)-derived sequences make up approximately half of most mammalian genomes, and many TEs have been co-opted into gene regulatory elements. However, we lack a comprehensive tissue- and genome-wide understanding of how and when TEs gain regulatory activity in their hosts. We evaluated the prevalence of TE-derived DNA in enhancers and promoters across hundreds of human and mouse cell lines and primary tissues. Promoters are significantly depleted of TEs in all tissues compared with their overall prevalence in the genome (P < 0.001); enhancers are also depleted of TEs, though not as strongly as promoters. The degree of enhancer depletion also varies across contexts (1.5-3×), with reproductive and immune cells showing the highest levels of TE regulatory activity in humans. Overall, in spite of the regulatory potential of many TE sequences, they are significantly less active in gene regulation than expected from their prevalence. TE age is predictive of the likelihood of enhancer activity; TEs originating before the divergence of amniotes are 9.2 times more likely to have enhancer activity than TEs that integrated in great apes. Context-specific enhancers are more likely to be TE-derived than enhancers active in multiple tissues, and young TEs are more likely to overlap context-specific enhancers than old TEs (86% vs. 47%). Once TEs obtain enhancer activity in the host, they have similar functional dynamics to one another and non-TE-derived enhancers, likely driven by pleiotropic constraints. However, a few TE families, most notably endogenous retroviruses, have greater regulatory potential. Our observations suggest a model of regulatory co-option in which TE-derived sequences are initially repressed, after which a small fraction obtains context-specific enhancer activity, with further gains subject to pleiotropic constraints.
Assuntos
Elementos de DNA Transponíveis/fisiologia , Sequências Reguladoras de Ácido Nucleico/genética , Animais , Evolução Biológica , Elementos de DNA Transponíveis/genética , Regulação da Expressão Gênica/genética , Pleiotropia Genética/genética , Humanos , Camundongos , Regiões Promotoras Genéticas/genéticaRESUMO
BACKGROUND: RPB1, the largest subunit of RNA polymerase II, contains a highly modifiable C-terminal domain (CTD) that consists of variations of a consensus heptad repeat sequence (Y1S2P3T4S5P6S7). The consensus CTD repeat motif and tandem organization represent the ancestral state of eukaryotic RPB1, but across eukaryotes CTDs show considerable diversity in repeat organization and sequence content. These differences may reflect lineage-specific CTD functions mediated by protein interactions. Mammalian CTDs contain eight non-consensus repeats with a lysine in the seventh position (K7). Posttranslational acetylation of these sites was recently shown to be required for proper polymerase pausing and regulation of two growth factor-regulated genes. RESULTS: To investigate the origins and function of RPB1 CTD acetylation (acRPB1), we computationally reconstructed the evolution of the CTD repeat sequence across eukaryotes and analyzed the evolution and function of genes dysregulated when acRPB1 is disrupted. Modeling the evolutionary dynamics of CTD repeat count and sequence content across diverse eukaryotes revealed an expansion of the CTD in the ancestors of Metazoa. The new CTD repeats introduced the potential for acRPB1 due to the appearance of distal repeats with lysine at position seven. This was followed by a further increase in the number of lysine-containing repeats in developmentally complex clades like Deuterostomia. Mouse genes enriched for acRPB1 occupancy at their promoters and genes with significant expression changes when acRPB1 is disrupted are enriched for several functions, such as growth factor response, gene regulation, cellular adhesion, and vascular development. Genes occupied and regulated by acRPB1 show significant enrichment for evolutionary origins in the early history of eukaryotes through early vertebrates. CONCLUSIONS: Our combined functional and evolutionary analyses show that RPB1 CTD acetylation was possible in the early history of animals, and that the K7 content of the CTD expanded in specific developmentally complex metazoan lineages. The functional analysis of genes regulated by acRPB1 highlight functions involved in the origin of and diversification of complex Metazoa. This suggests that acRPB1 may have played a role in the success of animals.
Assuntos
Evolução Molecular , Lisina/metabolismo , Mamíferos/genética , RNA Polimerase II/metabolismo , Acetilação , Animais , Adesão Celular , Processos de Crescimento Celular , Regulação da Expressão Gênica , Humanos , Mamíferos/metabolismo , Regiões Promotoras Genéticas , Estrutura Terciária de Proteína , RNA Polimerase II/química , Vertebrados/genética , Vertebrados/metabolismoRESUMO
Editors' selections from the current scientific literature.
RESUMO
Editors' selections from the current scientific literature.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.
RESUMO
Highlights from the Science family of journals.