RESUMO
Transcription and translation are two main pillars of gene expression. Due to the different timings, spots of action, and mechanisms of regulation, these processes are mainly regarded as distinct and generally uncoupled, despite serving a common purpose. Here, we sought for a possible connection between transcription and translation. Employing an unbiased screen of multiple human promoters, we identified a positive effect of TATA box on translation and a general coupling between mRNA expression and translational efficiency. Using a CRISPR-Cas9-mediated approach, genome-wide analyses, and in vitro experiments, we show that the rate of transcription regulates the efficiency of translation. Furthermore, we demonstrate that m6A modification of mRNAs is co-transcriptional and depends upon the dynamics of the transcribing RNAPII. Suboptimal transcription rates lead to elevated m6A content, which may result in reduced translation. This study uncovers a general and widespread link between transcription and translation that is governed by epigenetic modification of mRNAs.
Assuntos
Adenosina/análogos & derivados , Regulação da Expressão Gênica , Biossíntese de Proteínas , Processamento Pós-Transcricional do RNA , RNA Mensageiro/metabolismo , Transcrição Gênica , Adenosina/metabolismo , Humanos , Metilação , Iniciação Traducional da Cadeia Peptídica , RNA Polimerase II/metabolismo , TATA BoxRESUMO
The RNA polymerase II core promoter is the site of convergence of the signals that lead to the initiation of transcription. Here, we performed a comparative analysis of the downstream core promoter region (DPR) in Drosophila and humans by using machine learning. These studies revealed a distinct human-specific version of the DPR and led to the use of machine learning models for the identification of synthetic extreme DPR motifs with specificity for human transcription factors relative to Drosophila factors and vice versa. More generally, machine learning models could similarly be used to design synthetic DNA elements with customized functional properties.
Assuntos
Drosophila , Fatores de Transcrição , Animais , Humanos , Drosophila/genética , Drosophila/metabolismo , TATA Box , Regiões Promotoras Genéticas/genética , Fatores de Transcrição/genética , RNA Polimerase II/metabolismo , Transcrição GênicaRESUMO
RNA polymerase II (RNA Pol II) transcription reconstituted from purified factors suggests pre-initiation complexes (PICs) can assemble by sequential incorporation of factors at the TATA box. However, these basal transcription reactions are generally independent of activators and co-activators. To study PIC assembly under more realistic conditions, we used single-molecule microscopy to visualize factor dynamics during activator-dependent reactions in nuclear extracts. Surprisingly, RNA Pol II, TFIIF, and TFIIE can pre-assemble on enhancer-bound activators before loading into PICs, and multiple RNA Pol II complexes can bind simultaneously to create a localized cluster. Unlike TFIIF and TFIIE, TFIIH binding is singular and dependent on the basal promoter. Activator-tethered factors exhibit dwell times on the order of seconds. In contrast, PICs can persist on the order of minutes in the absence of nucleotide triphosphates, although TFIIE remains unexpectedly dynamic even after TFIIH incorporation. Our kinetic measurements lead to a new branched model for activator-dependent PIC assembly.
Assuntos
Complexo Mediador/metabolismo , RNA Polimerase II/metabolismo , Iniciação da Transcrição Genética/fisiologia , Núcleo Celular/metabolismo , Complexo Mediador/genética , Regiões Promotoras Genéticas , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Imagem Individual de Molécula , TATA Box/genética , Proteína de Ligação a TATA-Box/genética , Fator de Transcrição TFIIH/metabolismo , Fatores de Transcrição TFII/metabolismo , Transcrição Gênica/genéticaRESUMO
Histone modifications regulate chromatin-dependent processes, yet the mechanisms by which they contribute to specific outcomes remain unclear. H3K4me3 is a prominent histone mark that is associated with active genes and promotes transcription through interactions with effector proteins that include initiation factor TFIID. We demonstrate that H3K4me3-TAF3 interactions direct global TFIID recruitment to active genes, some of which are p53 targets. Further analyses show that (1) H3K4me3 enhances p53-dependent transcription by stimulating preinitiation complex (PIC) formation; (2) H3K4me3, through TAF3 interactions, can act either independently or cooperatively with the TATA box to direct PIC formation and transcription; and (3) H3K4me3-TAF3/TFIID interactions regulate gene-selective functions of p53 in response to genotoxic stress. Our findings indicate a mechanism by which H3K4me3 directs PIC assembly for the rapid induction of specific p53 target genes.
Assuntos
Código das Histonas , Histonas/metabolismo , Fator de Transcrição TFIID/metabolismo , Iniciação da Transcrição Genética , Linhagem Celular Tumoral , Humanos , Lisina/metabolismo , Metilação , TATA Box , Fatores Associados à Proteína de Ligação a TATA , Ativação Transcricional , Proteína Supressora de Tumor p53/metabolismo , Fatores de Transcrição de p300-CBP/metabolismoRESUMO
A mechanistic description of metazoan transcription is essential for understanding the molecular processes that govern cellular decisions. To provide structural insights into the DNA recognition step of transcription initiation, we used single-particle electron microscopy (EM) to visualize human TFIID with promoter DNA. This analysis revealed that TFIID coexists in two predominant and distinct structural states that differ by a 100 Å translocation of TFIID's lobe A. The transition between these structural states is modulated by TFIIA, as the presence of TFIIA and promoter DNA facilitates the formation of a rearranged state of TFIID that enables promoter recognition and binding. DNA labeling and footprinting, together with cryo-EM studies, were used to map the locations of TATA, Initiator (Inr), motif ten element (MTE), and downstream core promoter element (DPE) promoter motifs within the TFIID-TFIIA-DNA structure. The existence of two structurally and functionally distinct forms of TFIID suggests that the different conformers may serve as specific targets for the action of regulatory factors.
Assuntos
Regiões Promotoras Genéticas , Fator de Transcrição TFIID/química , Fator de Transcrição TFIID/metabolismo , Transcrição Gênica , Microscopia Crioeletrônica , DNA/genética , Humanos , Conformação Proteica , RNA Polimerase II/química , RNA Polimerase II/metabolismo , TATA Box , Fator de Transcrição TFIIA/metabolismo , Fator de Transcrição TFIID/ultraestrutura , Fatores de Transcrição/química , Fatores de Transcrição/metabolismoRESUMO
Core promoters are sites where transcriptional regulatory inputs of a gene are integrated to direct the assembly of the preinitiation complex (PIC) and RNA polymerase II (Pol II) transcription output. Until now, core promoter functions have been investigated by distinct methods, including Pol II transcription initiation site mappings and structural characterization of PICs on distinct promoters. Here, we bring together these previously unconnected observations and hypothesize how, on metazoan TATA promoters, the precisely structured building up of transcription factor (TF) IID-based PICs results in sharp transcription start site (TSS) selection; or, in contrast, how the less strictly controlled positioning of the TATA-less promoter DNA relative to TFIID-core PIC components results in alternative broad TSS selections by Pol II.
Assuntos
Fator de Transcrição TFIID , Transcrição Gênica , Animais , Fator de Transcrição TFIID/genética , Fator de Transcrição TFIID/metabolismo , TATA Box , Regiões Promotoras Genéticas , RNA Polimerase II/metabolismoRESUMO
The initiation of transcription is a focal point for the regulation of gene activity during mammalian cell differentiation and development. To initiate transcription, RNA polymerase II (Pol II) assembles with general transcription factors into a pre-initiation complex (PIC) that opens promoter DNA. Previous work provided the molecular architecture of the yeast1-9 and human10,11 PIC and a topological model for DNA opening by the general transcription factor TFIIH12-14. Here we report the high-resolution cryo-electron microscopy structure of PIC comprising human general factors and Sus scrofa domesticus Pol II, which is 99.9% identical to human Pol II. We determine the structures of PIC with closed and opened promoter DNA at 2.5-2.8 Å resolution, and resolve the structure of TFIIH at 2.9-4.0 Å resolution. We capture the TFIIH translocase XPB in the pre- and post-translocation states, and show that XPB induces and propagates a DNA twist to initiate the opening of DNA approximately 30 base pairs downstream of the TATA box. We also provide evidence that DNA opening occurs in two steps and leads to the detachment of TFIIH from the core PIC, which may stop DNA twisting and enable RNA chain initiation.
Assuntos
DNA/química , DNA/metabolismo , Regiões Promotoras Genéticas/genética , RNA Polimerase II/química , RNA Polimerase II/metabolismo , Animais , Pareamento de Bases , DNA/genética , DNA Helicases/metabolismo , Proteínas de Ligação a DNA/metabolismo , Humanos , Mamíferos/genética , Modelos Moleculares , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/genética , TATA Box/genética , Fator de Transcrição TFIIH/química , Fator de Transcrição TFIIH/metabolismo , Sítio de Iniciação de Transcrição , Iniciação da Transcrição GenéticaRESUMO
The RNA polymerase II (Pol II) core promoter is the strategic site of convergence of the signals that lead to the initiation of DNA transcription1-5, but the downstream core promoter in humans has been difficult to understand1-3. Here we analyse the human Pol II core promoter and use machine learning to generate predictive models for the downstream core promoter region (DPR) and the TATA box. We developed a method termed HARPE (high-throughput analysis of randomized promoter elements) to create hundreds of thousands of DPR (or TATA box) variants, each with known transcriptional strength. We then analysed the HARPE data by support vector regression (SVR) to provide comprehensive models for the sequence motifs, and found that the SVR-based approach is more effective than a consensus-based method for predicting transcriptional activity. These results show that the DPR is a functionally important core promoter element that is widely used in human promoters. Notably, there appears to be a duality between the DPR and the TATA box, as many promoters contain one or the other element. More broadly, these findings show that functional DNA motifs can be identified by machine learning analysis of a comprehensive set of sequence variants.
Assuntos
Sequência Consenso/genética , Regulação da Expressão Gênica/genética , Regiões Promotoras Genéticas/genética , RNA Polimerase II/metabolismo , Máquina de Vetores de Suporte , Transcrição Gênica , Sequência de Bases , Células/metabolismo , Simulação por Computador , Conjuntos de Dados como Assunto , Células HeLa , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Modelos Genéticos , Mutagênese , TATA Box/genéticaRESUMO
Viruses are master remodelers of the host cell environment in support of infection and virus production. For example, viruses typically regulate cell gene expression through modulating canonical cell promoter activity. Here, we show that Epstein Barr virus (EBV) replication causes 'de novo' transcription initiation at 29674 new transcription start sites throughout the cell genome. De novo transcription initiation is facilitated in part by the unique properties of the viral pre-initiation complex (vPIC) that binds a TATT[T/A]AA, TATA box-like sequence and activates transcription with minimal support by additional transcription factors. Other de novo promoters are driven by the viral transcription factors, Zta and Rta and are influenced by directional proximity to existing canonical cell promoters, a configuration that fosters transcription through existing promoters and transcriptional interference. These studies reveal a new way that viruses interact with the host transcriptome to inhibit host gene expression and they shed light on primal features driving eukaryotic promoter function.
Assuntos
Infecções por Vírus Epstein-Barr , Herpesvirus Humano 4 , Iniciação da Transcrição Genética , Replicação Viral , Humanos , Herpesvirus Humano 4/fisiologia , Interações Hospedeiro-Patógeno , Regiões Promotoras Genéticas , TATA Box , Fatores de Transcrição/metabolismo , Sítio de Iniciação de Transcrição , Transcrição Gênica , Proteínas Virais/metabolismo , Proteínas Virais/genética , Infecções por Vírus Epstein-Barr/metabolismo , Infecções por Vírus Epstein-Barr/virologiaRESUMO
RNA polymerase II (Pol II) small nuclear RNA (snRNA) promoters and type 3 Pol III promoters have highly similar structures; both contain an interchangeable enhancer and "proximal sequence element" (PSE), which recruits the SNAP complex (SNAPc). The main distinguishing feature is the presence, in the type 3 promoters only, of a TATA box, which determines Pol III specificity. To understand the mechanism by which the absence or presence of a TATA box results in specific Pol recruitment, we examined how SNAPc and general transcription factors required for Pol II or Pol III transcription of SNAPc-dependent genes (i.e., TATA-box-binding protein [TBP], TFIIB, and TFIIA for Pol II transcription and TBP and BRF2 for Pol III transcription) assemble to ensure specific Pol recruitment. TFIIB and BRF2 could each, in a mutually exclusive fashion, be recruited to SNAPc. In contrast, TBP-TFIIB and TBP-BRF2 complexes were not recruited unless a TATA box was present, which allowed selective and efficient recruitment of the TBP-BRF2 complex. Thus, TBP both prevented BRF2 recruitment to Pol II promoters and enhanced BRF2 recruitment to Pol III promoters. On Pol II promoters, TBP recruitment was separate from TFIIB recruitment and enhanced by TFIIA. Our results provide a model for specific Pol recruitment at SNAPc-dependent promoters.
Assuntos
Regiões Promotoras Genéticas , RNA Polimerase III/metabolismo , RNA Polimerase II/metabolismo , RNA Nuclear Pequeno/genética , RNA Nuclear Pequeno/metabolismo , Células HEK293 , Humanos , Mutação , Ligação Proteica , Domínios Proteicos , Transporte Proteico , TATA Box/genética , Proteína de Ligação a TATA-Box/metabolismo , Fator de Transcrição TFIIB/metabolismo , Fatores de Transcrição/metabolismoRESUMO
Mammalian gene expression is inherently stochastic1,2, and results in discrete bursts of RNA molecules that are synthesized from each allele3-7. Although transcription is known to be regulated by promoters and enhancers, it is unclear how cis-regulatory sequences encode transcriptional burst kinetics. Characterization of transcriptional bursting, including the burst size and frequency, has mainly relied on live-cell4,6,8 or single-molecule RNA fluorescence in situ hybridization3,5,8,9 recordings of selected loci. Here we determine transcriptome-wide burst frequencies and sizes for endogenous mouse and human genes using allele-sensitive single-cell RNA sequencing. We show that core promoter elements affect burst size and uncover synergistic effects between TATA and initiator elements, which were masked at mean expression levels. Notably, we provide transcriptome-wide evidence that enhancers control burst frequencies, and demonstrate that cell-type-specific gene expression is primarily shaped by changes in burst frequencies. Together, our data show that burst frequency is primarily encoded in enhancers and burst size in core promoters, and that allelic single-cell RNA sequencing is a powerful model for investigating transcriptional kinetics.
Assuntos
Genes/genética , Genômica , Transcrição Gênica/genética , Alelos , Animais , Elementos Facilitadores Genéticos/genética , Fibroblastos/metabolismo , Humanos , Cinética , Masculino , Camundongos , Células-Tronco Embrionárias Murinas/metabolismo , Especificidade de Órgãos/genética , Polimorfismo Genético , Regiões Promotoras Genéticas/genética , Análise de Sequência de RNA , Deleção de Sequência , Análise de Célula Única , Processos Estocásticos , TATA Box/genética , Transcriptoma/genéticaRESUMO
Transcriptional cofactors (COFs) communicate regulatory cues from enhancers to promoters and are central effectors of transcription activation and gene expression1. Although some COFs have been shown to prefer certain promoter types2-5 over others (for example, see refs 6,7), the extent to which different COFs display intrinsic specificities for distinct promoters is unclear. Here we use a high-throughput promoter-activity assay in Drosophila melanogaster S2 cells to screen 23 COFs for their ability to activate 72,000 candidate core promoters (CPs). We observe differential activation of CPs, indicating distinct regulatory preferences or 'compatibilities'8,9 between COFs and specific types of CPs. These functionally distinct CP types are differentially enriched for known sequence elements2,4, such as the TATA box, downstream promoter element (DPE) or TCT motif, and display distinct chromatin properties at endogenous loci. Notably, the CP types differ in their relative abundance of H3K4me3 and H3K4me1 marks (see also refs 10-12), suggesting that these histone modifications might distinguish trans-regulatory factors rather than promoter- versus enhancer-type cis-regulatory elements. We confirm the existence of distinct COF-CP compatibilities in two additional Drosophila cell lines and in human cells, for which we find COFs that prefer TATA-box or CpG-island promoters, respectively. Distinct compatibilities between COFs and promoters can explain how different enhancers specifically activate distinct sets of genes9, alternative promoters within the same genes, and distinct transcription start sites within the same promoter13. Thus, COF-promoter compatibilities may underlie distinct transcriptional programs in species as divergent as flies and humans.
Assuntos
Regiões Promotoras Genéticas/genética , Fatores de Transcrição/metabolismo , Transcrição Gênica , Ativação Transcricional , Animais , Linhagem Celular , Cromatina/genética , Ilhas de CpG/genética , Drosophila melanogaster/citologia , Drosophila melanogaster/genética , Elementos Facilitadores Genéticos/genética , Histonas/metabolismo , Humanos , Especificidade por Substrato , TATA Box/genética , Sítio de Iniciação de TranscriçãoRESUMO
Transcription initiation entails chromatin opening followed by pre-initiation complex formation and RNA polymerase II recruitment. Subsequent polymerase elongation requires additional signals, resulting in increased residence time downstream of the start site, a phenomenon referred to as pausing. Here, we harnessed single-molecule footprinting to quantify distinct steps of initiation in vivo throughout the Drosophila genome. This identifies the impact of promoter structure on initiation dynamics in relation to nucleosomal occupancy. Additionally, perturbation of transcriptional initiation reveals an unexpectedly high turnover of polymerases at paused promoters-an observation confirmed at the level of nascent RNAs. These observations argue that absence of elongation is largely caused by premature termination rather than by stable polymerase stalling. In support of this non-processive model, we observe that induction of the paused heat shock promoter depends on continuous initiation. Our study provides a framework to quantify protein binding at single-molecule resolution and refines concepts of transcriptional pausing.
Assuntos
DNA/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/enzimologia , Regiões Promotoras Genéticas , RNA Polimerase II/metabolismo , RNA/biossíntese , Imagem Individual de Molécula , Transcrição Gênica , Animais , Sítios de Ligação , DNA/genética , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Estudo de Associação Genômica Ampla , Proteínas de Choque Térmico HSP70/genética , Proteínas de Choque Térmico HSP70/metabolismo , Meia-Vida , Cinética , Ligação Proteica , Estabilidade Proteica , Proteólise , RNA/genética , RNA Polimerase II/genética , TATA Box , Sítio de Iniciação de Transcrição , Iniciação da Transcrição Genética , Terminação da Transcrição GenéticaRESUMO
Transcription by RNA polymerase II (Pol II) is dictated in part by core promoter elements, which are DNA sequences flanking the transcription start site (TSS) that help direct the proper initiation of transcription. Taking advantage of recent advances in genome-wide sequencing approaches, Vo ngoc and colleagues (pp. 6-11) identified transcripts with focused sites of initiation and found that many were transcribed from promoters containing a new consensus sequence for the human initiator (Inr) core promoter element.
Assuntos
Regiões Promotoras Genéticas , Sítio de Iniciação de Transcrição , Sequência de Bases , Sequência Consenso , Humanos , RNA Polimerase II/genética , TATA Box , Transcrição GênicaRESUMO
DNA sequence signals in the core promoter, such as the initiator (Inr), direct transcription initiation by RNA polymerase II. Here we show that the human Inr has the consensus of BBCA+1BW at focused promoters in which transcription initiates at a single site or a narrow cluster of sites. The analysis of 7678 focused transcription start sites revealed 40% with a perfect match to the Inr and 16% with a single mismatch outside of the CA+1 core. TATA-like sequences are underrepresented in Inr promoters. This consensus is a key component of the DNA sequence rules that specify transcription initiation in humans.
Assuntos
Regiões Promotoras Genéticas/genética , Sítio de Iniciação de Transcrição , Sequência Conservada/genética , Análise Mutacional de DNA , Humanos , Células MCF-7 , Mutação , Homologia de Sequência do Ácido Nucleico , TATA Box/genéticaRESUMO
A nucleosome is typically positioned with its proximal edge (NPE) â¼50 bp downstream from the transcription start site of metazoan RNA polymerase II promoters. This +1 nucleosome has distinctive characteristics, including the presence of variant histone types and trimethylation of histone H3 at lysine 4. To address the role of these features in transcription complex assembly, we generated templates with four different promoters and nucleosomes located at a variety of downstream positions, which were transcribed in vitro using HeLa nuclear extracts. Two promoters lacked TATA elements, but all supported strong initiation from a single transcription start site. In contrast to results with minimal in vitro systems based on the TATA-binding protein (TBP), TATA promoter templates with a +51 NPE were transcriptionally inhibited in extracts; activity continuously increased as the nucleosome was moved downstream to +100. Inhibition was much more pronounced for the TATA-less promoters: +51 NPE templates were inactive, and substantial activity was only seen with the +100 NPE templates. Substituting the histone variants H2A.Z, H3.3, or both did not eliminate the inhibition. However, addition of excess TBP restored activity on nucleosomal templates with TATA promoters, even with an NPE at +20. Remarkably, nucleosomal templates with histone H3 trimethylated at lysine 4 are active with an NPE at +51 for both TATA and TATA-less promoters. Our results strongly suggest that the +1 nucleosome interferes with promoter recognition by TFIID. This inhibition can be overcome with TBP alone at TATA promoters or through positive interactions with histone modifications and TFIID.
Assuntos
RNA Polimerase II , Fator de Transcrição TFIID , Animais , Humanos , Fator de Transcrição TFIID/genética , Fator de Transcrição TFIID/metabolismo , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , Fatores de Transcrição/metabolismo , Nucleossomos/genética , Transcrição Gênica , Histonas/metabolismo , Lisina/genética , Proteína de Ligação a TATA-Box/genética , Proteína de Ligação a TATA-Box/metabolismo , TATA Box , Sequência de BasesRESUMO
The effects of rapid acute depletion of components of RNA polymerase II (Pol II) general transcription factors (GTFs) that are thought to be critical for formation of preinitiation complexes (PICs) and initiation in vitro were quantified in HAP1 cells using precision nuclear run-on sequencing (PRO-Seq). The average dependencies for each factor across >70 000 promoters varied widely even though levels of depletions were similar. Some of the effects could be attributed to the presence or absence of core promoter elements such as the upstream TBP-specificity motif or downstream G-rich sequences, but some dependencies anti-correlated with such sequences. While depletion of TBP had a large effect on most Pol III promoters only a small fraction of Pol II promoters were similarly affected. TFIIB depletion had the largest general effect on Pol II and also correlated with apparent termination defects downstream of genes. Our results demonstrate that promoter activity is combinatorially influenced by recruitment of TFIID and sequence-specific transcription factors. They also suggest that interaction of the preinitiation complex (PIC) with nucleosomes can affect activity and that recruitment of TFIID containing TBP only plays a positive role at a subset of promoters.
Assuntos
RNA Polimerase II , Fatores de Transcrição , Humanos , Fator de Transcrição TFIIB/genética , Fator de Transcrição TFIIB/metabolismo , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , Regiões Promotoras Genéticas , Fatores de Transcrição/genética , Fator de Transcrição TFIID/genética , Fator de Transcrição TFIID/metabolismo , Proteína de Ligação a TATA-Box/genética , Proteína de Ligação a TATA-Box/metabolismo , Transcrição Gênica , TATA Box/genética , RNA Polimerase III/genéticaRESUMO
The structural maintenance of chromosome (SMC) complex cohesin mediates sister chromatid cohesion established during replication, and damage-induced cohesion formed in response to DSBs post-replication. The translesion synthesis polymerase Polη is required for damage-induced cohesion through a hitherto unknown mechanism. Since Polη is functionally associated with transcription, and transcription triggers de novo cohesion in Schizosaccharomyces pombe, we hypothesized that transcription facilitates damage-induced cohesion in Saccharomyces cerevisiae. Here, we show dysregulated transcriptional profiles in the Polη null mutant (rad30Δ), where genes involved in chromatin assembly and positive transcription regulation were downregulated. In addition, chromatin association of RNA polymerase II was reduced at promoters and coding regions in rad30Δ compared to WT cells, while occupancy of the H2A.Z variant (Htz1) at promoters was increased in rad30Δ cells. Perturbing histone exchange at promoters inactivated damage-induced cohesion, similarly to deletion of the RAD30 gene. Conversely, altering regulation of transcription elongation suppressed the deficient damage-induced cohesion in rad30Δ cells. Furthermore, transcription inhibition negatively affected formation of damage-induced cohesion. These results indicate that the transcriptional deregulation of the Polη null mutant is connected with its reduced capacity to establish damage-induced cohesion. This also suggests a linkage between regulation of transcription and formation of damage-induced cohesion after replication.
Assuntos
Proteínas de Ciclo Celular/biossíntese , Proteínas Cromossômicas não Histona/biossíntese , DNA Polimerase Dirigida por DNA/genética , RNA Polimerase II/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/enzimologia , Transcrição Gênica , Cromatina/metabolismo , DNA Polimerase Dirigida por DNA/metabolismo , Regulação Enzimológica da Expressão Gênica , Regulação Fúngica da Expressão Gênica , Genes Fúngicos , Mutação , Regiões Promotoras Genéticas , RNA Polimerase II/metabolismo , Saccharomyces cerevisiae/genética , TATA Box , CoesinasRESUMO
Since the discovery of physical peculiarities around transcription start sites (TSSs) and a site corresponding to the TATA box, research has revealed only the average features of these sites. Unsettled enigmas include the individual genes with these features and whether they relate to gene function. Herein, using 10 physical properties of DNA, including duplex DNA free energy, base stacking energy, protein-induced deformability, and stabilizing energy of Z-DNA, we clarified for the first time that approximately 97% of the promoters of 21,056 human protein-coding genes have distinctive physical properties around the TSS and/or position -27; of these, nearly 65% exhibited such properties at both sites. Furthermore, about 55% of the 21,056 genes had a minimum value of regional duplex DNA free energy within TSS-centered ±300 bp regions. Notably, distinctive physical properties within the promoters and free energies of the surrounding regions separated human protein-coding genes into five groups; each contained specific gene ontology (GO) terms. The group represented by immune response genes differed distinctly from the other four regarding the parameter of the free energies of the surrounding regions. A vital suggestion from this study is that physical-feature-based analyses of genomes may reveal new aspects of the organization and regulation of genes.
Assuntos
DNA , Humanos , Regiões Promotoras Genéticas , TATA Box/genética , Sítio de Iniciação de TranscriçãoRESUMO
The precise and accurate identification and quantification of transcriptional start sites (TSSs) is key to understanding the control of transcription. The core promoter consists of the TSS and proximal non-coding sequences, which are critical in transcriptional regulation. Therefore, the accurate identification of TSSs is important for understanding the molecular regulation of transcription. Existing protocols for TSS identification are challenging and expensive, leaving high-quality data available for a small subset of organisms. This sparsity of data impairs study of TSS usage across tissues or in an evolutionary context. To address these shortcomings, we developed Smart-Seq2 Rolling Circle to Concatemeric Consensus (Smar2C2), which identifies and quantifies TSSs and transcription termination sites. Smar2C2 incorporates unique molecular identifiers that allowed for the identification of as many as 70 million sites, with no known upper limit. We have also generated TSS data sets from as little as 40 pg of total RNA, which was the smallest input tested. In this study, we used Smar2C2 to identify TSSs in Glycine max (soybean), Oryza sativa (rice), Sorghum bicolor (sorghum), Triticum aestivum (wheat) and Zea mays (maize) across multiple tissues. This wide panel of plant TSSs facilitated the identification of evolutionarily conserved features, such as novel patterns in the dinucleotides that compose the initiator element (Inr), that correlated with promoter expression levels across all species examined. We also discovered sequence variations in known promoter motifs that are positioned reliably close to the TSS, such as differences in the TATA box and in the Inr that may prove significant to our understanding and control of transcription initiation. Smar2C2 allows for the easy study of these critical sequences, providing a tool to facilitate discovery.