RESUMEN
Phosphorylation of the carboxyl-terminal domain (CTD) of the largest subunit of RNA polymerase II (Pol II) governs stage-specific interactions with different cellular machines. The CTD consists of Y1S2P3T4S5P6S7 heptad repeats and sequential phosphorylations of Ser7, Ser5 and Ser2 occur universally at Pol II-transcribed genes. Phosphorylation of Thr4, however, appears to selectively modulate transcription of specific classes of genes. Here, we identify ten new Thr4 kinases from different kinase structural groups. Irreversible chemical inhibition of the most active Thr4 kinase, Hrr25, reveals a novel role for this kinase in transcription termination of specific class of noncoding snoRNA genes. Genome-wide profiles of Hrr25 reveal a selective enrichment at 3' regions of noncoding genes that display termination defects. Importantly, phospho-Thr4 marks placed by Hrr25 are recognized by Rtt103, a key component of the termination machinery. Our results suggest that these uncommon CTD kinases place phospho-Thr4 marks to regulate expression of targeted genes.
Asunto(s)
Proteínas Quinasas/metabolismo , ARN Polimerasa II/genética , ARN Polimerasa II/fisiología , Secuencia de Aminoácidos , Quinasa de la Caseína I/metabolismo , Fosforilación , Filogenia , Dominios Proteicos , Proteínas Serina-Treonina Quinasas/metabolismo , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Treonina/metabolismo , Transcripción GenéticaRESUMEN
The carboxyl-terminal domain (CTD) of the largest subunit of RNA polymerase II (Pol II) orchestrates dynamic recruitment of specific cellular machines during different stages of transcription. Signature phosphorylation patterns of Y1S2P3T4S5P6S7 heptapeptide repeats of the CTD engage specific "readers." Whereas phospho-Ser5 and phospho-Ser2 marks are ubiquitous, phospho-Thr4 is reported to only impact specific genes. Here, we identify a role for phospho-Thr4 in transcription termination at noncoding small nucleolar RNA (snoRNA) genes. Quantitative proteomics reveals an interactome of known readers as well as protein complexes that were not known to rely on Thr4 for association with Pol II. The data indicate a key role for Thr4 in engaging the machinery used for transcription elongation and termination. We focus on Rtt103, a protein that binds phospho-Ser2 and phospho-Thr4 marks and facilitates transcription termination at protein-coding genes. To elucidate how Rtt103 engages two distinct CTD modifications that are differentially enriched at noncoding genes, we relied on NMR analysis of Rtt103 in complex with phospho-Thr4- or phospho-Ser2-bearing CTD peptides. The structural data reveal that Rtt103 interacts with phospho-Thr4 in a manner analogous to its interaction with phospho-Ser2-modified CTD. The same set of hydrogen bonds involving either the oxygen on phospho-Thr4 and the hydroxyl on Ser2, or the phosphate on Ser2 and the Thr4 hydroxyl, can be formed by rotation of an arginine side chain, leaving the intermolecular interface otherwise unperturbed. This economy of design enables Rtt103 to engage Pol II at distinct sets of genes with differentially enriched CTD marks.
Asunto(s)
ARN Polimerasa II/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Factores de Transcripción/metabolismo , Secuencia de Aminoácidos , Factores de Terminación de Péptidos/metabolismo , Fosforilación , Dominios Proteicos/fisiología , Isoformas de Proteínas/metabolismo , ARN Polimerasa II/fisiología , ARN Nucleolar Pequeño/metabolismo , ARN Pequeño no Traducido/metabolismo , ARN no Traducido/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/fisiología , Serina/metabolismo , Treonina/metabolismo , Factores de Transcripción/fisiología , Transcripción Genética/genéticaRESUMEN
Stressed cells coordinate a multi-faceted response spanning many levels of physiology. Yet knowledge of the complete stress-activated regulatory network as well as design principles for signal integration remains incomplete. We developed an experimental and computational approach to integrate available protein interaction data with gene fitness contributions, mutant transcriptome profiles, and phospho-proteome changes in cells responding to salt stress, to infer the salt-responsive signaling network in yeast. The inferred subnetwork presented many novel predictions by implicating new regulators, uncovering unrecognized crosstalk between known pathways, and pointing to previously unknown 'hubs' of signal integration. We exploited these predictions to show that Cdc14 phosphatase is a central hub in the network and that modification of RNA polymerase II coordinates induction of stress-defense genes with reduction of growth-related transcripts. We find that the orthologous human network is enriched for cancer-causing genes, underscoring the importance of the subnetwork's predictions in understanding stress biology.
Asunto(s)
Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Ciclo Celular/metabolismo , Biología Computacional/métodos , Perfilación de la Expresión Génica , Regulación Fúngica de la Expresión Génica , Aptitud Genética , Proteínas Tirosina Fosfatasas/metabolismo , ARN Polimerasa II/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Transducción de Señal , Cloruro de Sodio/metabolismo , Estrés FisiológicoRESUMEN
The comparison of gene regulatory networks between diseased versus healthy individuals or between two different treatments is an important scientific problem. Here, we propose sc-compReg as a method for the comparative analysis of gene expression regulatory networks between two conditions using single cell gene expression (scRNA-seq) and single cell chromatin accessibility data (scATAC-seq). Our software, sc-compReg, can be used as a stand-alone package that provides joint clustering and embedding of the cells from both scRNA-seq and scATAC-seq, and the construction of differential regulatory networks across two conditions. We apply the method to compare the gene regulatory networks of an individual with chronic lymphocytic leukemia (CLL) versus a healthy control. The analysis reveals a tumor-specific B cell subpopulation in the CLL patient and identifies TOX2 as a potential regulator of this subpopulation.
Asunto(s)
Redes Reguladoras de Genes , Leucemia Linfocítica Crónica de Células B/genética , Análisis de la Célula Individual/métodos , Linfocitos B , Cromatina , Regulación Neoplásica de la Expresión Génica , Proteínas HMGB , Humanos , ARN Citoplasmático Pequeño , Programas InformáticosRESUMEN
The mammary epithelial cell (MEC) system is a bilayered ductal epithelium of luminal and basal cells, maintained by a lineage of stem and progenitor populations. Here, we used integrated single-cell transcriptomics and chromatin accessibility analysis to reconstruct the cell types of the mouse MEC system and their underlying gene regulatory features in an unbiased manner. We define differentiation states within the secretory type of luminal cells, which forms a continuous spectrum of general luminal progenitor and lactation-committed progenitor cells. By integrating single-cell transcriptomics and chromatin accessibility landscapes, we identify cis- and trans-regulatory elements that are differentially activated in the specific epithelial cell types and our newly defined luminal differentiation states. Our work provides a resource to reveal cis/trans-regulatory elements associated with MEC identity and differentiation that will serve as a reference to determine how the chromatin accessibility landscape changes during breast cancer.
Asunto(s)
Cromatina/genética , Células Epiteliales/metabolismo , Animales , Secuencia de Bases , Diferenciación Celular/genética , Linaje de la Célula , Proliferación Celular/genética , Cromatina/fisiología , Biología Computacional/métodos , Células Epiteliales/fisiología , Epitelio/metabolismo , Femenino , Perfilación de la Expresión Génica/métodos , Regulación de la Expresión Génica , Glándulas Mamarias Animales/citología , Glándulas Mamarias Animales/metabolismo , Ratones , Ratones Endogámicos C57BL , Ratones Endogámicos , Secuencias Reguladoras de Ácidos Nucleicos , Análisis de la Célula Individual/métodos , Células Madre/metabolismo , TranscriptomaRESUMEN
Understanding complex tissues requires single-cell deconstruction of gene regulation with precision and scale. Here, we assess the performance of a massively parallel droplet-based method for mapping transposase-accessible chromatin in single cells using sequencing (scATAC-seq). We apply scATAC-seq to obtain chromatin profiles of more than 200,000 single cells in human blood and basal cell carcinoma. In blood, application of scATAC-seq enables marker-free identification of cell type-specific cis- and trans-regulatory elements, mapping of disease-associated enhancer activity and reconstruction of trajectories of cellular differentiation. In basal cell carcinoma, application of scATAC-seq reveals regulatory networks in malignant, stromal and immune cells in the tumor microenvironment. Analysis of scATAC-seq profiles from serial tumor biopsies before and after programmed cell death protein 1 blockade identifies chromatin regulators of therapy-responsive T cell subsets and reveals a shared regulatory program that governs intratumoral CD8+ T cell exhaustion and CD4+ T follicular helper cell development. We anticipate that scATAC-seq will enable the unbiased discovery of gene regulatory factors across diverse biological systems.
Asunto(s)
Células de la Médula Ósea/metabolismo , Cromatina/química , Análisis de la Célula Individual/métodos , Linfocitos T/metabolismo , Línea Celular , Simulación por Computador , Regulación de la Expresión Génica , Hematopoyesis , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Leucocitos Mononucleares , Factores de Transcripción/metabolismoRESUMEN
The C-terminal domain (CTD) of RNA polymerase II (Pol II) consists of conserved heptapeptide repeats that function as a binding platform for different protein complexes involved in transcription, RNA processing, export, and chromatin remodeling. The CTD repeats are subject to sequential waves of posttranslational modifications during specific stages of the transcription cycle. These patterned modifications have led to the postulation of the "CTD code" hypothesis, where stage-specific patterns define a spatiotemporal code that is recognized by the appropriate interacting partners. Here, we highlight the role of CTD modifications in directing transcription initiation, elongation, and termination. We examine the major readers, writers, and erasers of the CTD code and examine the relevance of describing patterns of posttranslational modifications as a "code." Finally, we discuss major questions regarding the function of the newly discovered CTD modifications and the fundamental insights into transcription regulation that will necessarily emerge upon addressing those challenges.