ABSTRACT
Phosphorylation of the carboxyl-terminal domain (CTD) of the largest subunit of RNA polymerase II (Pol II) governs stage-specific interactions with different cellular machines. The CTD consists of Y1S2P3T4S5P6S7 heptad repeats and sequential phosphorylations of Ser7, Ser5 and Ser2 occur universally at Pol II-transcribed genes. Phosphorylation of Thr4, however, appears to selectively modulate transcription of specific classes of genes. Here, we identify ten new Thr4 kinases from different kinase structural groups. Irreversible chemical inhibition of the most active Thr4 kinase, Hrr25, reveals a novel role for this kinase in transcription termination of specific class of noncoding snoRNA genes. Genome-wide profiles of Hrr25 reveal a selective enrichment at 3' regions of noncoding genes that display termination defects. Importantly, phospho-Thr4 marks placed by Hrr25 are recognized by Rtt103, a key component of the termination machinery. Our results suggest that these uncommon CTD kinases place phospho-Thr4 marks to regulate expression of targeted genes.
Subject(s)
Protein Kinases/metabolism , RNA Polymerase II/genetics , RNA Polymerase II/physiology , Amino Acid Sequence , Casein Kinase I/metabolism , Phosphorylation , Phylogeny , Protein Domains , Protein Serine-Threonine Kinases/metabolism , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Threonine/metabolism , Transcription, GeneticABSTRACT
The carboxyl-terminal domain (CTD) of the largest subunit of RNA polymerase II (Pol II) orchestrates dynamic recruitment of specific cellular machines during different stages of transcription. Signature phosphorylation patterns of Y1S2P3T4S5P6S7 heptapeptide repeats of the CTD engage specific "readers." Whereas phospho-Ser5 and phospho-Ser2 marks are ubiquitous, phospho-Thr4 is reported to only impact specific genes. Here, we identify a role for phospho-Thr4 in transcription termination at noncoding small nucleolar RNA (snoRNA) genes. Quantitative proteomics reveals an interactome of known readers as well as protein complexes that were not known to rely on Thr4 for association with Pol II. The data indicate a key role for Thr4 in engaging the machinery used for transcription elongation and termination. We focus on Rtt103, a protein that binds phospho-Ser2 and phospho-Thr4 marks and facilitates transcription termination at protein-coding genes. To elucidate how Rtt103 engages two distinct CTD modifications that are differentially enriched at noncoding genes, we relied on NMR analysis of Rtt103 in complex with phospho-Thr4- or phospho-Ser2-bearing CTD peptides. The structural data reveal that Rtt103 interacts with phospho-Thr4 in a manner analogous to its interaction with phospho-Ser2-modified CTD. The same set of hydrogen bonds involving either the oxygen on phospho-Thr4 and the hydroxyl on Ser2, or the phosphate on Ser2 and the Thr4 hydroxyl, can be formed by rotation of an arginine side chain, leaving the intermolecular interface otherwise unperturbed. This economy of design enables Rtt103 to engage Pol II at distinct sets of genes with differentially enriched CTD marks.
Subject(s)
RNA Polymerase II/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Transcription Factors/metabolism , Amino Acid Sequence , Peptide Termination Factors/metabolism , Phosphorylation , Protein Domains/physiology , Protein Isoforms/metabolism , RNA Polymerase II/physiology , RNA, Small Nucleolar/metabolism , RNA, Small Untranslated/metabolism , RNA, Untranslated/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/physiology , Serine/metabolism , Threonine/metabolism , Transcription Factors/physiology , Transcription, Genetic/geneticsABSTRACT
Stressed cells coordinate a multi-faceted response spanning many levels of physiology. Yet knowledge of the complete stress-activated regulatory network as well as design principles for signal integration remains incomplete. We developed an experimental and computational approach to integrate available protein interaction data with gene fitness contributions, mutant transcriptome profiles, and phospho-proteome changes in cells responding to salt stress, to infer the salt-responsive signaling network in yeast. The inferred subnetwork presented many novel predictions by implicating new regulators, uncovering unrecognized crosstalk between known pathways, and pointing to previously unknown 'hubs' of signal integration. We exploited these predictions to show that Cdc14 phosphatase is a central hub in the network and that modification of RNA polymerase II coordinates induction of stress-defense genes with reduction of growth-related transcripts. We find that the orthologous human network is enriched for cancer-causing genes, underscoring the importance of the subnetwork's predictions in understanding stress biology.
Subject(s)
Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Cell Cycle Proteins/metabolism , Computational Biology/methods , Gene Expression Profiling , Gene Expression Regulation, Fungal , Genetic Fitness , Protein Tyrosine Phosphatases/metabolism , RNA Polymerase II/metabolism , Saccharomyces cerevisiae Proteins/genetics , Signal Transduction , Sodium Chloride/metabolism , Stress, PhysiologicalABSTRACT
The comparison of gene regulatory networks between diseased versus healthy individuals or between two different treatments is an important scientific problem. Here, we propose sc-compReg as a method for the comparative analysis of gene expression regulatory networks between two conditions using single cell gene expression (scRNA-seq) and single cell chromatin accessibility data (scATAC-seq). Our software, sc-compReg, can be used as a stand-alone package that provides joint clustering and embedding of the cells from both scRNA-seq and scATAC-seq, and the construction of differential regulatory networks across two conditions. We apply the method to compare the gene regulatory networks of an individual with chronic lymphocytic leukemia (CLL) versus a healthy control. The analysis reveals a tumor-specific B cell subpopulation in the CLL patient and identifies TOX2 as a potential regulator of this subpopulation.
Subject(s)
Gene Regulatory Networks , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Single-Cell Analysis/methods , B-Lymphocytes , Chromatin , Gene Expression Regulation, Neoplastic , HMGB Proteins , Humans , RNA, Small Cytoplasmic , SoftwareABSTRACT
The mammary epithelial cell (MEC) system is a bilayered ductal epithelium of luminal and basal cells, maintained by a lineage of stem and progenitor populations. Here, we used integrated single-cell transcriptomics and chromatin accessibility analysis to reconstruct the cell types of the mouse MEC system and their underlying gene regulatory features in an unbiased manner. We define differentiation states within the secretory type of luminal cells, which forms a continuous spectrum of general luminal progenitor and lactation-committed progenitor cells. By integrating single-cell transcriptomics and chromatin accessibility landscapes, we identify cis- and trans-regulatory elements that are differentially activated in the specific epithelial cell types and our newly defined luminal differentiation states. Our work provides a resource to reveal cis/trans-regulatory elements associated with MEC identity and differentiation that will serve as a reference to determine how the chromatin accessibility landscape changes during breast cancer.
Subject(s)
Chromatin/genetics , Epithelial Cells/metabolism , Animals , Base Sequence , Cell Differentiation/genetics , Cell Lineage , Cell Proliferation/genetics , Chromatin/physiology , Computational Biology/methods , Epithelial Cells/physiology , Epithelium/metabolism , Female , Gene Expression Profiling/methods , Gene Expression Regulation , Mammary Glands, Animal/cytology , Mammary Glands, Animal/metabolism , Mice , Mice, Inbred C57BL , Mice, Inbred Strains , Regulatory Sequences, Nucleic Acid , Single-Cell Analysis/methods , Stem Cells/metabolism , TranscriptomeABSTRACT
Understanding complex tissues requires single-cell deconstruction of gene regulation with precision and scale. Here, we assess the performance of a massively parallel droplet-based method for mapping transposase-accessible chromatin in single cells using sequencing (scATAC-seq). We apply scATAC-seq to obtain chromatin profiles of more than 200,000 single cells in human blood and basal cell carcinoma. In blood, application of scATAC-seq enables marker-free identification of cell type-specific cis- and trans-regulatory elements, mapping of disease-associated enhancer activity and reconstruction of trajectories of cellular differentiation. In basal cell carcinoma, application of scATAC-seq reveals regulatory networks in malignant, stromal and immune cells in the tumor microenvironment. Analysis of scATAC-seq profiles from serial tumor biopsies before and after programmed cell death protein 1 blockade identifies chromatin regulators of therapy-responsive T cell subsets and reveals a shared regulatory program that governs intratumoral CD8+ T cell exhaustion and CD4+ T follicular helper cell development. We anticipate that scATAC-seq will enable the unbiased discovery of gene regulatory factors across diverse biological systems.
Subject(s)
Bone Marrow Cells/metabolism , Chromatin/chemistry , Single-Cell Analysis/methods , T-Lymphocytes/metabolism , Cell Line , Computer Simulation , Gene Expression Regulation , Hematopoiesis , High-Throughput Nucleotide Sequencing , Humans , Leukocytes, Mononuclear , Transcription Factors/metabolismABSTRACT
The C-terminal domain (CTD) of RNA polymerase II (Pol II) consists of conserved heptapeptide repeats that function as a binding platform for different protein complexes involved in transcription, RNA processing, export, and chromatin remodeling. The CTD repeats are subject to sequential waves of posttranslational modifications during specific stages of the transcription cycle. These patterned modifications have led to the postulation of the "CTD code" hypothesis, where stage-specific patterns define a spatiotemporal code that is recognized by the appropriate interacting partners. Here, we highlight the role of CTD modifications in directing transcription initiation, elongation, and termination. We examine the major readers, writers, and erasers of the CTD code and examine the relevance of describing patterns of posttranslational modifications as a "code." Finally, we discuss major questions regarding the function of the newly discovered CTD modifications and the fundamental insights into transcription regulation that will necessarily emerge upon addressing those challenges.