RESUMEN
The relative importance of regulation at the mRNA versus protein level is subject to ongoing debate. To address this question in a dynamic system, we mapped proteomic and transcriptomic changes in mammalian cells responding to stress induced by dithiothreitol over 30 h. Specifically, we estimated the kinetic parameters for the synthesis and degradation of RNA and proteins, and deconvoluted the response patterns into common and unique to each regulatory level using a new statistical tool. Overall, the two regulatory levels were equally important, but differed in their impact on molecule concentrations. Both mRNA and protein changes peaked between two and eight hours, but mRNA expression fold changes were much smaller than those of the proteins. mRNA concentrations shifted in a transient, pulse-like pattern and returned to values close to pre-treatment levels by the end of the experiment. In contrast, protein concentrations switched only once and established a new steady state, consistent with the dominant role of protein regulation during misfolding stress. Finally, we generated hypotheses on specific regulatory modes for some genes.
Asunto(s)
Regulación de la Expresión Génica/genética , Biosíntesis de Proteínas/genética , ARN Mensajero/biosíntesis , Transcripción Genética , Animales , Cinética , Mamíferos , Pliegue de Proteína , Procesamiento Proteico-Postraduccional , Proteómica , ARN Mensajero/genéticaRESUMEN
Chromatin accessibility is integral to the process by which transcription factors (TFs) read out cis-regulatory DNA sequences, but it is difficult to differentiate between TFs that drive accessibility and those that do not. Deep learning models that learn complex sequence rules provide an unprecedented opportunity to dissect this problem. Using zygotic genome activation in Drosophila as a model, we analyzed high-resolution TF binding and chromatin accessibility data with interpretable deep learning and performed genetic validation experiments. We identify a hierarchical relationship between the pioneer TF Zelda and the TFs involved in axis patterning. Zelda consistently pioneers chromatin accessibility proportional to motif affinity, whereas patterning TFs augment chromatin accessibility in sequence contexts where they mediate enhancer activation. We conclude that chromatin accessibility occurs in two tiers: one through pioneering, which makes enhancers accessible but not necessarily active, and the second when the correct combination of TFs leads to enhancer activation.
RESUMEN
The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)-nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.
Asunto(s)
Biología Computacional/métodos , Motivos de Nucleótidos , Factores de Transcripción/metabolismo , Animales , Sitios de Unión , Inmunoprecipitación de Cromatina , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Aprendizaje Profundo , Ratones , Células Madre Embrionarias de Ratones/fisiología , Proteína Homeótica Nanog/metabolismo , Redes Neurales de la Computación , Factor 3 de Transcripción de Unión a Octámeros/metabolismo , Reproducibilidad de los Resultados , Factores de Transcripción SOXB1/metabolismoRESUMEN
Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-supervised learning algorithm that predicts gene expression patterns based on enriched sequence features. Predicted expression patterns were 73-98% accurate, predicted assignments showed strong Hi-C interaction enrichment, enhancer-associated histone modifications were evident, and known functional motifs were recovered. Our model provides a general framework to link globally identified enhancers to targets and contributes to deciphering the regulatory genome.