RESUMEN
Regulation of gene expression is a complex but highly guided process. While genomic technologies and computational approaches have allowed high-throughput mapping of cis-regulatory elements (CREs) and their interactions in 3D, their precise role in regulating gene expression remains obscure. Recent complementary observations revealed that interactions between CREs frequently result in the formation of small-scale functional modules within topologically associating domains. Such chromatin modules likely emerge from a complex interplay between regulatory machineries assembled at CREs, including site-specific binding of transcription factors. Here, we review the methods that allow identifying chromatin modules, summarize possible mechanisms that steer CRE interactions within these modules, and discuss outstanding challenges to uncover how chromatin modules fit in our current understanding of the functional 3D genome.
Asunto(s)
Cromatina , Regulación de la Expresión Génica , Cromatina/genética , Regulación de la Expresión Génica/genética , Genoma/genética , Genómica , Secuencias Reguladoras de Ácidos Nucleicos/genéticaRESUMEN
Many enhancers control gene expression by assembling regulatory factor clusters, also referred to as condensates. This process is vital for facilitating enhancer communication and establishing cellular identity. However, how DNA sequence and transcription factor (TF) binding instruct the formation of high regulatory factor environments remains poorly understood. Here we developed a new approach leveraging enhancer-centric chromatin accessibility quantitative trait loci (caQTLs) to nominate regulatory factor clusters genome-wide. By analyzing TF-binding signatures within the context of caQTLs and comparing episomal versus endogenous enhancer activities, we discovered a class of regulators, 'context-only' TFs, that amplify the activity of cell type-specific caQTL-binding TFs, that is, 'context-initiator' TFs. Similar to super-enhancers, enhancers enriched for context-only TF-binding sites display high coactivator binding and sensitivity to bromodomain-inhibiting molecules. We further show that binding sites for context-only and context-initiator TFs underlie enhancer coordination, providing a mechanistic rationale for how a loose TF syntax confers regulatory specificity.
Asunto(s)
Cromatina , Elementos de Facilitación Genéticos , Sitios de Carácter Cuantitativo , Factores de Transcripción , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , Humanos , Sitios de Unión/genética , Cromatina/genética , Cromatina/metabolismo , Unión Proteica , Animales , Regulación de la Expresión Génica , RatonesRESUMEN
BACKGROUND: Interactions among cis-regulatory elements (CREs) play a crucial role in gene regulation. Various approaches have been developed to map these interactions genome-wide, including those relying on interindividual epigenomic variation to identify groups of covariable regulatory elements, referred to as chromatin modules (CMs). While CM mapping allows to investigate the relationship between chromatin modularity and gene expression, the computational principles used for CM identification vary in their application and outcomes. RESULTS: We comprehensively evaluate and streamline existing CM mapping tools and present guidelines for optimal utilization of epigenome data from a diverse population of individuals to assess regulatory coordination across the human genome. We showcase the effectiveness of our recommended practices by analyzing distinct cell types and demonstrate cell type specificity of CRE interactions in CMs and their relevance for gene expression. Integration of genotype information revealed that many non-coding disease-associated variants affect the activity of CMs in a cell type-specific manner by affecting the binding of cell type-specific transcription factors. We provide example cases that illustrate in detail how CMs can be used to deconstruct GWAS loci, assess variable expression of cell surface receptors in immune cells, and reveal how genetic variation can impact the expression of prognostic markers in chronic lymphocytic leukemia. CONCLUSIONS: Our study presents an optimal strategy for CM mapping and reveals how CMs capture the coordination of CREs and its impact on gene expression. Non-coding genetic variants can disrupt this coordination, and we highlight how this may lead to disease predisposition in a cell type-specific manner.