A flexible repertoire of transcription factor binding sites and a diversity threshold determines enhancer activity in embryonic stem cells.
Genome Res
; 31(4): 564-575, 2021 04.
Article
in En
| MEDLINE
| ID: mdl-33712417
ABSTRACT
Transcriptional enhancers are critical for development and phenotype evolution and are often mutated in disease contexts; however, even in well-studied cell types, the sequence code conferring enhancer activity remains unknown. To examine the enhancer regulatory code for pluripotent stem cells, we identified genomic regions with conserved binding of multiple transcription factors in mouse and human embryonic stem cells (ESCs). Examination of these regions revealed that they contain on average 12.6 conserved transcription factor binding site (TFBS) sequences. Enriched TFBSs are a diverse repertoire of 70 different sequences representing the binding sequences of both known and novel ESC regulators. Using a diverse set of TFBSs from this repertoire was sufficient to construct short synthetic enhancers with activity comparable to native enhancers. Site-directed mutagenesis of conserved TFBSs in endogenous enhancers or TFBS deletion from synthetic sequences revealed a requirement for 10 or more different TFBSs. Furthermore, specific TFBSs, including the POU5F1SOX2 comotif, are dispensable, despite cobinding the POU5F1 (also known as OCT4), SOX2, and NANOG master regulators of pluripotency. These findings reveal that a TFBS sequence diversity threshold overrides the need for optimized regulatory grammar and individual TFBSs that recruit specific master regulators.
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Transcription Factors
/
Enhancer Elements, Genetic
/
Embryonic Stem Cells
Type of study:
Prognostic_studies
Limits:
Animals
/
Humans
Language:
En
Journal:
Genome Res
Year:
2021
Document type:
Article