A synergistic DNA logic predicts genome-wide chromatin accessibility.
Genome Res
; 26(10): 1430-1440, 2016 10.
Article
in En
| MEDLINE
| ID: mdl-27456004
Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types.
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Chromatin
/
Chromatin Assembly and Disassembly
/
Models, Genetic
Type of study:
Prognostic_studies
/
Risk_factors_studies
Limits:
Animals
/
Humans
Language:
En
Journal:
Genome Res
Journal subject:
BIOLOGIA MOLECULAR
/
GENETICA
Year:
2016
Type:
Article
Affiliation country:
United States