ABSTRACT
DNA methylation of CpG dinucleotides is an important epigenetic modification involved in the regulation of mammalian gene expression, with each type of cell developing a specific methylation profile during its differentiation. Recently, it has been shown that a small subgroup of transcription factors (TFs) might promote DNA demethylation at their binding sites. We developed a bioinformatics pipeline to predict from genome-wide DNA methylation data TFs that promote DNA demethylation at their binding site. We applied the pipeline to International Human Epigenome Consortium methylome data and selected 393 candidate transcription factor binding motifs and associated 383 TFs that are likely associated with DNA demethylation. Validation of a subset of the candidate TFs using an in vitro assay suggested that 28 of 49 TFs from various TF families had DNA-demethylation-promoting activity; TF families, such as bHLH and ETS, contained both TFs with and without the activity. The identified TFs showed large demethylated/methylated CpG ratios and their demethylated CpGs showed significant bias toward hypermethylation in original cells. Furthermore, the identified TFs promoted demethylation of distinct sets of CpGs, with slight overlap of the targeted CpGs among TF family members, which was consistent with the results of a gene ontology (GO) term analysis of the identified TFs. Gene expression analysis of the identified TFs revealed that multiple TFs from various families are specifically expressed in human cells and tissues. Together, our results suggest that a large number of TFs from various TF families are associated with cell-type-specific DNA demethylation during human cellular development.