RESUMO
Recent reports of a link between human- and primate-specific genetic factors and human/primate-specific characteristics and diseases necessitate genome-wide identification of those factors. We have previously reported core promoter short tandem repeats (STRs) of extreme length (≥6-repeats) that have expanded exceptionally in primates vs. non-primates, and may have a function in adaptive evolution. In the study reported here, we extended our study to the human STRs of ≥3-repeats in the category of penta and hexaucleotide STRs, across the entire human protein coding gene core promoters, and analyzed their status in several superorders and orders of vertebrates, using the Ensembl database. The ConSite software was used to identify the transcription factor (TF) sets binding to those STRs. STR specificity was observed at different levels of human and non-human primate (NHP) evolution. 73% of the pentanucleotide STRs and 68% of the hexanucleotide STRs were found to be specific to human and NHPs. AP-2alpha, Sp1, and MZF were the predominantly selected TFs (90%) binding to the human-specific STRs. Furthermore, the number of TF sets binding to a given STR was found to be a selection factor for that STR. Our findings indicate that selected STRs, the cognate binding TFs, and the number of TF set binding to those STRs function as switch codes at different levels of human and NHP evolution and speciation.