RESUMO
RNA polymerase II (POL II) is responsible for the transcription of messenger RNAs (mRNAs) and long non-coding RNAs (lncRNAs). Previously, we have shown the evolutionary invariance of the structural features of DNA in the POL II core promoters of the precursors of mRNAs. In this work, we have analyzed the POL II core promoters of the precursors of lncRNAs in Homo sapiens and Mus musculus genomes. Structural analysis of nucleotide sequences in positions -50, +30 bp in relation to the TSS have shown the extremely heterogeneous 3D structure that includes two singular regions - hexanucleotide "INR" around the TSS and octanucleotide "TATA-box" at around ~-28 bp upstream. Thus, the 3D structure of core promoters of lncRNA resembles the architecture of the core promoters of mRNAs; however, textual analysis revealed differences between promoters of lncRNAs and promoters of mRNAs, which lies in their textual characteristics; namely, the informational entropy at each position of the nucleotide text of lncRNA core promoters (by the exception of singular regions) is significantly higher than that of the mRNA core promoters. Another distinguishing feature of lncRNA is the extremely rare occurrence in the TATA box of octanucleotides with the consensus sequence. These textual differences can significantly affect the efficiency of the transcription of lncRNAs.