RESUMO
Long non-coding RNAs are a very versatile class of molecules that can have important roles in regulating a cells function, including regulating other genes on the transcriptional level. One of these mechanisms is that RNA can directly interact with DNA thereby recruiting additional components such as proteins to these sites via an RNA:dsDNA triplex formation. We genetically deleted the triplex forming sequence (FendrrBox) from the lncRNA Fendrr in mice and found that this FendrrBox is partially required for Fendrr function in vivo. We found that the loss of the triplex forming site in developing lungs causes a dysregulation of gene programs associated with lung fibrosis. A set of these genes contain a triplex site directly at their promoter and are expressed in lung fibroblasts. We biophysically confirmed the formation of an RNA:dsDNA triplex with target promoters in vitro. We found that Fendrr with the Wnt signalling pathway regulates these genes, implicating that Fendrr synergizes with Wnt signalling in lung fibrosis.
Assuntos
Fibrose Pulmonar , RNA Longo não Codificante , Animais , Camundongos , Fibrose , Pulmão/crescimento & desenvolvimento , Pulmão/metabolismo , Fibrose Pulmonar/genética , Fibrose Pulmonar/metabolismo , RNA Longo não Codificante/metabolismoRESUMO
Long non-coding RNAs (lncRNAs) are important regulators of gene expression and can associate with DNA as RNA : DNA heteroduplexes or RNA â DNA : DNA triple helix structures. Here, we review inâ vitro biochemical and biophysical experiments including electromobility shift assays (EMSA), circular dichroism (CD) spectroscopy, thermal melting analysis, microscale thermophoresis (MST), single-molecule Förster resonance energy transfer (smFRET) and nuclear magnetic resonance (NMR) spectroscopy to investigate RNA â DNA : DNA triple helix and RNA : DNA heteroduplex formation. We present the investigations of the antiparallel triplex-forming lncRNA MEG3 targeting the gene TGFB2 and the parallel triplex-forming lncRNA Fendrr with its target gene Emp2. The thermodynamic properties of these oligonucleotides lead to concentration-dependent heterogeneous mixtures, where a DNA duplex, an RNA : DNA heteroduplex and an RNA â DNA : DNA triplex coexist and their relative populations are modulated in a temperature-dependent manner. The inâ vitro data provide a reliable readout of triplex structures, as RNA â DNA : DNA triplexes show distinct features compared to DNA duplexes and RNA : DNA heteroduplexes. Our experimental results can be used to validate computationally predicted triple helix formation between novel disease-relevant lncRNAs and their DNA target genes.
Assuntos
DNA , Conformação de Ácido Nucleico , RNA Longo não Codificante , RNA Longo não Codificante/genética , RNA Longo não Codificante/química , RNA Longo não Codificante/metabolismo , DNA/química , DNA/genética , Humanos , Ácidos Nucleicos Heteroduplexes/química , RNA/química , RNA/genética , RNA/metabolismo , TermodinâmicaRESUMO
RNA.DNA:DNA triple helix (triplex) formation is a form of RNA-DNA interaction which regulates gene expression but is difficult to study experimentally in vivo. This makes accurate computational prediction of such interactions highly important in the field of RNA research. Current predictive methods use canonical Hoogsteen base pairing rules, which whilst biophysically valid, may not reflect the plastic nature of cell biology. Here, we present the first optimization approach to learn a probabilistic model describing RNA-DNA interactions directly from motifs derived from triplex sequencing data. We find that there are several stable interaction codes, including Hoogsteen base pairing and novel RNA-DNA base pairings, which agree with in vitro measurements. We implemented these findings in TriplexAligner, a program that uses the determined interaction codes to predict triplex binding. TriplexAligner predicts RNA-DNA interactions identified in all-to-all sequencing data more accurately than all previously published tools in human and mouse and also predicts previously studied triplex interactions with known regulatory functions. We further validated a novel triplex interaction using biophysical experiments. Our work is an important step towards better understanding of triplex formation and allows genome-wide analyses of RNA-DNA interactions.
Assuntos
Estudo de Associação Genômica Ampla , RNA , Humanos , Camundongos , Animais , RNA/genética , DNA/genética , DNA/metabolismo , Replicação do DNA , Conformação de Ácido NucleicoRESUMO
DNA:DNA:RNA triplexes that are formed through Hoogsteen base-pairing of the RNA in the major groove of the DNA duplex have been observed in vitro, but the extent to which these interactions occur in cells and how they impact cellular functions remains elusive. Using a combination of bioinformatic techniques, RNA/DNA pulldown and biophysical studies, we set out to identify functionally important DNA:DNA:RNA triplex-forming long non-coding RNAs (lncRNA) in human endothelial cells. The lncRNA HIF1α-AS1 was retrieved as a top hit. Endogenous HIF1α-AS1 reduces the expression of numerous genes, including EPH Receptor A2 and Adrenomedullin through DNA:DNA:RNA triplex formation by acting as an adapter for the repressive human silencing hub complex (HUSH). Moreover, the oxygen-sensitive HIF1α-AS1 is down-regulated in pulmonary hypertension and loss-of-function approaches not only result in gene de-repression but also enhance angiogenic capacity. As exemplified here with HIF1α-AS1, DNA:DNA:RNA triplex formation is a functionally important mechanism of trans-acting gene expression control.