RESUMO
Circular RNAs (circRNAs), formed by non-sequential back-splicing of pre-mRNA transcripts, are a widespread form of non-coding RNA in animal cells. However, it is unclear whether the majority of circRNAs represent splicing by-products without function or are produced in a regulated manner to carry out specific cellular functions. We show that hundreds of circRNAs are regulated during human epithelial-mesenchymal transition (EMT) and find that the production of over one-third of abundant circRNAs is dynamically regulated by the alternative splicing factor, Quaking (QKI), which itself is regulated during EMT. Furthermore, by modulating QKI levels, we show the effect on circRNA abundance is dependent on intronic QKI binding motifs. Critically, the addition of QKI motifs is sufficient to induce de novo circRNA formation from transcripts that are normally linearly spliced. These findings demonstrate circRNAs are both purposefully synthesized and regulated by cell-type specific mechanisms, suggesting they play specific biological roles in EMT.
Assuntos
Transição Epitelial-Mesenquimal , Proteínas de Ligação a RNA/metabolismo , RNA/metabolismo , Linhagem Celular , Éxons , Humanos , Íntrons , Splicing de RNA , RNA CircularRESUMO
While the majority of circRNAs are formed from infrequent back-splicing of exons from protein coding genes, some can be produced at quite high level and in a regulated manner. We describe the regulation, biogenesis and function of circDOCK1(2-27), a large, abundant circular RNA that is highly regulated during epithelial-mesenchymal transition (EMT) and whose formation depends on the epithelial splicing regulator ESRP1. CircDOCK1(2-27) synthesis in epithelial cells represses cell motility both by diverting transcripts from DOCK1 mRNA production to circRNA formation and by direct inhibition of migration by the circRNA. HITS-CLIP analysis and CRISPR-mediated deletions indicate ESRP1 controls circDOCK1(2-27) biosynthesis by binding a GGU-containing repeat region in intron 1 and detaining its splicing until Pol II completes its 157 kb journey to exon 27. Proximity-dependent biotinylation (BioID) assay suggests ESRP1 may modify the RNP landscape of intron 1 in a way that disfavours communication of exon 1 with exon 2, rather than physically bridging exon 2 to exon 27. The X-ray crystal structure of RNA-bound ESRP1 qRRM2 domain reveals it binds to GGU motifs, with the guanines embedded in clamp-like aromatic pockets in the protein.
Assuntos
Processamento Alternativo , RNA Circular , Proteínas de Ligação a RNA , Proteínas rac de Ligação ao GTP , RNA/genética , RNA/metabolismo , Splicing de RNA , RNA Circular/genética , Humanos , Linhagem Celular Tumoral , Proteínas de Ligação a RNA/metabolismo , Proteínas rac de Ligação ao GTP/genética , Proteínas rac de Ligação ao GTP/metabolismoRESUMO
Over the past decade, circular RNA (circRNA) research has evolved into a bona fide research field shedding light on the functional consequence of this unique family of RNA molecules in cancer. Although the method of formation and the abundance of circRNAs can differ from their cognate linear mRNA, the spectrum of interacting partners and their resultant cellular functions in oncogenesis are analogous. However, with 10 times more diversity in circRNA variants compared with linear RNA variants, combined with their hyperstability in the cell, circRNAs are equipped to influence every stage of oncogenesis. This is an opportune time to address the breadth of circRNA in cancer focused on their spatiotemporal expression, mutations in biogenesis factors and contemporary functions through each stage of cancer. In this Review, we highlight examples of functional circRNAs in specific cancers, which satisfy critical criteria, including their physical co-association with the target and circRNA abundance at stoichiometrically valid quantities. These considerations are essential to develop strategies for the therapeutic exploitation of circRNAs as biomarkers and targeted anticancer agents.
Assuntos
Neoplasias , RNA Circular , RNA Circular/genética , Humanos , Neoplasias/genética , Biomarcadores Tumorais/genética , Regulação Neoplásica da Expressão Gênica , AnimaisRESUMO
High-throughput RNA sequencing enables the quantification of transcript abundance and the identification of novel transcripts in biological samples. These include circular RNAs (circRNAs), a family of alternatively spliced RNA molecules that form a continuous loop. However, quantification and comparison of circRNAs between RNA sequencing libraries remain challenging due to confounding errors introduced during exonuclease digestion, library preparation and RNA sequencing itself. Here we describe a set of synthetic circRNA spike-ins-termed 'SynCRS'-that can be added directly to purified RNA samples before exonuclease digestion and library preparation. SynCRS, introduced either individually or in combinations of varying size and abundance, can be integrated into all next-generation sequencing workflows and, critically, facilitate the quantitative calibration of circRNA transcript abundance between samples, tissue types, species and laboratories. Our step-by-step protocol details the generation of SynCRS and guides users on the stoichiometry of SynCRS spike-in to RNA samples, followed by the bioinformatic steps required to facilitate quantitative comparisons of circRNAs between libraries. The laboratory steps to produce the SynCRS require an additional 3 d on top of the high throughput circRNA sequencing and bioinformatics. The protocol is suitable for users with basic experience in molecular biology and bioinformatics.
RESUMO
Molecular surveillance of viral pathogens and inference of transmission networks from genomic data play an increasingly important role in public health efforts, especially for HIV-1. For many methods, the genetic distance threshold used to connect sequences in the transmission network is a key parameter informing the properties of inferred networks. Using a distance threshold that is too high can result in a network with many spurious links, making it difficult to interpret. Conversely, a distance threshold that is too low can result in a network with too few links, which may not capture key insights into clusters of public health concern. Published research using the HIV-TRACE software package frequently uses the default threshold of 0.015 substitutions/site for HIV pol gene sequences, but in many cases, investigators heuristically select other threshold parameters to better capture the underlying dynamics of the epidemic they are studying. Here, we present a general heuristic scoring approach for tuning a distance threshold adaptively, which seeks to prevent the formation of giant clusters. We prioritize the ratio of the sizes of the largest and the second largest cluster, maximizing the number of clusters present in the network. We apply our scoring heuristic to outbreaks with different characteristics, such as regional or temporal variability, and demonstrate the utility of using the scoring mechanism's suggested distance threshold to identify clusters exhibiting risk factors that would have otherwise been more difficult to identify. For example, while we found that a 0.015 substitutions/site distance threshold is typical for US-like epidemics, recent outbreaks like the CRF07_BC subtype among men who have sex with men (MSM) in China have been found to have a lower optimal threshold of 0.005 to better capture the transition from injected drug use (IDU) to MSM as the primary risk factor. Alternatively, in communities surrounding Lake Victoria in Uganda, where there has been sustained heterosexual transmission for many years, we found that a larger distance threshold is necessary to capture a more risk factor-diverse population with sparse sampling over a longer period of time. Such identification may allow for more informed intervention action by respective public health officials.
RESUMO
Circular RNAs (circRNAs) are a widespread, cell-, tissue-, and disease-specific class of largely non-coding RNA transcripts. These single-stranded, covalently-closed transcripts arise through non-canonical splicing of pre-mRNA, a process called back-splicing. Back-splicing results in circRNAs which are distinguishable from their cognate mRNA as they possess a unique sequence of nucleic acids called the backsplice junction (BSJ). CircRNAs have been shown to play key functional roles in various cellular contexts and achieve this through their interaction with other macromolecules, particularly other RNA molecules and proteins. To elucidate the molecular mechanisms underlying circRNA function, it is necessary to identify these interacting partners. Herein, we present an optimized strategy for the simultaneous purification of the circRNA interactome within eukaryotic cells, allowing the identification of both circRNA-RNA and circRNA-protein interactions.
RESUMO
Circular RNAs (circRNAs) are a class of single-stranded, covalently closed RNA that contain a unique back-splice junction (bsj) sequence created by the ligation of their 5' and 3' ends via spliceosome-catalyzed back-splicing. A key step in illuminating the cellular roles of specific circRNAs is via increasing their expression. This is frequently done by transfecting cells with plasmid DNA containing cloned exons from which the circRNA is transcribed, flanked by sequences that promote back-splicing. We observed that commonly used plasmids lead to the production of circRNAs with molecular scars at the circRNA bsj. Stepwise redesign of the cloning vector corrected this problem, ensuring bona fide circRNAs are produced with their natural bsj at high efficiency. The fidelity of circRNAs produced from this new construct was validated by RNA sequencing and also functionally validated. To increase the utility of this modified resource for expressing circRNA, we developed an expanded set of vectors incorporating this design that (i) enables selection with a variety of antibiotics and fluorescent proteins, (ii) employs a range of promoters varying in promoter strength and (iii) generated a complementary set of lentiviral plasmids for difficult-to-transfect cells. These resources provide a novel and versatile toolkit for high-efficiency and scarless overexpression of circular RNAs that fulfill a critical need for the investigation of circRNA function.
RESUMO
Trinucleotide repeat disorders comprise ~20 severe, inherited, human neuromuscular and neurodegenerative disorders, which result from an abnormal expansion of repetitive sequences in the DNA. The most common of these, Huntington's disease (HD), results from expansion of the CAG repeat region in exon 1 of the HTT gene via an unknown mechanism. Since non-coding RNAs have been implicated in the initiation and progression of many diseases, herein we focused on a circular RNA (circRNA) molecule arising from non-canonical splicing (backsplicing) of HTT pre-mRNA. The most abundant circRNA from HTT, circHTT(2-6), was found to be more highly expressed in the frontal cortex of HD patients, compared with healthy controls, and positively correlated with CAG repeat tract length. Furthermore, the mouse orthologue (mmu_circHTT(2-6)) was found to be enriched within the brain and specifically the striatum, a region enriched for medium spiny neurons that are preferentially lost in HD. Transgenic overexpression of circHTT(2-6) in two human cell lines-SH-SY5Y and HEK293-reduced cell proliferation and nuclear size without affecting cell cycle progression or cellular size, or altering the CAG repeat region length within HTT. CircHTT(2-6) overexpression did not alter total HTT protein levels, but reduced its nuclear localisation. As these phenotypic and genotypic changes resemble those observed in HD patients, our results suggest that circHTT(2-6) may play a functional role in the pathophysiology of this disease.
Assuntos
Doença de Huntington , Neuroblastoma , Humanos , Camundongos , Animais , Doença de Huntington/metabolismo , RNA Circular/genética , Células HEK293 , Animais Geneticamente ModificadosRESUMO
The first step of oncogenesis is the acquisition of a repertoire of genetic mutations to initiate and sustain the malignancy. An important example of this initiation phase in acute leukemias is the formation of a potent oncogene by chromosomal translocations between the mixed lineage leukemia (MLL) gene and one of 100 translocation partners, known as the MLL recombinome. Here, we show that circular RNAs (circRNAs)-a family of covalently closed, alternatively spliced RNA molecules-are enriched within the MLL recombinome and can bind DNA, forming circRNA:DNA hybrids (circR loops) at their cognate loci. These circR loops promote transcriptional pausing, proteasome inhibition, chromatin re-organization, and DNA breakage. Importantly, overexpressing circRNAs in mouse leukemia xenograft models results in co-localization of genomic loci, de novo generation of clinically relevant chromosomal translocations mimicking the MLL recombinome, and hastening of disease onset. Our findings provide fundamental insight into the acquisition of chromosomal translocations by endogenous RNA carcinogens in leukemia.
Assuntos
Leucemia , Translocação Genética , Animais , Camundongos , Humanos , RNA Circular/genética , Proteína de Leucina Linfoide-Mieloide/genética , Proteína de Leucina Linfoide-Mieloide/metabolismo , Leucemia/genética , Leucemia/patologia , DNA , Proteínas de Fusão Oncogênica/genéticaRESUMO
High-throughput RNA sequencing (RNA-seq) and dedicated bioinformatics pipelines have synergized to identify an expansive repertoire of unique circular RNAs (circRNAs), exceeding 100,000 variants. While the vast majority of these circRNAs comprise canonical exonic and intronic sequences, microexons (MEs)-which occur in 30% of functional mRNA transcripts-have been entirely overlooked. CircRNAs which contain these known MEs (ME-circRNAs) could be identified with commonly utilized circRNA prediction pipelines, CIRCexplorer2 and CIRI2, but were not previously recognized as ME-circRNAs. In addition, when employing a bespoke bioinformatics pipeline for identifying RNA chimeras, called Hyb, we could also identify over 2000 ME-circRNAs which contain novel MEs at their backsplice junctions, that are uncalled by either CIRCexplorer2 or CIRI2. Analysis of circRNA-seq datasets from gliomas of varying clinical grades compared with matched control tissue has shown circRNAs have potential as prognostic markers for stratifying tumor from healthy tissue. Furthermore, the abundance of microexon-containing circRNAs (ME-circRNAs) between tumor and normal tissues is correlated with the expression of a splicing associated factor, Serine/arginine repetitive matrix 4 (SRRM4). Overexpressing SRRM4, known for regulating ME inclusion in mRNAs critical for neural differentiation, in human HEK293 cells resulted in the biogenesis of over 2000 novel ME-circRNAs, including ME-circEIF4G3, and changes in the abundance of many canonical circRNAs, including circSETDB2 and circLBRA. This shows SRRM4, in which its expression is correlated with poor prognosis in gliomas, acts as a bona fide circRNA biogenesis factor. Given the known roles of MEs and circRNAs in oncogenesis, the identification of these previously unrecognized ME-circRNAs further increases the complexity and functional purview of this non-coding RNA family.
Assuntos
Biologia Computacional , Éxons/genética , MicroRNAs/genética , Proteínas do Tecido Nervoso/genética , RNA Circular/metabolismo , Processamento Alternativo , Biologia Computacional/métodos , Éxons/fisiologia , Células HEK293 , Humanos , Proteínas do Tecido Nervoso/metabolismo , RNA Circular/genética , RNA Mensageiro/genéticaRESUMO
Circular RNAs (circRNAs) are a diverse and abundant class of hyper-stable, non-canonical RNAs that arise through a form of alternative splicing (AS) called back-splicing. These single-stranded, covalently-closed circRNA molecules have been identified in all eukaryotic kingdoms of life1, yet their functions have remained elusive. Here, we report that circRNAs can be used as bona fide biomarkers of functional, exon-skipped AS variants in Arabidopsis, including in the homeotic MADS-box transcription factor family. Furthermore, we demonstrate that circRNAs derived from exon 6 of the SEPALLATA3 (SEP3) gene increase abundance of the cognate exon-skipped AS variant (SEP3.3 which lacks exon 6), in turn driving floral homeotic phenotypes. Toward demonstrating the underlying mechanism, we show that the SEP3 exon 6 circRNA can bind strongly to its cognate DNA locus, forming an RNA:DNA hybrid, or R-loop, whereas the linear RNA equivalent bound significantly more weakly to DNA. R-loop formation results in transcriptional pausing, which has been shown to coincide with splicing factor recruitment and AS2-4. This report presents a novel mechanistic insight for how at least a subset of circRNAs probably contribute to increased splicing efficiency of their cognate exon-skipped messenger RNA and provides the first evidence of an organismal-level phenotype mediated by circRNA manipulation.
Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , DNA Circular/genética , DNA de Plantas/genética , Proteínas de Homeodomínio/genética , Splicing de RNA/genética , RNA Mensageiro/genética , Fatores de Transcrição/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , DNA Circular/metabolismo , DNA de Plantas/metabolismo , Proteínas de Homeodomínio/metabolismo , RNA Mensageiro/metabolismo , Fatores de Transcrição/metabolismoRESUMO
BACKGROUND: An important step in characterising the function of a gene is identifying the cells in which it is expressed. Traditional methods to determine this include in situ hybridisation, gene promoter-reporter fusions or cell isolation/purification techniques followed by quantitative PCR. These methods, although frequently used, can have limitations including their time-consuming nature, limited specificity, reliance upon well-annotated promoters, high cost, and the need for specialized equipment. In situ PCR is a relatively simple and rapid method that involves the amplification of specific mRNA directly within plant tissue whilst incorporating labelled nucleotides that are subsequently detected by immunohistochemistry. Another notable advantage of this technique is that it can be used on plants that are not easily genetically transformed. RESULTS: An optimised workflow for in-tube and on-slide in situ PCR is presented that has been evaluated using multiple plant species and tissue types. The protocol includes optimised methods for: (i) fixing, embedding, and sectioning of plant tissue; (ii) DNase treatment; (iii) in situ RT-PCR with the incorporation of DIG-labelled nucleotides; (iv) signal detection using colourimetric alkaline phosphatase substrates; and (v) mounting and microscopy. We also provide advice on troubleshooting and the limitations of using fluorescence as an alternative detection method. Using our protocol, reliable results can be obtained within two days from harvesting plant material. This method requires limited specialized equipment and can be adopted by any laboratory with a vibratome (vibrating blade microtome), a standard thermocycler, and a microscope. We show that the technique can be used to localise gene expression with cell-specific resolution. CONCLUSIONS: The in situ PCR method presented here is highly sensitive and specific. It reliably identifies the cellular expression pattern of even highly homologous and low abundance transcripts within target tissues, and can be completed within two days of harvesting tissue. As such, it has considerable advantages over other methods, especially in terms of time and cost. We recommend its adoption as the standard laboratory technique of choice for demonstrating the cellular expression pattern of a gene of interest.
RESUMO
The effect of single actinobacterial endophyte seed inoculants and a mixed microbial soil inoculant on the indigenous endophytic actinobacterial population in wheat roots was investigated by using the molecular technique terminal restriction fragment length polymorphism (T-RFLP). Wheat was cultivated either from seeds coated with the spores of single pure actinobacterial endophytes of Microbispora sp. strain EN2, Streptomyces sp. strain EN27, and Nocardioides albus EN46 or from untreated seeds sown in soil with and without a commercial mixed microbial soil inoculant. The endophytic actinobacterial population within the roots of 6-week-old wheat plants was assessed by T-RFLP. Colonization of the wheat roots by the inoculated actinobacterial endophytes was detected by T-RFLP, as were 28 to 42 indigenous actinobacterial genera present in the inoculated and uninoculated plants. The presence of the commercial mixed inoculant in the soil reduced the endophytic actinobacterial diversity from 40 genera to 21 genera and reduced the detectable root colonization by approximately half. The results indicate that the addition of a nonadapted microbial inoculum to the soil disrupted the natural actinobacterial endophyte population, reducing diversity and colonization levels. This was in contrast to the addition of a single actinobacterial endophyte to the wheat plant, where the increase in colonization level could be confirmed even though the indigenous endophyte population was not adversely affected.
Assuntos
Actinomycetales/crescimento & desenvolvimento , Ecossistema , Raízes de Plantas/microbiologia , Polimorfismo de Fragmento de Restrição , Microbiologia do Solo , Triticum/microbiologia , Actinomycetales/classificação , Actinomycetales/genética , DNA Ribossômico/análise , Raízes de Plantas/crescimento & desenvolvimento , RNA Ribossômico 16S/genética , Sementes/crescimento & desenvolvimento , Sementes/microbiologia , Análise de Sequência de DNA , Esporos Bacterianos/crescimento & desenvolvimento , Streptomyces/classificação , Streptomyces/genética , Triticum/crescimento & desenvolvimentoRESUMO
The endophytic actinobacterial population in the roots of wheat grown in three different soils obtained from the southeast part of South Australia was investigated by terminal restriction fragment length polymorphism (T-RFLP) analysis of the amplified 16S rRNA genes. A new, validated approach was applied to the T-RFLP analysis in order to estimate, to the genus level, the actinobacterial population that was identified. Actinobacterium-biased primers were used together with three restriction enzymes to obtain terminal restriction fragments (TRFs). The TRFs were matched to bacterial genera by the T-RFLP Analysis Program, and the data were analyzed to validate and semiquantify the genera present within the plant roots. The highest diversity and level of endophytic colonization were found in the roots of wheat grown in a dark loam from Swedes Flat, and the lowest were found in water-repellent sand from Western Flat. This molecular approach detected a greater diversity of actinobacteria than did previous culture-dependent methods, with the predominant genera being Mycobacterium (21.02%) in Swedes Flat, Streptomyces (14.35%) in Red Loam, and Kitasatospora (15.02%) in Western Flat. This study indicates that the soil that supported a higher number of indigenous organisms resulted in wheat roots with higher actinobacterial diversity and levels of colonization within the plant tissue. Sequencing of 16S rRNA clones, obtained using the same actinobacterium-biased PCR primers that were used in the T-RFLP analysis, confirmed the presence of the actinobacterial diversity and identified a number of Mycobacterium and Streptomyces species.