RESUMO
mRNAs can fold into complex structures that regulate gene expression. Resolving such structures de novo has remained challenging and has limited our understanding of the prevalence and functions of mRNA structure. We use SHAPE-MaP experiments in living E. coli cells to derive quantitative, nucleotide-resolution structure models for 194 endogenous transcripts encompassing approximately 400 genes. Individual mRNAs have exceptionally diverse architectures, and most contain well-defined structures. Active translation destabilizes mRNA structure in cells. Nevertheless, mRNA structure remains similar between in-cell and cell-free environments, indicating broad potential for structure-mediated gene regulation. We find that the translation efficiency of endogenous genes is regulated by unfolding kinetics of structures overlapping the ribosome binding site. We discover conserved structured elements in 35% of UTRs, several of which we validate as novel protein binding motifs. RNA structure regulates every gene studied here in a meaningful way, implying that most functional structures remain to be discovered.
Assuntos
Técnicas de Amplificação de Ácido Nucleico/métodos , RNA Mensageiro/metabolismo , Algoritmos , Sítios de Ligação , Sistema Livre de Células , Primers do DNA/metabolismo , Ensaio de Desvio de Mobilidade Eletroforética , Entropia , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Modelos Moleculares , Conformação de Ácido Nucleico , Biossíntese de Proteínas , Dobramento de RNA , RNA Mensageiro/química , Proteínas Recombinantes/biossíntese , Proteínas Recombinantes/química , Proteínas Recombinantes/isolamento & purificação , Ribossomos/química , Ribossomos/metabolismo , Regiões não TraduzidasRESUMO
7SK is a conserved noncoding RNA that regulates transcription by sequestering the transcription factor P-TEFb. 7SK function entails complex changes in RNA structure, but characterizing RNA dynamics in cells remains an unsolved challenge. We developed a single-molecule chemical probing strategy, DANCE-MaP (deconvolution and annotation of ribonucleic conformational ensembles), that defines per-nucleotide reactivity, direct base pairing interactions, tertiary interactions, and thermodynamic populations for each state in RNA structural ensembles from a single experiment. DANCE-MaP reveals that 7SK RNA encodes a large-scale structural switch that couples dissolution of the P-TEFb binding site to structural remodeling at distal release factor binding sites. The 7SK structural equilibrium shifts in response to cell growth and stress and can be targeted to modulate expression of P-TEFbresponsive genes. Our study reveals that RNA structural dynamics underlie 7SK function as an integrator of diverse cellular signals to control transcription and establishes the power of DANCE-MaP to define RNA dynamics in cells.
Assuntos
Fator B de Elongação Transcricional Positiva , Proteínas de Ligação a RNA , Sítios de Ligação/genética , Células HeLa , Humanos , Fator B de Elongação Transcricional Positiva/genética , RNA Nuclear Pequeno/genética , RNA não Traduzido , Proteínas de Ligação a RNA/genéticaRESUMO
Translational reprogramming allows organisms to adapt to changing conditions. Upstream start codons (uAUGs), which are prevalently present in mRNAs, have crucial roles in regulating translation by providing alternative translation start sites1-4. However, what determines this selective initiation of translation between conditions remains unclear. Here, by integrating transcriptome-wide translational and structural analyses during pattern-triggered immunity in Arabidopsis, we found that transcripts with immune-induced translation are enriched with upstream open reading frames (uORFs). Without infection, these uORFs are selectively translated owing to hairpins immediately downstream of uAUGs, presumably by slowing and engaging the scanning preinitiation complex. Modelling using deep learning provides unbiased support for these recognizable double-stranded RNA structures downstream of uAUGs (which we term uAUG-ds) being responsible for the selective translation of uAUGs, and allows the prediction and rational design of translating uAUG-ds. We found that uAUG-ds-mediated regulation can be generalized to human cells. Moreover, uAUG-ds-mediated start-codon selection is dynamically regulated. After immune challenge in plants, induced RNA helicases that are homologous to Ded1p in yeast and DDX3X in humans resolve these structures, allowing ribosomes to bypass uAUGs to translate downstream defence proteins. This study shows that mRNA structures dynamically regulate start-codon selection. The prevalence of this RNA structural feature and the conservation of RNA helicases across kingdoms suggest that mRNA structural remodelling is a general feature of translational reprogramming.
Assuntos
Códon de Iniciação , Conformação de Ácido Nucleico , RNA de Cadeia Dupla , RNA Mensageiro , Humanos , Arabidopsis/genética , Arabidopsis/imunologia , Códon de Iniciação/genética , Reconhecimento da Imunidade Inata , Fases de Leitura Aberta/genética , Biossíntese de Proteínas/genética , Biossíntese de Proteínas/imunologia , Ribossomos/metabolismo , RNA de Cadeia Dupla/química , RNA de Cadeia Dupla/genética , RNA de Cadeia Dupla/metabolismo , RNA Mensageiro/genética , Transcriptoma , RNA Helicases DEAD-box/química , RNA Helicases DEAD-box/genética , RNA Helicases DEAD-box/metabolismo , Aprendizado ProfundoRESUMO
We report that the SARS-CoV-2 nucleocapsid protein (N-protein) undergoes liquid-liquid phase separation (LLPS) with viral RNA. N-protein condenses with specific RNA genomic elements under physiological buffer conditions and condensation is enhanced at human body temperatures (33°C and 37°C) and reduced at room temperature (22°C). RNA sequence and structure in specific genomic regions regulate N-protein condensation while other genomic regions promote condensate dissolution, potentially preventing aggregation of the large genome. At low concentrations, N-protein preferentially crosslinks to specific regions characterized by single-stranded RNA flanked by structured elements and these features specify the location, number, and strength of N-protein binding sites (valency). Liquid-like N-protein condensates form in mammalian cells in a concentration-dependent manner and can be altered by small molecules. Condensation of N-protein is RNA sequence and structure specific, sensitive to human body temperature, and manipulatable with small molecules, and therefore presents a screenable process for identifying antiviral compounds effective against SARS-CoV-2.
Assuntos
COVID-19/metabolismo , Proteínas do Nucleocapsídeo de Coronavírus/metabolismo , Genoma Viral , Nucleocapsídeo/metabolismo , RNA Viral/metabolismo , SARS-CoV-2/metabolismo , Animais , Antivirais/farmacologia , COVID-19/genética , Chlorocebus aethiops , Proteínas do Nucleocapsídeo de Coronavírus/genética , Avaliação Pré-Clínica de Medicamentos , Células HEK293 , Humanos , Nucleocapsídeo/genética , Fosfoproteínas/genética , Fosfoproteínas/metabolismo , SARS-CoV-2/genética , Células Vero , Tratamento Farmacológico da COVID-19RESUMO
Chemical probing technologies enable high-throughput examination of diverse structural features of RNA, including local nucleotide flexibility, RNA secondary structure, protein and ligand binding, through-space interaction networks, and multistate structural ensembles. Deep understanding of RNA structure-function relationships typically requires evaluating a system under structure- and function-altering conditions, linking these data with additional information, and visualizing multilayered relationships. Current platforms lack the broad accessibility, flexibility and efficiency needed to iterate on integrative analyses of these diverse, complex data. Here, we share the RNA visualization and graphical analysis toolset RNAvigate, a straightforward and flexible Python library that automatically parses 21 standard file formats (primary sequence annotations, per- and internucleotide data, and secondary and tertiary structures) and outputs 18 plot types. RNAvigate enables efficient exploration of nuanced relationships between multiple layers of RNA structure information and across multiple experimental conditions. Compatibility with Jupyter notebooks enables nonburdensome, reproducible, transparent and organized sharing of multistep analyses and data visualization strategies. RNAvigate simplifies and accelerates discovery and characterization of RNA-centric functions in biology.
Assuntos
RNA , Software , RNA/genética , RNA/química , Conformação de Ácido Nucleico , Análise de Sequência de RNARESUMO
The genomes of RNA viruses encode the information required for replication in host cells both in their linear sequence and in complex higher-order structures. A subset of these RNA genome structures show clear sequence conservation, and have been extensively described for well-characterized viruses. However, the extent to which viral RNA genomes contain functional structural elements-unable to be detected by sequence alone-that nonetheless are critical to viral fitness is largely unknown. Here, we devise a structure-first experimental strategy and use it to identify 22 structure-similar motifs across the coding sequences of the RNA genomes for the four dengue virus serotypes. At least 10 of these motifs modulate viral fitness, revealing a significant unnoticed extent of RNA structure-mediated regulation within viral coding sequences. These viral RNA structures promote a compact global genome architecture, interact with proteins, and regulate the viral replication cycle. These motifs are also thus constrained at the levels of both RNA structure and protein sequence and are potential resistance-refractory targets for antivirals and live-attenuated vaccines. Structure-first identification of conserved RNA structure enables efficient discovery of pervasive RNA-mediated regulation in viral genomes and, likely, other cellular RNAs.
Assuntos
Dengue , Vírus de RNA , Humanos , Conformação de Ácido Nucleico , RNA Viral/metabolismo , Vírus de RNA/genética , Genoma Viral/genética , Replicação Viral/genéticaRESUMO
The transcriptome represents an attractive but underused set of targets for small-molecule ligands. Here, we devise a technology that leverages fragment-based screening and SHAPE-MaP RNA structure probing to discover small-molecule fragments that bind an RNA structure of interest. We identified fragments and cooperatively binding fragment pairs that bind to the thiamine pyrophosphate (TPP) riboswitch with millimolar to micromolar affinities. We then used structure-activity relationship information to efficiently design a linked-fragment ligand, with no resemblance to the native ligand, with high ligand efficiency and druglikeness, that binds to the TPP thiM riboswitch with high nanomolar affinity and that modulates RNA conformation during cotranscriptional folding. Principles from this work are broadly applicable, leveraging cooperativity and multisite binding, for developing high-quality ligands for diverse RNA targets.
Assuntos
Dobramento de RNA , Riboswitch , Bibliotecas de Moléculas Pequenas , Pareamento de Bases , Ligantes , Bibliotecas de Moléculas Pequenas/química , Relação Estrutura-Atividade , Tiamina Pirofosfato/química , Transcrição GênicaRESUMO
Single-molecule correlated chemical probing (smCCP) is an experimentally concise strategy for characterizing higher-order structural interactions in RNA. smCCP data yield rich, but complex, information about base pairing, conformational ensembles, and tertiary interactions. To date, through-space communication specifically measuring RNA tertiary structure has been difficult to isolate from structural communication reflective of other interactions. Here, we introduce mutual information as a filtering metric to isolate tertiary structure communication contained within smCCP data and use this strategy to characterize the structural ensemble of the SAM-III riboswitch. We identified an smCCP fingerprint that is selective for states containing a tertiary structure that forms concurrently with cognate ligand binding. We then successfully applied mutual information filters to independent RNAs and isolated through-space tertiary interactions in riboswitches and large RNAs with complex structures. smCCP, coupled with mutual information criteria, can now be used as a tertiary structure discovery tool, including to identify specific states in an ensemble that have a higher-order structure. These studies pave the way for the use of the straightforward smCCP experiment for discovery and characterization of tertiary structure motifs in complex RNAs.
Assuntos
Conformação de Ácido Nucleico , Riboswitch , RNA/química , Pareamento de Bases , Modelos Moleculares , Imagem Individual de Molécula/métodosRESUMO
RNA molecules convey biological information both in their linear sequence and in their base-paired secondary and tertiary structures. Chemical probing experiments, which involve treating an RNA with a reagent that modifies conformationally dynamic nucleotides, have broadly enabled examination of short- and long-range RNA structure in diverse contexts, including in living cells. For decades, chemical probing experiments have been interpreted in a per-nucleotide way, such that the reactivity measured at each nucleotide reports the average structure at a position over all RNA molecules within a sample. However, there are numerous important cases where per-nucleotide chemical probing falls short, including for RNAs that are bound by proteins, RNAs that form complex higher order structures, and RNAs that sample multiple conformations.Recent experimental and computational innovations have started a revolution in RNA structure analysis by transforming chemical probing into a massively parallel, single-molecule experiment. Enabled by a specialized reverse transcription strategy called mutational profiling (MaP), multiple chemical modification events can be measured within individual RNA molecules. Nucleotides that communicate structurally through direct base pairing or large-scale folding-unfolding transitions will react with chemical probes in a correlated manner, thereby revealing structural complexity hidden to conventional approaches. These single-molecule correlated chemical probing (smCCP) experiments can be interpreted to directly identify nucleotides that base pair (the PAIR-MaP strategy) and to reveal long-range, through-space structural communication (RING-MaP). Correlated probing can also define the thermodynamic populations of complex RNA ensembles (DANCE-MaP). Complex RNA-protein networks can be interrogated by cross-linking proteins to RNA and measuring correlations between cross-linked positions (RNP-MaP).smCCP thus visualizes RNA secondary and higher-order structure with unprecedented accuracy, defining novel structures, RNA-protein interaction networks, time-resolved dynamics, and allosteric structural switches. These strategies are not mutually exclusive; in favorable cases, multiple levels of RNA structure â base pairing, through-space structural communication, and equilibrium ensembles â can be resolved concurrently. The physical experimentation required for smCCP is profoundly simple, and experiments are readily performed in cells on RNAs of any size, including large noncoding RNAs and mRNAs. Single-molecule correlated chemical probing is paving the way for a new generation of biophysical studies on RNA in living systems.
Assuntos
Nucleotídeos , RNA , Conformação de Ácido Nucleico , RNA/química , Pareamento de Bases , RNA Mensageiro , Proteínas/genéticaRESUMO
SERPINA1 mRNAs encode the protease inhibitor α-1-antitrypsin and are regulated through post-transcriptional mechanisms. α-1-antitrypsin deficiency leads to chronic obstructive pulmonary disease (COPD) and liver cirrhosis, and specific variants in the 5'-untranslated region (5'-UTR) are associated with COPD. The NM_000295.4 transcript is well expressed and translated in lung and blood and features an extended 5'-UTR that does not contain a competing upstream open reading frame (uORF). We show that the 5'-UTR of NM_000295.4 folds into a well-defined multi-helix structural domain. We systematically destabilized mRNA structure across the NM_000295.4 5'-UTR, and measured changes in (SHAPE quantified) RNA structure and cap-dependent translation relative to a native-sequence reporter. Surprisingly, despite destabilizing local RNA structure, most mutations either had no effect on or decreased translation. Most structure-destabilizing mutations retained native, global 5'-UTR structure. However, those mutations that disrupted the helix that anchors the 5'-UTR domain yielded three groups of non-native structures. Two of these non-native structure groups refolded to create a stable helix near the translation initiation site that decreases translation. Thus, in contrast to the conventional model that RNA structure in 5'-UTRs primarily inhibits translation, complex folding of the NM_000295.4 5'-UTR creates a translation-optimized message by promoting accessibility at the translation initiation site.
Assuntos
Biossíntese de Proteínas , Doença Pulmonar Obstrutiva Crônica , alfa 1-Antitripsina/genética , Regiões 5' não Traduzidas , Humanos , Inibidores de Proteases , Doença Pulmonar Obstrutiva Crônica/genética , RNA Mensageiro/metabolismoRESUMO
Nucleocapsid protein (N-protein) is required for multiple steps in betacoronaviruses replication. SARS-CoV-2-N-protein condenses with specific viral RNAs at particular temperatures making it a powerful model for deciphering RNA sequence specificity in condensates. We identify two separate and distinct double-stranded, RNA motifs (dsRNA stickers) that promote N-protein condensation. These dsRNA stickers are separately recognized by N-protein's two RNA binding domains (RBDs). RBD1 prefers structured RNA with sequences like the transcription-regulatory sequence (TRS). RBD2 prefers long stretches of dsRNA, independent of sequence. Thus, the two N-protein RBDs interact with distinct dsRNA stickers, and these interactions impart specific droplet physical properties that could support varied viral functions. Specifically, we find that addition of dsRNA lowers the condensation temperature dependent on RBD2 interactions and tunes translational repression. In contrast RBD1 sites are sequences critical for sub-genomic (sg) RNA generation and promote gRNA compression. The density of RBD1 binding motifs in proximity to TRS-L/B sequences is associated with levels of sub-genomic RNA generation. The switch to packaging is likely mediated by RBD1 interactions which generate particles that recapitulate the packaging unit of the virion. Thus, SARS-CoV-2 can achieve biochemical complexity, performing multiple functions in the same cytoplasm, with minimal protein components based on utilizing multiple distinct RNA motifs that control N-protein interactions.
Assuntos
Proteínas do Nucleocapsídeo de Coronavírus , RNA de Cadeia Dupla , SARS-CoV-2 , Sítios de Ligação , Proteínas do Nucleocapsídeo de Coronavírus/química , Fosfoproteínas/química , RNA de Cadeia Dupla/genética , RNA Viral/genética , Proteínas de Ligação a RNA/metabolismo , SARS-CoV-2/genética , TemperaturaRESUMO
Structured RNAs bind ligands and are attractive targets for small-molecule drugs. A wide variety of analytical methods have been used to characterize RNA-ligand interactions, but our experience is that most have significant limitations in terms of material requirements and applicability to complex RNAs. Surface plasmon resonance (SPR) potentially overcomes these limitations, but we find that the standard experimental framework measures notable nonspecific electrostatic-mediated interactions, frustrating analysis of weak RNA binders. SPR measurements are typically quantified relative to a non-target reference channel. Here, we show that referencing to a channel containing a non-binding control RNA enables subtraction of nonspecific binding contributions, allowing measurements of accurate and specific binding affinities. We validated this approach for small-molecule binders of two riboswitch RNAs with affinities ranging from nanomolar to millimolar, including low-molecular-mass fragment ligands. SPR implemented with reference subtraction reliably discriminates specific from nonspecific binding, uses RNA and ligand material efficiently, and enables rapid exploration of the ligand-binding landscape for RNA targets.
Assuntos
RNA , Ressonância de Plasmônio de Superfície , Ligantes , Ressonância de Plasmônio de Superfície/métodosRESUMO
RNA lies upstream of nearly all biology and functions as the central conduit of information exchange in all cells. RNA molecules encode information both in their primary sequences and in complex structures that form when an RNA folds back on itself. From the time of discovery of mRNA in the late 1950s until quite recently, we had only a rudimentary understanding of RNA structure across vast regions of most messenger and noncoding RNAs. This deficit is now rapidly being addressed, especially by selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry, mutational profiling (MaP), and closely related platform technologies that, collectively, create chemical microscopes for RNA. These technologies make it possible to interrogate RNA structure, quantitatively, at nucleotide resolution, and at large scales, for entire mRNAs, noncoding RNAs, and viral RNA genomes. By applying comprehensive structure probing to diverse problems, we and others are showing that control of biological function mediated by RNA structure is ubiquitous across prokaryotic and eukaryotic organisms.Work over the past decade using SHAPE-based analyses has clarified key principles. First, the method of RNA structure probing matters. SHAPE-MaP, with its direct and one-step readout that probes nearly every nucleotide by reaction at the 2'-hydroxyl, gives a more detailed and accurate readout than alternatives. Second, comprehensive chemical probing is essential. Focusing on fragments of large RNAs or using meta-gene or statistical analyses to compensate for sparse data sets misses critical features and often yields structure models with poor predictive power. Finally, every RNA has its own internal structural personality. There are myriad ways in which RNA structure modulates sequence accessibility, protein binding, translation, splice-site choice, phase separation, and other fundamental biological processes. In essentially every instance where we have applied rigorous and quantitative SHAPE technologies to study RNA structure-function interrelationships, new insights regarding biological regulatory mechanisms have emerged. RNA elements with more complex higher-order structures appear more likely to contain high-information-content clefts and pockets that bind small molecules, broadly informing a vigorous field of RNA-targeted drug discovery.The broad implications of this collective work are twofold. First, it is long past time to abandon depiction of large RNAs as simple noodle-like or gently flowing molecules. Instead, we need to emphasize that nearly all RNAs are punctuated with distinctive internal structures, a subset of which modulate function in profound ways. Second, structure probing should be an integral component of any effort that seeks to understand the functional nexuses and biological roles of large RNAs.
Assuntos
RNA/metabolismo , Acilação , RNA/químicaRESUMO
The ribosome moves between distinct structural states and is organized into multiple functional domains. Here, we examined hundreds of occurrences of pairwise through-space communication between nucleotides in the ribosome small subunit RNA using RNA interaction groups analyzed by mutational profiling (RING-MaP) single-molecule correlated chemical probing in bacterial cells. RING-MaP revealed four structural communities in the small subunit RNA, each distinct from the organization defined by the RNA secondary structure. The head domain contains 2 structural communities: the outer-head contains the pivot for head swiveling, and an inner-head community is structurally integrated with helix 44 and spans the entire ribosome intersubunit interface. In-cell binding by the antibiotic spectinomycin (Spc) barely perturbs its local binding pocket as revealed by the per-nucleotide chemical probing signal. In contrast, Spc binding overstabilizes long-range RNA-RNA contacts that extend 95 Å across the ribosome that connect the pivot for head swiveling with the axis of intersubunit rotation. The two major motions of the small subunit-head swiveling and intersubunit rotation-are thus coordinated via long-range RNA structural communication, which is specifically modulated by Spc. Single-molecule correlated chemical probing reveals trans-domain structural communication and rationalizes the profound functional effects of binding by a low-molecular-mass antibiotic to the megadalton ribosome.
Assuntos
Antibacterianos/farmacologia , RNA Ribossômico 16S/metabolismo , Espectinomicina/farmacologia , Escherichia coli , RNA Ribossômico 16S/efeitos dos fármacosRESUMO
SHAPE-JuMP is a concise strategy for identifying close-in-space interactions in RNA molecules. Nucleotides in close three-dimensional proximity are crosslinked with a bi-reactive reagent that covalently links the 2'-hydroxyl groups of the ribose moieties. The identities of crosslinked nucleotides are determined using an engineered reverse transcriptase that jumps across crosslinked sites, resulting in a deletion in the cDNA that is detected using massively parallel sequencing. Here we introduce ShapeJumper, a bioinformatics pipeline to process SHAPE-JuMP sequencing data and to accurately identify through-space interactions, as observed in complex JuMP datasets. ShapeJumper identifies proximal interactions with near-nucleotide resolution using an alignment strategy that is optimized to tolerate the unique non-templated reverse-transcription profile of the engineered crosslink-traversing reverse-transcriptase. JuMP-inspired strategies are now poised to replace adapter-ligation for detecting RNA-RNA interactions in most crosslinking experiments.
Assuntos
DNA Complementar/química , RNA/química , Software , Algoritmos , Sítios de Ligação , Biologia Computacional , Reagentes de Ligações Cruzadas , DNA Complementar/genética , Engenharia Genética , Modelos Moleculares , Conformação de Ácido Nucleico , RNA/genética , Alinhamento de Sequência/estatística & dados numéricosRESUMO
RNA structure and dynamics are critical to biological function. However, strategies for determining RNA structure in vivo are limited, with established chemical probing and newer duplex detection methods each having deficiencies. Here we convert the common reagent dimethyl sulfate into a useful probe of all 4 RNA nucleotides. Building on this advance, we introduce PAIR-MaP, which uses single-molecule correlated chemical probing to directly detect base-pairing interactions in cells. PAIR-MaP has superior resolution compared to alternative experiments, can resolve multiple sets of pairing interactions for structurally dynamic RNAs, and enables highly accurate structure modeling, including of RNAs containing multiple pseudoknots and extensively bound by proteins. Application of PAIR-MaP to human RNase MRP and 2 bacterial messenger RNA 5' untranslated regions reveals functionally important and complex structures undetected by prior analyses. PAIR-MaP is a powerful, experimentally concise, and broadly applicable strategy for directly visualizing RNA base pairs and dynamics in cells.
Assuntos
RNA/química , Ésteres do Ácido Sulfúrico/química , Regiões 5' não Traduzidas , Pareamento de Bases , Sobrevivência Celular , Endorribonucleases/genética , Escherichia coli/genética , Humanos , Células Jurkat , Modelos Moleculares , Imagem Molecular/métodos , Técnicas de Sonda Molecular , Sondas Moleculares/química , Conformação de Ácido Nucleico , Nucleotídeos/química , RNA/genética , RNA Longo não Codificante/química , RNA Mensageiro/química , Sequências Reguladoras de Ácido RibonucleicoRESUMO
Higher-order structure governs function for many RNAs. However, discerning this structure for large RNA molecules in solution is an unresolved challenge. Here, we present SHAPE-JuMP (selective 2'-hydroxyl acylation analyzed by primer extension and juxtaposed merged pairs) to interrogate through-space RNA tertiary interactions. A bifunctional small molecule is used to chemically link proximal nucleotides in an RNA structure. The RNA cross-link site is then encoded into complementary DNA (cDNA) in a single, direct step using an engineered reverse transcriptase that "jumps" across cross-linked nucleotides. The resulting cDNAs contain a deletion relative to the native RNA sequence, which can be detected by sequencing, that indicates the sites of cross-linked nucleotides. SHAPE-JuMP measures RNA tertiary structure proximity concisely across large RNA molecules at nanometer resolution. SHAPE-JuMP is especially effective at measuring interactions in multihelix junctions and loop-to-helix packing, enables modeling of the global fold for RNAs up to several hundred nucleotides in length, facilitates ranking of structural models by consistency with through-space restraints, and is poised to enable solution-phase structural interrogation and modeling of complex RNAs.
Assuntos
RNA/química , Acilação , Reagentes de Ligações Cruzadas/química , DNA Complementar/química , Conformação de Ácido Nucleico , Oxazinas/química , RNA/genética , DNA Polimerase Dirigida por RNA/química , DNA Polimerase Dirigida por RNA/genética , Análise de Sequência de DNARESUMO
The emergence of catalysis in early genetic polymers such as RNA is considered a key transition in the origin of life, pre-dating the appearance of protein enzymes. DNA also demonstrates the capacity to fold into three-dimensional structures and form catalysts in vitro. However, to what degree these natural biopolymers comprise functionally privileged chemical scaffolds for folding or the evolution of catalysis is not known. The ability of synthetic genetic polymers (XNAs) with alternative backbone chemistries not found in nature to fold into defined structures and bind ligands raises the possibility that these too might be capable of forming catalysts (XNAzymes). Here we report the discovery of such XNAzymes, elaborated in four different chemistries (arabino nucleic acids, ANA; 2'-fluoroarabino nucleic acids, FANA; hexitol nucleic acids, HNA; and cyclohexene nucleic acids, CeNA) directly from random XNA oligomer pools, exhibiting in trans RNA endonuclease and ligase activities. We also describe an XNA-XNA ligase metalloenzyme in the FANA framework, establishing catalysis in an entirely synthetic system and enabling the synthesis of FANA oligomers and an active RNA endonuclease FANAzyme from its constituent parts. These results extend catalysis beyond biopolymers and establish technologies for the discovery of catalysts in a wide range of polymer scaffolds not found in nature. Evolution of catalysis independent of any natural polymer has implications for the definition of chemical boundary conditions for the emergence of life on Earth and elsewhere in the Universe.
Assuntos
Ácidos Nucleicos/síntese química , Ácidos Nucleicos/metabolismo , Polímeros/química , Polímeros/síntese química , Sequência de Bases , Catálise , Endonucleases/metabolismo , Ligases/metabolismo , Ácidos Nucleicos/química , Polímeros/metabolismo , RNA/metabolismoRESUMO
RNA virus genomes are efficient and compact carriers of biological information, encoding information required for replication both in their primary sequences and in higher-order RNA structures. However, the ubiquity of RNA elements with higher-order folds-in which helices pack together to form complex 3D structures-and the extent to which these elements affect viral fitness are largely unknown. Here we used single-molecule correlated chemical probing to define secondary and tertiary structures across the RNA genome of dengue virus serotype 2 (DENV2). Higher-order RNA structures are pervasive and involve more than one-third of nucleotides in the DENV2 genomic RNA. These 3D structures promote a compact overall architecture and contribute to viral fitness. Disrupting RNA regions with higher-order structures leads to stable, nonreverting mutants and could guide the development of vaccines based on attenuated RNA viruses. The existence of extensive regions of functional RNA elements with tertiary folds in viral RNAs, and likely many other messenger and noncoding RNAs, means that there are significant regions with pocket-containing surfaces that may serve as novel RNA-directed drug targets.
Assuntos
Capsídeo/ultraestrutura , Vírus da Dengue/ultraestrutura , Genoma Viral , RNA Viral/ultraestrutura , Pareamento de Bases , Capsídeo/química , Capsídeo/metabolismo , Vírus da Dengue/classificação , Vírus da Dengue/genética , Vírus da Dengue/metabolismo , Aptidão Genética , Modelos Moleculares , Conformação de Ácido Nucleico , RNA Viral/genética , RNA Viral/metabolismo , Sorogrupo , Montagem de Vírus/genéticaRESUMO
While a variety of powerful tools exists for analyzing RNA structure, identifying long-range and intermolecular base-pairing interactions has remained challenging. Recently, three groups introduced a high-throughput strategy that uses psoralen-mediated crosslinking to directly identify RNA-RNA duplexes in cells. Initial application of these methods highlights the preponderance of long-range structures within and between RNA molecules and their widespread structural dynamics.