Search | VHL Regional Portal

1.

Estimating RNA Secondary Structure Folding Free Energy Changes with efn2.

Zuber, Jeffrey; Mathews, David H.

Methods Mol Biol ; 2726: 1-13, 2024.

Article in English | MEDLINE | ID: mdl-38780725

ABSTRACT

A number of analyses require estimates of the folding free energy changes of specific RNA secondary structures. These predictions are often based on a set of nearest neighbor parameters that models the folding stability of a RNA secondary structure as the sum of folding stabilities of the structural elements that comprise the secondary structure. In the software suite RNAstructure, the free energy change calculation is implemented in the program efn2. The efn2 program estimates the folding free energy change and the experimental uncertainty in the folding free energy change. It can be run through the graphical user interface for RNAstructure, from the command line, or a web server. This chapter provides detailed protocols for using efn2.

Subject(s)

Nucleic Acid Conformation , RNA Folding , RNA , Software , Thermodynamics , RNA/chemistry , Computational Biology/methods , Models, Molecular

2.

NNDB: An Expanded Database of Nearest Neighbor Parameters for Predicting Stability of Nucleic Acid Secondary Structures.

Mittal, Abhinav; Turner, Douglas H; Mathews, David H.

J Mol Biol ; : 168549, 2024 Mar 24.

Article in English | MEDLINE | ID: mdl-38522645

ABSTRACT

Nearest neighbor thermodynamic parameters are widely used for RNA and DNA secondary structure prediction and to model thermodynamic ensembles of secondary structures. The Nearest Neighbor Database (NNDB) is a freely available web resource (https://rna.urmc.rochester.edu/NNDB) that provides the functional forms, parameter values, and example calculations. The NNDB provides the 1999 and 2004 set of RNA folding nearest neighbor parameters. We expanded the database to include a set of DNA parameters and a set of RNA parameters that includes m6A in addition to the canonical RNA nucleobases. The site was redesigned using the Quarto open-source publishing system. A downloadable PDF version of the complete resource and downloadable sets of nearest neighbor parameters are available.

3.

Secondary structures that regulate mRNA translation provide insights for ASO-mediated modulation of cardiac hypertrophy.

Hedaya, Omar M; Venkata Subbaiah, Kadiam C; Jiang, Feng; Xie, Li Huitong; Wu, Jiangbin; Khor, Eng-Soon; Zhu, Mingyi; Mathews, David H; Proschel, Chris; Yao, Peng.

Nat Commun ; 14(1): 6166, 2023 10 03.

Article in English | MEDLINE | ID: mdl-37789015

ABSTRACT

Translation of upstream open reading frames (uORFs) typically abrogates translation of main (m)ORFs. The molecular mechanism of uORF regulation in cells is not well understood. Here, we data-mined human and mouse heart ribosome profiling analyses and identified a double-stranded RNA (dsRNA) structure within the GATA4 uORF that cooperates with the start codon to augment uORF translation and inhibits mORF translation. A trans-acting RNA helicase DDX3X inhibits the GATA4 uORF-dsRNA activity and modulates the translational balance of uORF and mORF. Antisense oligonucleotides (ASOs) that disrupt this dsRNA structure promote mORF translation, while ASOs that base-pair immediately downstream (i.e., forming a bimolecular double-stranded region) of either the uORF or mORF start codon enhance uORF or mORF translation, respectively. Human cardiomyocytes and mice treated with a uORF-enhancing ASO showed reduced cardiac GATA4 protein levels and increased resistance to cardiomyocyte hypertrophy. We further show the broad utility of uORF-dsRNA- or mORF-targeting ASO to regulate mORF translation for other mRNAs. This work demonstrates that the uORF-dsRNA element regulates the translation of multiple mRNAs as a generalizable translational control mechanism. Moreover, we develop a valuable strategy to alter protein expression and cellular phenotypes by targeting or generating dsRNA downstream of a uORF or mORF start codon.

Subject(s)

Cardiomegaly , Protein Biosynthesis , Humans , Animals , Mice , Codon, Initiator/genetics , 5' Untranslated Regions , RNA, Messenger/genetics , Open Reading Frames/genetics , Cardiomegaly/genetics

4.

DNA Structure Design Is Improved Using an Artificially Expanded Alphabet of Base Pairs Including Loop and Mismatch Thermodynamic Parameters.

Pham, Tuan M; Miffin, Terrel; Sun, Hongying; Sharp, Kenneth K; Wang, Xiaoyu; Zhu, Mingyi; Hoshika, Shuichi; Peterson, Raymond J; Benner, Steven A; Kahn, Jason D; Mathews, David H.

ACS Synth Biol ; 12(9): 2750-2763, 2023 09 15.

Article in English | MEDLINE | ID: mdl-37671922

ABSTRACT

We show that in silico design of DNA secondary structures is improved by extending the base pairing alphabet beyond A-T and G-C to include the pair between 2-amino-8-(1'-ß-d-2'-deoxyribofuranosyl)-imidazo-[1,2-a]-1,3,5-triazin-(8H)-4-one and 6-amino-3-(1'-ß-d-2'-deoxyribofuranosyl)-5-nitro-(1H)-pyridin-2-one, abbreviated as P and Z. To obtain the thermodynamic parameters needed to include P-Z pairs in the designs, we performed 47 optical melting experiments and combined the results with previous work to fit free energy and enthalpy nearest neighbor folding parameters for P-Z pairs and G-Z wobble pairs. We find G-Z pairs have stability comparable to that of A-T pairs and should therefore be included as base pairs in structure prediction and design algorithms. Additionally, we extrapolated the set of loop, terminal mismatch, and dangling end parameters to include the P and Z nucleotides. These parameters were incorporated into the RNAstructure software package for secondary structure prediction and analysis. Using the RNAstructure Design program, we solved 99 of the 100 design problems posed by Eterna using the ACGT alphabet or supplementing it with P-Z pairs. Extending the alphabet reduced the propensity of sequences to fold into off-target structures, as evaluated by the normalized ensemble defect (NED). The NED values were improved relative to those from the Eterna example solutions in 91 of 99 cases in which Eterna-player solutions were provided. P-Z-containing designs had average NED values of 0.040, significantly below the 0.074 of standard-DNA-only designs, and inclusion of the P-Z pairs decreased the time needed to converge on a design. This work provides a sample pipeline for inclusion of any expanded alphabet nucleotides into prediction and design workflows.

Subject(s)

Algorithms , DNA , Base Pairing , Thermodynamics , Nucleotides

5.

Genome-Wide DNA Changes Acquired by Candida albicans Caspofungin-Adapted Mutants.

Zuber, Jeffrey; Sah, Sudisht K; Mathews, David H; Rustchenko, Elena.

Microorganisms ; 11(8)2023 Jul 25.

Article in English | MEDLINE | ID: mdl-37630430

ABSTRACT

Drugs from the echinocandin (ECN) class are now recommended 'front-line' treatments of infections caused by a prevailing fungal pathogen, C. albicans. However, the increased use of ECNs is associated with a rising resistance to ECNs. As the acquisition of ECN resistance in C. albicans is viewed as a multistep evolution, determining factors that are associated with the decreased ECN susceptibility is of importance. We have recently identified two cohorts of genes that are either up- or downregulated in concert in order to control remodeling of cell wall, an organelle targeted by ECNs, in laboratory mutants with decreased ECN susceptibility. Here, we profiled the global DNA sequence of four of these adapted mutants in search of DNA changes that are associated with decreased ECN susceptibility. We find a limited number of 112 unique mutations representing two alternative mutational pathways. Approximately half of the mutations occurred as hotspots. Approximately half of mutations and hotspots were shared by ECN-adapted mutants despite the mutants arising as independent events and differing in some of their phenotypes, as well as in condition of chromosome 5. A total of 88 mutations are associated with 43 open reading frames (ORFs) and occurred inside of an ORF or within 1 kb of an ORF, predominantly as single-nucleotide substitution. Mutations occurred more often in the 5'-UTR than in the 3'-UTR by a 1.67:1 ratio. A total of 16 mutations mapped to eight genomic features that were not ORFs: Tca4-4 retrotransposon; Tca2-7 retrotransposon; lambda-4a long terminal repeat; mu-Ra long terminal repeat; MRS-7b Major Repeat Sequence; MRS-R Major Repeat Sequence; RB2-5a repeat sequence; and tL (CAA) leucine tRNA. Finally, eight mutations are not associated with any ORF or other genomic feature. Repeated occurrence of single-nucleotide substitutions in non-related drug-adapted mutants strongly indicates that these DNA changes are accompanying drug adaptation and could possibly influence ECN susceptibility, thus serving as factors facilitating evolution of ECN drug resistance due to classical mutations in FKS1.

6.

LinearCoFold and LinearCoPartition: linear-time algorithms for secondary structure prediction of interacting RNA molecules.

Zhang, He; Li, Sizhen; Dai, Ning; Zhang, Liang; Mathews, David H; Huang, Liang.

Nucleic Acids Res ; 51(18): e94, 2023 10 13.

Article in English | MEDLINE | ID: mdl-37650626

ABSTRACT

Many RNAs function through RNA-RNA interactions. Fast and reliable RNA structure prediction with consideration of RNA-RNA interaction is useful, however, existing tools are either too simplistic or too slow. To address this issue, we present LinearCoFold, which approximates the complete minimum free energy structure of two strands in linear time, and LinearCoPartition, which approximates the cofolding partition function and base pairing probabilities in linear time. LinearCoFold and LinearCoPartition are orders of magnitude faster than RNAcofold. For example, on a sequence pair with combined length of 26,190 nt, LinearCoFold is 86.8× faster than RNAcofold MFE mode, and LinearCoPartition is 642.3× faster than RNAcofold partition function mode. Surprisingly, LinearCoFold and LinearCoPartition's predictions have higher PPV and sensitivity of intermolecular base pairs. Furthermore, we apply LinearCoFold to predict the RNA-RNA interaction between SARS-CoV-2 genomic RNA (gRNA) and human U4 small nuclear RNA (snRNA), which has been experimentally studied, and observe that LinearCoFold's prediction correlates better with the wet lab results than RNAcofold's.

Subject(s)

Algorithms , RNA , Humans , Base Pairing , Genomics , Nucleic Acid Conformation , RNA/chemistry , RNA/metabolism , RNA, Viral/chemistry , SARS-CoV-2/chemistry

7.

Secondary structures that regulate mRNA translation provide insights for ASO-mediated modulation of cardiac hypertrophy.

Hedaya, Omar M; Subbaiah, Kadiam C Venkata; Jiang, Feng; Xie, Li Huitong; Wu, Jiangbin; Khor, EngSoon; Zhu, Mingyi; Mathews, David H; Proschel, Chris; Yao, Peng.

bioRxiv ; 2023 Jun 15.

Article in English | MEDLINE | ID: mdl-37397986

ABSTRACT

Translation of upstream open reading frames (uORFs) typically abrogates translation of main (m)ORFs. The molecular mechanism of uORF regulation in cells is not well understood. Here, we identified a double-stranded RNA (dsRNA) structure residing within the GATA4 uORF that augments uORF translation and inhibits mORF translation. Antisense oligonucleotides (ASOs) that disrupt this dsRNA structure promote mORF translation, while ASOs that base-pair immediately downstream (i.e., forming a bimolecular double-stranded region) of either the uORF or mORF start codon enhance uORF or mORF translation, respectively. Human cardiomyocytes and mice treated with a uORF-enhancing ASO showed reduced cardiac GATA4 protein levels and increased resistance to cardiomyocyte hypertrophy. We further show the general utility of uORF-dsRNA- or mORF- targeting ASO to regulate mORF translation for other mRNAs. Our work demonstrates a regulatory paradigm that controls translational efficiency and a useful strategy to alter protein expression and cellular phenotypes by targeting or generating dsRNA downstream of a uORF or mORF start codon. Bullet points for discoveries: dsRNA within GATA4 uORF activates uORF translation and inhibits mORF translation. ASOs that target the dsRNA can either inhibit or enhance GATA4 mORF translation. ASOs can be used to impede hypertrophy in human cardiomyocytes and mouse hearts.uORF- and mORF-targeting ASOs can be used to control translation of multiple mRNAs.

8.

RNA Secondary Structure Analysis Using RNAstructure.

Ali, Sara E; Mittal, Abhinav; Mathews, David H.

Curr Protoc ; 3(7): e846, 2023 Jul.

Article in English | MEDLINE | ID: mdl-37487054

ABSTRACT

RNAstructure is a user-friendly program for the prediction and analysis of RNA secondary structure. It is available as a web server, a program with a graphical user interface, or a set of command line tools. The programs are available for Microsoft Windows, macOS, or Linux. This article provides protocols for prediction of RNA secondary structure (using the web server, the graphical user interface, or the command line) and high-affinity oligonucleotide binding sites to a structured RNA target (using the graphical user interface). © 2023 Wiley Periodicals LLC. Basic Protocol 1: Predicting RNA secondary structure using the RNAstructure web server Alternate Protocol 1: Predicting secondary structure and base pair probabilities using the RNAstructure graphical user interface Alternate Protocol 2: Predicting secondary structure and base pair probabilities using the RNAstructure command line interface Basic Protocol 2: Predicting binding affinities of oligonucleotides complementary to an RNA target using OligoWalk.

Subject(s)

Oligonucleotides , RNA , Binding Sites , Probability , Protein Structure, Secondary

9.

RNA design via structure-aware multifrontier ensemble optimization.

Zhou, Tianshuo; Dai, Ning; Li, Sizhen; Ward, Max; Mathews, David H; Huang, Liang.

Bioinformatics ; 39(39 Suppl 1): i563-i571, 2023 06 30.

Article in English | MEDLINE | ID: mdl-37387188

ABSTRACT

MOTIVATION: RNA design is the search for a sequence or set of sequences that will fold to desired structure, also known as the inverse problem of RNA folding. However, the sequences designed by existing algorithms often suffer from low ensemble stability, which worsens for long sequence design. Additionally, for many methods only a small number of sequences satisfying the MFE criterion can be found by each run of design. These drawbacks limit their use cases. RESULTS: We propose an innovative optimization paradigm, SAMFEO, which optimizes ensemble objectives (equilibrium probability or ensemble defect) by iterative search and yields a very large number of successfully designed RNA sequences as byproducts. We develop a search method which leverages structure level and ensemble level information at different stages of the optimization: initialization, sampling, mutation, and updating. Our work, while being less complicated than others, is the first algorithm that is able to design thousands of RNA sequences for the puzzles from the Eterna100 benchmark. In addition, our algorithm solves the most Eterna100 puzzles among all the general optimization based methods in our study. The only baseline solving more puzzles than our work is dependent on handcrafted heuristics designed for a specific folding model. Surprisingly, our approach shows superiority on designing long sequences for structures adapted from the database of 16S Ribosomal RNAs. AVAILABILITY AND IMPLEMENTATION: Our source code and data used in this article is available at https://github.com/shanry/SAMFEO.

Subject(s)

Algorithms , Benchmarking , Databases, Factual , Mutation , RNA, Ribosomal, 16S

10.

DNA Structure Design Is Improved Using an Artificially Expanded Alphabet of Base Pairs Including Loop and Mismatch Thermodynamic Parameters.

Pham, Tuan M; Miffin, Terrel; Sun, Hongying; Sharp, Kenneth K; Wang, Xiaoyu; Zhu, Mingyi; Hoshika, Shuichi; Peterson, Raymond J; Benner, Steven A; Kahn, Jason D; Mathews, David H.

bioRxiv ; 2023 Jun 08.

Article in English | MEDLINE | ID: mdl-37333404

ABSTRACT

We show that in silico design of DNA secondary structures is improved by extending the base pairing alphabet beyond A-T and G-C to include the pair between 2-amino-8-(1'-ß-D-2'-deoxyribofuranosyl)-imidazo-[1,2- a ]-1,3,5-triazin-(8 H )-4-one and 6-amino-3-(1'-ß-D-2'-deoxyribofuranosyl)-5-nitro-(1 H )-pyridin-2-one, simply P and Z. To obtain the thermodynamic parameters needed to include P-Z pairs in the designs, we performed 47 optical melting experiments and combined the results with previous work to fit a new set of free energy and enthalpy nearest neighbor folding parameters for P-Z pairs and G-Z wobble pairs. We find that G-Z pairs have stability comparable to A-T pairs and therefore should be considered quantitatively by structure prediction and design algorithms. Additionally, we extrapolated the set of loop, terminal mismatch, and dangling end parameters to include P and Z nucleotides. These parameters were incorporated into the RNAstructure software package for secondary structure prediction and analysis. Using the RNAstructure Design program, we solved 99 of the 100 design problems posed by Eterna using the ACGT alphabet or supplementing with P-Z pairs. Extending the alphabet reduced the propensity of sequences to fold into off-target structures, as evaluated by the normalized ensemble defect (NED). The NED values were improved relative to those from the Eterna example solutions in 91 of 99 cases where Eterna-player solutions were provided. P-Z-containing designs had average NED values of 0.040, significantly below the 0.074 of standard-DNA-only designs, and inclusion of the P-Z pairs decreased the time needed to converge on a design. This work provides a sample pipeline for inclusion of any expanded alphabet nucleotides into prediction and design workflows.

11.

Algorithm for optimized mRNA design improves stability and immunogenicity.

Zhang, He; Zhang, Liang; Lin, Ang; Xu, Congcong; Li, Ziyu; Liu, Kaibo; Liu, Boxiang; Ma, Xiaopin; Zhao, Fanfan; Jiang, Huiling; Chen, Chunxiu; Shen, Haifa; Li, Hangwen; Mathews, David H; Zhang, Yujian; Huang, Liang.

Nature ; 621(7978): 396-403, 2023 Sep.

Article in English | MEDLINE | ID: mdl-37130545

ABSTRACT

Messenger RNA (mRNA) vaccines are being used to combat the spread of COVID-19 (refs. 1-3), but they still exhibit critical limitations caused by mRNA instability and degradation, which are major obstacles for the storage, distribution and efficacy of the vaccine products4. Increasing secondary structure lengthens mRNA half-life, which, together with optimal codons, improves protein expression5. Therefore, a principled mRNA design algorithm must optimize both structural stability and codon usage. However, owing to synonymous codons, the mRNA design space is prohibitively large-for example, there are around 2.4 × 10632 candidate mRNA sequences for the SARS-CoV-2 spike protein. This poses insurmountable computational challenges. Here we provide a simple and unexpected solution using the classical concept of lattice parsing in computational linguistics, where finding the optimal mRNA sequence is analogous to identifying the most likely sentence among similar-sounding alternatives6. Our algorithm LinearDesign finds an optimal mRNA design for the spike protein in just 11 minutes, and can concurrently optimize stability and codon usage. LinearDesign substantially improves mRNA half-life and protein expression, and profoundly increases antibody titre by up to 128 times in mice compared to the codon-optimization benchmark on mRNA vaccines for COVID-19 and varicella-zoster virus. This result reveals the great potential of principled mRNA design and enables the exploration of previously unreachable but highly stable and efficient designs. Our work is a timely tool for vaccines and other mRNA-based medicines encoding therapeutic proteins such as monoclonal antibodies and anti-cancer drugs7,8.

Subject(s)

Algorithms , COVID-19 Vaccines , COVID-19 , RNA Stability , RNA, Messenger , SARS-CoV-2 , mRNA Vaccines , Animals , Humans , Mice , Codon/genetics , COVID-19/genetics , COVID-19/immunology , COVID-19/prevention & control , COVID-19 Vaccines/chemistry , COVID-19 Vaccines/genetics , COVID-19 Vaccines/immunology , Half-Life , Herpesvirus 3, Human/genetics , Herpesvirus 3, Human/immunology , mRNA Vaccines/chemistry , mRNA Vaccines/genetics , mRNA Vaccines/immunology , RNA Stability/genetics , RNA Stability/immunology , RNA, Messenger/chemistry , RNA, Messenger/genetics , RNA, Messenger/immunology , RNA, Messenger/metabolism , SARS-CoV-2/genetics , SARS-CoV-2/immunology

12.

In vivo secondary structural analysis of Influenza A virus genomic RNA.

Mirska, Barbara; Wozniak, Tomasz; Lorent, Dagny; Ruszkowska, Agnieszka; Peterson, Jake M; Moss, Walter N; Mathews, David H; Kierzek, Ryszard; Kierzek, Elzbieta.

Cell Mol Life Sci ; 80(5): 136, 2023 May 02.

Article in English | MEDLINE | ID: mdl-37131079

ABSTRACT

Influenza A virus (IAV) is a respiratory virus that causes epidemics and pandemics. Knowledge of IAV RNA secondary structure in vivo is crucial for a better understanding of virus biology. Moreover, it is a fundament for the development of new RNA-targeting antivirals. Chemical RNA mapping using selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) coupled with Mutational Profiling (MaP) allows for the thorough examination of secondary structures in low-abundance RNAs in their biological context. So far, the method has been used for analyzing the RNA secondary structures of several viruses including SARS-CoV-2 in virio and in cellulo. Here, we used SHAPE-MaP and dimethyl sulfate mutational profiling with sequencing (DMS-MaPseq) for genome-wide secondary structure analysis of viral RNA (vRNA) of the pandemic influenza A/California/04/2009 (H1N1) strain in both in virio and in cellulo environments. Experimental data allowed the prediction of the secondary structures of all eight vRNA segments in virio and, for the first time, the structures of vRNA5, 7, and 8 in cellulo. We conducted a comprehensive structural analysis of the proposed vRNA structures to reveal the motifs predicted with the highest accuracy. We also performed a base-pairs conservation analysis of the predicted vRNA structures and revealed many highly conserved vRNA motifs among the IAVs. The structural motifs presented herein are potential candidates for new IAV antiviral strategies.

Subject(s)

COVID-19 , Influenza A Virus, H1N1 Subtype , Influenza A virus , Humans , Influenza A Virus, H1N1 Subtype/genetics , SARS-CoV-2/genetics , Influenza A virus/genetics , RNA, Viral/genetics , Genomics

13.

Computational Resources for Molecular Biology 2023.

Mathews, David H; Casadio, Rita; Sternberg, Michael J E.

J Mol Biol ; 435(14): 168160, 2023 07 15.

Article in English | MEDLINE | ID: mdl-37244569

Subject(s)

Computational Biology , Molecular Biology

14.

Generation and Functional Analysis of Defective Viral Genomes during SARS-CoV-2 Infection.

Zhou, Terry; Gilliam, Nora J; Li, Sizhen; Spandau, Simone; Osborn, Raven M; Connor, Sarah; Anderson, Christopher S; Mariani, Thomas J; Thakar, Juilee; Dewhurst, Stephen; Mathews, David H; Huang, Liang; Sun, Yan.

mBio ; 14(3): e0025023, 2023 06 27.

Article in English | MEDLINE | ID: mdl-37074178

ABSTRACT

Defective viral genomes (DVGs) have been identified in many RNA viruses as a major factor influencing antiviral immune response and viral pathogenesis. However, the generation and function of DVGs in SARS-CoV-2 infection are less known. In this study, we elucidated DVG generation in SARS-CoV-2 and its relationship with host antiviral immune response. We observed DVGs ubiquitously from transcriptome sequencing (RNA-seq) data sets of in vitro infections and autopsy lung tissues of COVID-19 patients. Four genomic hot spots were identified for DVG recombination, and RNA secondary structures were suggested to mediate DVG formation. Functionally, bulk and single-cell RNA-seq analysis indicated the interferon (IFN) stimulation of SARS-CoV-2 DVGs. We further applied our criteria to the next-generation sequencing (NGS) data set from a published cohort study and observed a significantly higher amount and frequency of DVG in symptomatic patients than those in asymptomatic patients. Finally, we observed exceptionally diverse DVG populations in one immunosuppressive patient up to 140 days after the first positive test of COVID-19, suggesting for the first time an association between DVGs and persistent viral infections in SARS-CoV-2. Together, our findings strongly suggest a critical role of DVGs in modulating host IFN responses and symptom development, calling for further inquiry into the mechanisms of DVG generation and into how DVGs modulate host responses and infection outcome during SARS-CoV-2 infection. IMPORTANCE Defective viral genomes (DVGs) are generated ubiquitously in many RNA viruses, including SARS-CoV-2. Their interference activity to full-length viruses and IFN stimulation provide the potential for them to be used in novel antiviral therapies and vaccine development. SARS-CoV-2 DVGs are generated through the recombination of two discontinuous genomic fragments by viral polymerase complex, and this recombination is also one of the major mechanisms for the emergence of new coronaviruses. Focusing on the generation and function of SARS-CoV-2 DVGs, these studies identify new hot spots for nonhomologous recombination and strongly suggest that the secondary structures within viral genomes mediate the recombination. Furthermore, these studies provide the first evidence for IFN stimulation activity of de novo DVGs during natural SARS-CoV-2 infection. These findings set up the foundation for further mechanism studies of SARS-CoV-2 recombination and provide evidence to harness the immunostimulatory potential of DVGs in the development of a vaccine and antivirals for SARS-CoV-2.

Subject(s)

COVID-19 , RNA Viruses , Humans , RNA, Viral/genetics , Cohort Studies , COVID-19/genetics , SARS-CoV-2/genetics , Genome, Viral , RNA Viruses/genetics , Antiviral Agents

15.

A riboswitch separated from its ribosome-binding site still regulates translation.

Schroeder, Griffin M; Akinyemi, Olayinka; Malik, Jeffrey; Focht, Caroline M; Pritchett, Elizabeth M; Baker, Cameron D; McSally, James P; Jenkins, Jermaine L; Mathews, David H; Wedekind, Joseph E.

Nucleic Acids Res ; 51(5): 2464-2484, 2023 03 21.

Article in English | MEDLINE | ID: mdl-36762498

ABSTRACT

Riboswitches regulate downstream gene expression by binding cellular metabolites. Regulation of translation initiation by riboswitches is posited to occur by metabolite-mediated sequestration of the Shine-Dalgarno sequence (SDS), causing bypass by the ribosome. Recently, we solved a co-crystal structure of a prequeuosine1-sensing riboswitch from Carnobacterium antarcticum that binds two metabolites in a single pocket. The structure revealed that the second nucleotide within the gene-regulatory SDS, G34, engages in a crystal contact, obscuring the molecular basis of gene regulation. Here, we report a co-crystal structure wherein C10 pairs with G34. However, molecular dynamics simulations reveal quick dissolution of the pair, which fails to reform. Functional and chemical probing assays inside live bacterial cells corroborate the dispensability of the C10-G34 pair in gene regulation, leading to the hypothesis that the compact pseudoknot fold is sufficient for translation attenuation. Remarkably, the C. antarcticum aptamer retained significant gene-regulatory activity when uncoupled from the SDS using unstructured spacers up to 10 nucleotides away from the riboswitch-akin to steric-blocking employed by sRNAs. Accordingly, our work reveals that the RNA fold regulates translation without SDS sequestration, expanding known riboswitch-mediated gene-regulatory mechanisms. The results infer that riboswitches exist wherein the SDS is not embedded inside a stable fold.

Subject(s)

Protein Biosynthesis , Riboswitch , Binding Sites , Gene Expression Regulation , Molecular Dynamics Simulation , Nucleic Acid Conformation , Ribosomes/genetics , Ribosomes/metabolism

16.

Linear-Time Algorithms for RNA Structure Prediction.

Zhang, He; Zhang, Liang; Liu, Kaibo; Li, Sizhen; Mathews, David H; Huang, Liang.

Methods Mol Biol ; 2586: 15-34, 2023.

Article in English | MEDLINE | ID: mdl-36705896

ABSTRACT

RNA secondary structure prediction is widely used to understand RNA function. Existing dynamic programming-based algorithms, both the classical minimum free energy (MFE) methods and partition function methods, suffer from a major limitation: their runtimes scale cubically with the RNA length, and this slowness limits their use in genome-wide applications. Inspired by incremental parsing for context-free grammars in computational linguistics, we designed linear-time heuristic algorithms, LinearFold and LinearPartition, to approximate the MFE structure, partition function and base pairing probabilities. These programs are orders of magnitude faster than Vienna RNAfold and CONTRAfold on long sequences. More interestingly, LinearFold and LinearPartition lead to more accurate predictions on the longest sequence families for which the structures are well established (16S and 23S Ribosomal RNAs), as well as improved accuracies for long-range base pairs (500 + nucleotides apart). This chapter provides protocols for using LinearFold and LinearPartition for secondary structure prediction.

Subject(s)

Algorithms , RNA , Humans , RNA/chemistry , Nucleic Acid Conformation , Base Pairing , Entropy , Computational Biology/methods , Sequence Analysis, RNA/methods

17.

LazySampling and LinearSampling: fast stochastic sampling of RNA secondary structure with applications to SARS-CoV-2.

Zhang, He; Li, Sizhen; Zhang, Liang; Mathews, David H; Huang, Liang.

Nucleic Acids Res ; 51(2): e7, 2023 01 25.

Article in English | MEDLINE | ID: mdl-36401871

ABSTRACT

Many RNAs fold into multiple structures at equilibrium, and there is a need to sample these structures according to their probabilities in the ensemble. The conventional sampling algorithm suffers from two limitations: (i) the sampling phase is slow due to many repeated calculations; and (ii) the end-to-end runtime scales cubically with the sequence length. These issues make it difficult to be applied to long RNAs, such as the full genomes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). To address these problems, we devise a new sampling algorithm, LazySampling, which eliminates redundant work via on-demand caching. Based on LazySampling, we further derive LinearSampling, an end-to-end linear time sampling algorithm. Benchmarking on nine diverse RNA families, the sampled structures from LinearSampling correlate better with the well-established secondary structures than Vienna RNAsubopt and RNAplfold. More importantly, LinearSampling is orders of magnitude faster than standard tools, being 428× faster (72 s versus 8.6 h) than RNAsubopt on the full genome of SARS-CoV-2 (29 903 nt). The resulting sample landscape correlates well with the experimentally guided secondary structure models, and is closer to the alternative conformations revealed by experimentally driven analysis. Finally, LinearSampling finds 23 regions of 15 nt with high accessibilities in the SARS-CoV-2 genome, which are potential targets for COVID-19 diagnostics and therapeutics.

Subject(s)

Algorithms , COVID-19 , SARS-CoV-2 , Humans , Base Sequence , COVID-19/diagnosis , COVID-19/genetics , RNA, Viral/genetics , RNA, Viral/chemistry , SARS-CoV-2/genetics , Nucleic Acid Conformation

18.

Isothermal Titration Calorimetry Analysis of a Cooperative Riboswitch Using an Interdependent-Sites Binding Model.

Cavender, Chapin E; Schroeder, Griffin M; Mathews, David H; Wedekind, Joseph E.

Methods Mol Biol ; 2568: 53-73, 2023.

Article in English | MEDLINE | ID: mdl-36227562

ABSTRACT

Isothermal titration calorimetry (ITC) is a powerful biophysical tool to characterize energetic profiles of biomacromolecular interactions without any alteration of the underlying chemical structures. In this protocol, we describe procedures for performing, analyzing, and interpreting ITC data obtained from a cooperative riboswitch-ligand interaction.

Subject(s)

Riboswitch , Calorimetry/methods , Ligands , Protein Binding , Thermodynamics

19.

Intrinsically Unstructured Sequences in the mRNA 3' UTR Reduce the Ability of Poly(A) Tail to Enhance Translation.

Lai, Wan-Jung C; Zhu, Mingyi; Belinite, Margarita; Ballard, Gregory; Mathews, David H; Ermolenko, Dmitri N.

J Mol Biol ; 434(24): 167877, 2022 12 30.

Article in English | MEDLINE | ID: mdl-36368412

ABSTRACT

The 5' cap and 3' poly(A) tail of mRNA are known to synergistically stimulate translation initiation via the formation of the capâ¢eIF4Eâ¢eIF4Gâ¢PABPâ¢poly(A) complex. Most mRNA sequences have an intrinsic propensity to fold into extensive intramolecular secondary structures that result in short end-to-end distances. The inherent compactness of mRNAs might stabilize the capâ¢eIF4Eâ¢eIF4Gâ¢PABPâ¢poly(A) complex and enhance cap-poly(A) translational synergy. Here, we test this hypothesis by introducing intrinsically unstructured sequences into the 5' or 3' UTRs of model mRNAs. We found that the introduction of unstructured sequences into the 3' UTR, but not the 5' UTR, decreases mRNA translation in cell-free wheat germ and yeast extracts without affecting mRNA stability. The observed reduction in protein synthesis results from the diminished ability of the poly(A) tail to stimulate translation. These results suggest that base pair formation by the 3' UTR enhances the cap-poly(A) synergy in translation initiation.

Subject(s)

3' Untranslated Regions , Poly A , Protein Biosynthesis , 5' Untranslated Regions , Eukaryotic Initiation Factor-4G/chemistry , Poly A/chemistry , Poly(A)-Binding Proteins/chemistry , RNA Caps/chemistry , Cell-Free System , Triticum , Saccharomyces cerevisiae , Nucleic Acid Conformation , RNA Stability

20.

Generation and functional analysis of defective viral genomes during SARS-CoV-2 infection.

Zhou, Terry; Gilliam, Nora J; Li, Sizhen; Spaudau, Simone; Osborn, Raven M; Anderson, Christopher S; Mariani, Thomas J; Thakar, Juilee; Dewhurst, Stephen; Mathews, David H; Huang, Liang; Sun, Yan.

bioRxiv ; 2022 Sep 23.

Article in English | MEDLINE | ID: mdl-36172120

ABSTRACT

Defective viral genomes (DVGs) have been identified in many RNA viruses as a major factor influencing antiviral immune response and viral pathogenesis. However, the generation and function of DVGs in SARS-CoV-2 infection are less known. In this study, we elucidated DVG generation in SARS-CoV-2 and its relationship with host antiviral immune response. We observed DVGs ubiquitously from RNA-seq datasets of in vitro infections and autopsy lung tissues of COVID-19 patients. Four genomic hotspots were identified for DVG recombination and RNA secondary structures were suggested to mediate DVG formation. Functionally, bulk and single cell RNA-seq analysis indicated the IFN stimulation of SARS-CoV-2 DVGs. We further applied our criteria to the NGS dataset from a published cohort study and observed significantly higher DVG amount and frequency in symptomatic patients than that in asymptomatic patients. Finally, we observed unusually high DVG frequency in one immunosuppressive patient up to 140 days after admitted to hospital due to COVID-19, first-time suggesting an association between DVGs and persistent viral infections in SARS-CoV-2. Together, our findings strongly suggest a critical role of DVGs in modulating host IFN responses and symptom development, calling for further inquiry into the mechanisms of DVG generation and how DVGs modulate host responses and infection outcome during SARS-CoV-2 infection. Importance: Defective viral genomes (DVGs) are ubiquitously generated in many RNA viruses, including SARS-CoV-2. Their interference activity to full-length viruses and IFN stimulation provide them the potential for novel antiviral therapies and vaccine development. SARS-CoV-2 DVGs are generated through the recombination of two discontinuous genomic fragments by viral polymerase complex and the recombination is also one of the major mechanisms for the emergence of new coronaviruses. Focusing on the generation and function of SARS-CoV-2 DVGs, these studies identify new hotspots for non-homologous recombination and strongly suggest that the secondary structures within viral genomes mediate the recombination. Furthermore, these studies provide the first evidence for IFN stimulation activity of de novo DVGs during natural SARS-CoV-2 infection. These findings set up the foundation for further mechanism studies of SARS-CoV-2 recombination and provide the evidence to harness DVGsâ™ immunostimulatory potential in the development of vaccine and antivirals for SARS-CoV-2.

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL