Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Bull Math Biol ; 82(10): 133, 2020 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-33029669

RESUMO

A growing number of RNA sequences are now known to exist in some distribution with two or more different stable structures. Recent algorithms attempt to reconstruct such mixtures using the list of nucleotides in a sequence in conjunction with auxiliary experimental footprinting data. In this paper, we demonstrate some challenges which remain in addressing this problem; in particular we consider the difficulty of reconstructing a mixture of two RNA structures across a spectrum of different relative abundances. Although progress has been made in identifying the stable structures present, it remains nontrivial to predict the relative abundance of each within the experimentally sampled mixture. Because the ratio of structures present can change depending on experimental conditions, it is the footprinting data-and not the sequence-which must encode information on changes in the relative abundance. Here, we use simulated experimental data to demonstrate that there exist RNA sequences and relative abundance combinations which cannot be recovered by current methods. We then prove that this is not a single exception, but rather part of the rule. In particular, we show, using a Nussinov-Jacobson model, that recovering the relative abundances is difficult for a large proportion of RNA structure pairs. Lastly, we use information theory to establish a framework for quantifying how useful auxiliary data is in predicting the relative abundance of a structure. Together, these results demonstrate that aspects of the problem of reconstructing a mixture of RNA structures from experimental data remain open.


Assuntos
Modelos Biológicos , RNA , Algoritmos , Sequência de Bases , Conceitos Matemáticos , Conformação de Ácido Nucleico , Nucleotídeos , RNA/química , RNA/genética
2.
Nucleic Acids Res ; 42(22): e171, 2014 Dec 16.
Artigo em Inglês | MEDLINE | ID: mdl-25392423

RESUMO

As the biomedical impact of small RNAs grows, so does the need to understand competing structural alternatives for regions of functional interest. Suboptimal structure analysis provides significantly more RNA base pairing information than a single minimum free energy prediction. Yet computational enhancements like Boltzmann sampling have not been fully adopted by experimentalists since identifying meaningful patterns in this data can be challenging. Profiling is a novel approach to mining RNA suboptimal structure data which makes the power of ensemble-based analysis accessible in a stable and reliable way. Balancing abstraction and specificity, profiling identifies significant combinations of base pairs which dominate low-energy RNA secondary structures. By design, critical similarities and differences are highlighted, yielding crucial information for molecular biologists. The code is freely available via http://gtfold.sourceforge.net/profiling.html.


Assuntos
Pequeno RNA não Traduzido/química , Análise de Sequência de RNA/métodos , Pareamento de Bases , Interpretação Estatística de Dados , Modelos Moleculares , Conformação de Ácido Nucleico , RNA Bacteriano/química , Vibrio cholerae/genética
3.
Nucleic Acids Res ; 41(5): 2807-16, 2013 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-23325843

RESUMO

Recent advances in RNA structure determination include using data from high-throughput probing experiments to improve thermodynamic prediction accuracy. We evaluate the extent and nature of improvements in data-directed predictions for a diverse set of 16S/18S ribosomal sequences using a stochastic model of experimental SHAPE data. The average accuracy for 1000 data-directed predictions always improves over the original minimum free energy (MFE) structure. However, the amount of improvement varies with the sequence, exhibiting a correlation with MFE accuracy. Further analysis of this correlation shows that accurate MFE base pairs are typically preserved in a data-directed prediction, whereas inaccurate ones are not. Thus, the positive predictive value of common base pairs is consistently higher than the directed prediction accuracy. Finally, we confirm sequence dependencies in the directability of thermodynamic predictions and investigate the potential for greater accuracy improvements in the worst performing test sequence.


Assuntos
Simulação por Computador , Modelos Moleculares , RNA Ribossômico 16S/química , RNA Ribossômico 18S/química , Software , Algoritmos , Animais , Funções Verossimilhança , Conformação de Ácido Nucleico , RNA Arqueal/química , RNA Bacteriano/química , Processos Estocásticos , Termodinâmica
4.
J Math Biol ; 69(6-7): 1743-72, 2014 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24384698

RESUMO

We analyze the distribution of RNA secondary structures given by the Knudsen-Hein stochastic context-free grammar used in the prediction program Pfold. Our main theorem gives relations between the expected number of these motifs--independent of the grammar probabilities. These relations are a consequence of proving that the distribution of base pairs, of helices, and of different types of loops is asymptotically Gaussian in this model of RNA folding. Proof techniques use singularity analysis of probability generating functions. We also demonstrate that these asymptotic results capture well the expected number of RNA base pairs in native ribosomal structures, and certain other aspects of their predicted secondary structures. In particular, we find that the predicted structures largely satisfy the expected relations, although the native structures do not.


Assuntos
Modelos Químicos , Conformação de Ácido Nucleico , Dobramento de RNA , RNA/química , Algoritmos , Pareamento de Bases , Distribuição Normal , Processos Estocásticos , Termodinâmica
5.
Methods Mol Biol ; 2726: 105-124, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38780729

RESUMO

The structure of an RNA sequence encodes information about its biological function. Dynamic programming algorithms are often used to predict the conformation of an RNA molecule from its sequence alone, and adding experimental data as auxiliary information improves prediction accuracy. This auxiliary data is typically incorporated into the nearest neighbor thermodynamic model22 by converting the data into pseudoenergies. Here, we look at how much of the space of possible structures auxiliary data allows prediction methods to explore. We find that for a large class of RNA sequences, auxiliary data shifts the predictions significantly. Additionally, we find that predictions are highly sensitive to the parameters which define the auxiliary data pseudoenergies. In fact, the parameter space can typically be partitioned into regions where different structural predictions predominate.


Assuntos
Algoritmos , Biologia Computacional , Conformação de Ácido Nucleico , RNA , Termodinâmica , RNA/química , RNA/genética , Biologia Computacional/métodos , Software
6.
J Struct Biol ; 180(1): 110-6, 2012 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22750417

RESUMO

Satellite tobacco mosaic virus (STMV) is an icosahedral T=1 single-stranded RNA virus with a genome containing 1058 nucleotides. X-ray crystallography revealed a structure containing 30 double-helical RNA segments, with each helix having nine base pairs and an unpaired nucleotide at the 3' end of each strand. Based on this structure, Larson and McPherson proposed a model of 30 hairpin-loop elements occupying the edges of the icosahedron and connected by single-stranded regions. More recently, Schroeder et al. have combined the results of chemical probing with a novel helix searching algorithm to propose a specific secondary structure for the STMV genome, compatible with the Larson-McPherson model. Here we report an all-atom model of STMV, using the complete protein and RNA sequences and the Schroeder RNA secondary structure. As far as we know, this is the first all-atom model for the complete structure of any virus (100% of the atoms) using the natural genomic sequence.


Assuntos
Capsídeo/ultraestrutura , Modelos Moleculares , RNA Viral/ultraestrutura , Vírus Satélite do Mosaico do Tabaco/ultraestrutura , Capsídeo/química , Cristalografia por Raios X , Sequências Repetidas Invertidas , Conformação de Ácido Nucleico , Estrutura Quaternária de Proteína , RNA Viral/química
7.
Bull Math Biol ; 73(4): 754-76, 2011 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-21207176

RESUMO

Motivated by recent work in parametric sequence alignment, we study the parameter space for scoring RNA folds and construct an RNA polytope. A vertex of this polytope corresponds to RNA secondary structures with common branching. We use this polytope and its normal fan to study the effect of varying three parameters in the free energy model that are not determined experimentally. Our results indicate that variation of these specific parameters does not have a dramatic effect on the structures predicted by the free energy model. We additionally map a collection of known RNA secondary structures to the RNA polytope.


Assuntos
Modelos Moleculares , Conformação de Ácido Nucleico , RNA/química , Termodinâmica , Algoritmos , Sequência de Bases , Bases de Dados de Ácidos Nucleicos
8.
Nucleic Acids Res ; 37(4): e29, 2009 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-19158187

RESUMO

The identification of small structural motifs and their organization into larger subassemblies is of fundamental interest in the analysis, prediction and design of 3D structures of large RNAs. This problem has been studied only sparsely, as most of the existing work is limited to the characterization and discovery of motifs in RNA secondary structures. We present a novel geometric method for the characterization and identification of structural motifs in 3D rRNA molecules. This method enables the efficient recognition of known 3D motifs, such as tetraloops, E-loops, kink-turns and others. Furthermore, it provides a new way of characterizing complex 3D motifs, notably junctions, that have been defined and identified in the secondary structure but have not been analyzed and classified in three dimensions. We demonstrate the relevance and utility of our approach by applying it to the Haloarcula marismortui large ribosomal unit. Pending the implementation of a dedicated web server, the code accompanying this article, written in JAVA, is available upon request from the contact author.


Assuntos
RNA Ribossômico/química , Biologia Computacional/métodos , Haloarcula marismortui/genética , Modelos Moleculares , Conformação de Ácido Nucleico , RNA Ribossômico/classificação , Análise de Sequência de RNA
9.
Bull Math Biol ; 71(1): 84-106, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19083065

RESUMO

We give a Large Deviation Principle (LDP) with explicit rate function for the distribution of vertex degrees in plane trees, a combinatorial model of RNA secondary structures. We calculate the typical degree distributions based on nearest neighbor free energies, and compare our results with the branching configurations found in two sets of large RNA secondary structures. We find substantial agreement overall, with some interesting deviations which merit further study.


Assuntos
Modelos Moleculares , Conformação de Ácido Nucleico , RNA Ribossômico 23S/ultraestrutura , RNA Viral/ultraestrutura , Interpretação Estatística de Dados , Árvores de Decisões , Redes Neurais de Computação , Picornaviridae/genética , Probabilidade , Termodinâmica
10.
J Biol Phys ; 39(2): 163-72, 2013 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-23860866

RESUMO

There are two important problems in the assembly of small, icosahedral RNA viruses. First, how does the capsid protein select the viral RNA for packaging, when there are so many other candidate RNA molecules available? Second, what is the mechanism of assembly? With regard to the first question, there are a number of cases where a particular RNA sequence or structure--often one or more stem-loops--either promotes assembly or is required for assembly, but there are others where specific packaging signals are apparently not required. With regard to the assembly pathway, in those cases where stem-loops are involved, the first step is generally believed to be binding of the capsid proteins to these "fingers" of the RNA secondary structure. In the mature virus, the core of the RNA would then occupy the center of the viral particle, and the stem-loops would reach outward, towards the capsid, like stalagmites reaching up from the floor of a grotto towards the ceiling. Those viruses whose assembly does not depend on protein binding to stem-loops could have a different structure, with the core of the RNA lying just under the capsid, and the fingers reaching down into the interior of the virus, like stalactites. We review the literature on these alternative structures, focusing on RNA selectivity and the assembly mechanism, and we propose experiments aimed at determining, in a given virus, which of the two structures actually occurs.


Assuntos
Genoma Viral , Vírus de RNA/genética , Levivirus/química , Levivirus/genética , Modelos Moleculares , Vírus de RNA/química , Vírus Satélite do Mosaico do Tabaco/química , Vírus Satélite do Mosaico do Tabaco/genética
11.
BMC Res Notes ; 5: 341, 2012 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-22747589

RESUMO

BACKGROUND: Accurate and efficient RNA secondary structure prediction remains an important open problem in computational molecular biology. Historically, advances in computing technology have enabled faster and more accurate RNA secondary structure predictions. Previous parallelized prediction programs achieved significant improvements in runtime, but their implementations were not portable from niche high-performance computers or easily accessible to most RNA researchers. With the increasing prevalence of multi-core desktop machines, a new parallel prediction program is needed to take full advantage of today's computing technology. FINDINGS: We present here the first implementation of RNA secondary structure prediction by thermodynamic optimization for modern multi-core computers. We show that GTfold predicts secondary structure in less time than UNAfold and RNAfold, without sacrificing accuracy, on machines with four or more cores. CONCLUSIONS: GTfold supports advances in RNA structural biology by reducing the timescales for secondary structure prediction. The difference will be particularly valuable to researchers working with lengthy RNA sequences, such as RNA viral genomes.


Assuntos
Algoritmos , Biologia Computacional/métodos , RNA/química , Software , Biologia Computacional/instrumentação , Conformação de Ácido Nucleico , Análise de Sequência de RNA , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA