Búsqueda | Portal Regional de la BVS

1.

Barrett, Christopher; Bura, Andrei C; He, Qijun; Huang, Fenix W; Li, Thomas J X; Reidys, Christian M.

RNA ; 30(1): 1-15, 2023 Dec 18.

Artículo en Inglés | MEDLINE | ID: mdl-37903545

RESUMEN

We present a novel framework enhancing the prediction of whether novel lineage poses the threat of eventually dominating the viral population. The framework is based purely on genomic sequence data, without requiring prior established biological analysis. Its building blocks are sets of coevolving sites in the alignment (motifs), identified via coevolutionary signals. The collection of such motifs forms a relational structure over the polymorphic sites. Motifs are constructed using distances quantifying the coevolutionary coupling of pairs and manifest as coevolving clusters of sites. We present an approach to genomic surveillance based on this notion of relational structure. Our system will issue an alert regarding a lineage, based on its contribution to drastic changes in the relational structure. We then conduct a comprehensive retrospective analysis of the COVID-19 pandemic based on SARS-CoV-2 genomic sequence data in GISAID from October 2020 to September 2022, across 21 lineages and 27 countries with weekly resolution. We investigate the performance of this surveillance system in terms of its accuracy, timeliness, and robustness. Lastly, we study how well each lineage is classified by such a system.

Asunto(s)

COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , COVID-19/genética , Pandemias , Estudios Retrospectivos , Genómica

2.

The energy-spectrum of bicompatible sequences.

Huang, Fenix W; Barrett, Christopher L; Reidys, Christian M.

Algorithms Mol Biol ; 16(1): 7, 2021 Jun 01.

Artículo en Inglés | MEDLINE | ID: mdl-34074304

RESUMEN

BACKGROUND: Genotype-phenotype maps provide a meaningful filtration of sequence space and RNA secondary structures are particular such phenotypes. Compatible sequences, which satisfy the base-pairing constraints of a given RNA structure, play an important role in the context of neutral evolution. Sequences that are simultaneously compatible with two given structures (bicompatible sequences), are beacons in phenotypic transitions, induced by erroneously replicating populations of RNA sequences. RNA riboswitches, which are capable of expressing two distinct secondary structures without changing the underlying sequence, are one example of bicompatible sequences in living organisms. RESULTS: We present a full loop energy model Boltzmann sampler of bicompatible sequences for pairs of structures. The sequence sampler employs a dynamic programming routine whose time complexity is polynomial when assuming the maximum number of exposed vertices, [Formula: see text], is a constant. The parameter [Formula: see text] depends on the two structures and can be very large. We introduce a novel topological framework encapsulating the relations between loops that sheds light on the understanding of [Formula: see text]. Based on this framework, we give an algorithm to sample sequences with minimum [Formula: see text] on a particular topologically classified case as well as giving hints to the solution in the other cases. As a result, we utilize our sequence sampler to study some established riboswitches. CONCLUSION: Our analysis of riboswitch sequences shows that a pair of structures needs to satisfy key properties in order to facilitate phenotypic transitions and that pairs of random structures are unlikely to do so. Our analysis observes a distinct signature of riboswitch sequences, suggesting a new criterion for identifying native sequences and sequences subjected to evolutionary pressure. Our free software is available at: https://github.com/FenixHuang667/Bifold .

3.

Snord116 Post-transcriptionally Increases Nhlh2 mRNA Stability: Implications for Human Prader-Willi Syndrome.

Kocher, Matthew A; Huang, Fenix W; Le, Erin; Good, Deborah J.

Hum Mol Genet ; 30(12): 1101-1110, 2021 06 09.

Artículo en Inglés | MEDLINE | ID: mdl-33856031

RESUMEN

The smallest genomic region causing Prader-Willi Syndrome (PWS) deletes the non-coding RNA SNORD116 cluster; however, the function of SNORD116 remains a mystery. Previous work in the field revealed the tantalizing possibility that expression of NHLH2, a gene previously implicated in both obesity and hypogonadism, was downregulated in PWS patients and differentiated stem cells. In silico RNA: RNA modeling identified several potential interaction domains between SNORD116 and NHLH2 mRNA. One of these interaction domains was highly conserved in most vertebrate NHLH2 mRNAs examined. A construct containing the Nhlh2 mRNA, including its 3'-UTR, linked to a c-myc tag was transfected into a hypothalamic neuron cell line in the presence and absence of exogenously-expressed Snord116. Nhlh2 mRNA expression was upregulated in the presence of Snord116 dependent on the length and type of 3'UTR used on the construct. Furthermore, use of actinomycin D to stop new transcription in N29/2 cells demonstrated that the upregulation occurred through increased stability of the Nhlh2 mRNA in the 45 minutes immediately following transcription. In silico modeling also revealed that a single nucleotide variant (SNV) in the NHLH2 mRNA could reduce the predicted interaction strength of the NHLH2:SNORD116 diad. Indeed, use of an Nhlh2 mRNA construct containing this SNV significantly reduces the ability of Snord116 to increase Nhlh2 mRNA levels. For the first time, these data identify a motif and mechanism for SNORD116-mediated regulation of NHLH2, clarifying the mechanism by which deletion of the SNORD116 snoRNAs locus leads to PWS phenotypes.

Asunto(s)

Factores de Transcripción con Motivo Hélice-Asa-Hélice Básico/genética , Síndrome de Prader-Willi/genética , Proteínas Proto-Oncogénicas c-myc/genética , ARN Nucleolar Pequeño/genética , Animales , Regulación del Desarrollo de la Expresión Génica , Humanos , Hipotálamo/metabolismo , Hipotálamo/patología , Ratones , Neuronas/metabolismo , Neuronas/patología , Síndrome de Prader-Willi/metabolismo , Síndrome de Prader-Willi/patología , Procesamiento Postranscripcional del ARN/genética , Estabilidad del ARN/genética

4.

Multiscale Feedback Loops in SARS-CoV-2 Viral Evolution.

Barrett, Christopher; Bura, Andrei C; He, Qijun; Huang, Fenix W; Li, Thomas J X; Waterman, Michael S; Reidys, Christian M.

J Comput Biol ; 28(3): 248-256, 2021 03.

Artículo en Inglés | MEDLINE | ID: mdl-33275493

RESUMEN

COVID-19 is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The viral genome is considered to be relatively stable and the mutations that have been observed and reported thus far are mainly focused on the coding region. This article provides evidence that macrolevel pandemic dynamics, such as social distancing, modulate the genomic evolution of SARS-CoV-2. This view complements the prevalent paradigm that microlevel observables control macrolevel parameters such as death rates and infection patterns. First, we observe differences in mutational signals for geospatially separated populations such as the prevalence of A23404G in CA versus NY and WA. We show that the feedback between macrolevel dynamics and the viral population can be captured employing a transfer entropy framework. Second, we observe complex interactions within mutational clades. Namely, when C14408T first appeared in the viral population, the frequency of A23404G spiked in the subsequent week. Third, we identify a noncoding mutation, G29540A, within the segment between the coding gene of the N protein and the ORF10 gene, which is largely confined to NY (>95%). These observations indicate that macrolevel sociobehavioral measures have an impact on the viral genomics and may be useful for the dashboard-like tracking of its evolution. Finally, despite the fact that SARS-CoV-2 is a genetically robust organism, our findings suggest that we are dealing with a high degree of adaptability. Owing to its ample spread, mutations of unusual form are observed and a high complexity of mutational interaction is exhibited.

Asunto(s)

COVID-19/virología , Evolución Molecular , Genoma Viral , SARS-CoV-2/genética , COVID-19/epidemiología , COVID-19/transmisión , Biología Computacional , Frecuencia de los Genes , Conductas Relacionadas con la Salud , Política de Salud , Humanos , Modelos Genéticos , Mutación , Pandemias , Filogenia , Distanciamiento Físico , SARS-CoV-2/patogenicidad , SARS-CoV-2/fisiología , Glicoproteína de la Espiga del Coronavirus/genética

5.

Genetic robustness of let-7 miRNA sequence-structure pairs.

He, Qijun; Huang, Fenix W; Barrett, Christopher; Reidys, Christian M.

RNA ; 25(12): 1592-1603, 2019 12.

Artículo en Inglés | MEDLINE | ID: mdl-31548338

RESUMEN

Genetic robustness, the preservation of evolved phenotypes against genotypic mutations, is one of the central concepts in evolution. In recent years a large body of work has focused on the origins, mechanisms, and consequences of robustness in a wide range of biological systems. In particular, research on ncRNAs studied the ability of sequences to maintain folded structures against single-point mutations. In these studies, the structure is merely a reference. However, recent work revealed evidence that structure itself contributes to the genetic robustness of ncRNAs. We follow this line of thought and consider sequence-structure pairs as the unit of evolution and introduce the spectrum of extended mutational robustness (EMR spectrum) as a measurement of genetic robustness. Our analysis of the miRNA let-7 family captures key features of structure-modulated evolution and facilitates the study of robustness against multiple-point mutations.

Asunto(s)

MicroARNs/genética , Mutación/genética , Animales , Evolución Molecular , Genotipo , Humanos , Modelos Genéticos , Conformación de Ácido Nucleico , Fenotipo

6.

A Boltzmann Sampler for 1-Pairs with Double Filtration.

Barrett, Christopher; He, Qijun; Huang, Fenix W; Reidys, Christian M.

J Comput Biol ; 26(3): 173-192, 2019 03.

Artículo en Inglés | MEDLINE | ID: mdl-30653353

RESUMEN

Recently, a framework considering RNA sequences and their RNA secondary structures as pairs led to some information-theoretic perspectives on how the semantics encoded in RNA sequences can be inferred. This pairing arises naturally from the energy model of RNA secondary structures. Fixing the sequence in the pairing produces the RNA energy landscape, whose partition function was discovered by McCaskill. Dually, fixing the structure induces the energy landscape of sequences. The latter has been considered originally for designing more efficient inverse folding algorithms and subsequently enhanced by facilitating the sampling of sequences. We present here a partition function of sequence/structure pairs, with endowed Hamming distance and base pair distance filtration. This partition function is an augmentation of the previous mentioned (dual) partition function. We develop an efficient dynamic programming routine to recursively compute the partition function with this double filtration. Our framework is capable of dealing with RNA secondary structures as well as 1-structures, where a 1-structure is an RNA pseudoknot structure consisting of "building blocks" of genus 0 or 1. In particular, 0-structures, consisting of only "building blocks" of genus 0, are exactly RNA secondary structures. The time complexity for calculating the partition function of 1-pairs, that is, sequence/structure pairs where the structures are 1-structures, is O(h3b3n6), where h, b, n denote the Hamming distance, base pair distance, and sequence length, respectively. The time complexity for the partition function of 0-pairs is O(h2b2n3).

Asunto(s)

Algoritmos , Pliegue del ARN , ARN/química , Análisis de Secuencia de ARN/métodos , Simulación de Dinámica Molecular , Motivos de Nucleótidos

7.

An Efficient Dual Sampling Algorithm with Hamming Distance Filtration.

Barrett, Christopher; He, Qijun; Huang, Fenix W; Reidys, Christian M.

J Comput Biol ; 25(11): 1179-1192, 2018 11.

Artículo en Inglés | MEDLINE | ID: mdl-30133328

RESUMEN

Recently, a framework considering ribonucleic acid (RNA) sequences and their RNA secondary structures as pairs has led to new information theoretic perspectives on how the semantics encoded in RNA sequences can be inferred. In this context, the pairing arises naturally from the energy model of RNA secondary structures. Fixing the sequence in the pairing produces the RNA energy landscape, whose partition function was discovered by McCaskill. Dually, fixing the structure induces the energy landscape of sequences. The latter has been considered for designing more efficient inverse folding algorithms. In this work, we present the dual partition function filtered by Hamming distance, together with a Boltzmann sampler using novel dynamic programming routines for the loop-based energy model. The time complexity of the algorithm is [Formula: see text], where [Formula: see text] are Hamming distance and sequence length, respectively, reducing the time complexity of samplers, reported in the literature by [Formula: see text]. We then present two applications, the first in the context of the evolution of natural sequence-structure pairs of microRNAs and the second in constructing neutral paths. The former studies the inverse folding rate (IFR) of sequence-structure pairs, filtered by Hamming distance, observing that such pairs evolve toward higher levels of robustness, that is, increasing IFR. The latter is an algorithm that constructs neutral paths: given two sequences in a neutral network, we employ the sampler to construct short paths connecting them, consisting of sequences all contained in the neutral network.

Asunto(s)

Algoritmos , Biología Computacional/métodos , ARN/química , Secuencia de Bases , Humanos , Modelos Moleculares , Conformación de Ácido Nucleico

8.

Sequence-structure relations of biopolymers.

Barrett, Christopher; Huang, Fenix W; Reidys, Christian M.

Bioinformatics ; 33(3): 382-389, 2017 02 01.

Artículo en Inglés | MEDLINE | ID: mdl-28171628

RESUMEN

Motivation: DNA data is transcribed into single-stranded RNA, which folds into specific molecular structures. In this paper we pose the question to what extent sequence- and structure-information correlate. We view this correlation as structural semantics of sequence data that allows for a different interpretation than conventional sequence alignment. Structural semantics could enable us to identify more general embedded 'patterns' in DNA and RNA sequences. Results: We compute the partition function of sequences with respect to a fixed structure and connect this computation to the mutual information of a sequencestructure pair for RNA secondary structures. We present a Boltzmann sampler and obtain the a priori probability of specific sequence patterns. We present a detailed analysis for the three PDB-structures, 2JXV (hairpin), 2N3R (3-branch multi-loop) and 1EHZ (tRNA). We localize specific sequence patterns, contrast the energy spectrum of the Boltzmann sampled sequences versus those sequences that refold into the same structure and derive a criterion to identify native structures. We illustrate that there are multiple sequences in the partition function of a fixed structure, each having nearly the same mutual information, that are nevertheless poorly aligned. This indicates the possibility of the existence of relevant patterns embedded in the sequences that are not discoverable using alignments. Availability and Implementation: The source code is freely available at http://staff.vbi.vt.edu/fenixh/Sampler.zip Contact: duckcr@vbi.vt.edu Supplimentary Information: Supplementary data are available at Bioinformatics online.

Asunto(s)

Biología Computacional/métodos , Conformación de Ácido Nucleico , ARN/química , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Algoritmos , Probabilidad , ARN/metabolismo

9.

Topological language for RNA.

Huang, Fenix W D; Reidys, Christian M.

Math Biosci ; 282: 109-120, 2016 12.

Artículo en Inglés | MEDLINE | ID: mdl-27773681

RESUMEN

In this paper we introduce a novel, context-free grammar, RNAFeatures*, capable of generating any RNA structure including pseudoknot structures (pk-structure). We represent pk-structures as orientable fatgraphs, which naturally leads to a filtration by their topological genus. Within this framework, RNA secondary structures correspond to pk-structures of genus zero. RNAFeatures* acts on formal, arc-labeled RNA secondary structures, called λ-structures. λ-structures correspond one-to-one to pk-structures together with some additional information. This information consists of the specific rearrangement of the backbone, by which a pk-structure can be made cross-free. RNAFeatures* is an extension of the grammar for secondary structures and employs an enhancement by labelings of the symbols as well as the production rules. We discuss how to use RNAFeatures* to obtain a stochastic context-free grammar for pk-structures, using data of RNA sequences and structures. The induced grammar facilitates fast Boltzmann sampling and statistical analysis. As a first application, we present an O(nlog (n)) runtime algorithm which samples pk-structures based on ninety tRNA sequences and structures from the Nucleic Acid Database (NDB). AVAILABILITY: the source code for simulation results is available at http://staff.vbi.vt.edu/fenixh/TPstructure.zip. The code is written in C and compiled by Xcode.

Asunto(s)

Modelos Teóricos , ARN/química

10.

Shapes of topological RNA structures.

Huang, Fenix W D; Reidys, Christian M.

Math Biosci ; 270(Pt A): 57-65, 2015 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-26482318

RESUMEN

A topological RNA structure is derived by fattening the edges of a contact structure into ribbons. The shape of a topological RNA structure is obtained by collapsing the stacks of the structure into single arcs and by removing any arcs of length one, as well as isolated vertices. A shape contains the key topological information of the molecular conformation and for fixed topological genus there exist only finitely many such shapes. In this paper we compute the generating polynomial of shapes of fixed topological genus g. We furthermore derive an algorithm having O(glog g) time complexity uniformly generating shapes of genus g and discuss some applications in the context of databases of RNA pseudoknot structures.

Asunto(s)

Conformación de Ácido Nucleico , ARN/química , Algoritmos , Bases de Datos de Ácidos Nucleicos , Conceptos Matemáticos , Modelos Moleculares

11.

Generation of RNA pseudoknot structures with topological genus filtration.

Huang, Fenix W D; Nebel, Markus E; Reidys, Christian M.

Math Biosci ; 245(2): 216-25, 2013 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-23900061

RESUMEN

In this paper we present a sampling framework for RNA structures of fixed topological genus. We introduce a novel, linear time, uniform sampling algorithm for RNA structures of fixed topological genus g, for arbitrary g>0. Furthermore we develop a linear time sampling algorithm for RNA structures of fixed topological genus g that are weighted by a simplified, loop-based energy functional. For this process the partition function of the energy functional has to be computed once, which has O(n(2)) time complexity.

Asunto(s)

Conformación de Ácido Nucleico , ARN/química , Algoritmos , Biología Computacional , Conceptos Matemáticos , Modelos Moleculares

12.

Topology of RNA-RNA interaction structures.

Andersen, Jørgen E; Huang, Fenix W D; Penner, Robert C; Reidys, Christian M.

J Comput Biol ; 19(7): 928-43, 2012 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-22731621

RESUMEN

The topological filtration of interacting RNA complexes is studied, and the role is analyzed of certain diagrams called irreducible shadows, which form suitable building blocks for more general structures. We prove that, for two interacting RNAs, called interaction structures, there exist for fixed genus only finitely many irreducible shadows. This implies that, for fixed genus, there are only finitely many classes of interaction structures. In particular, the simplest case of genus zero already provides the formalism for certain types of structures that occur in nature and are not covered by other filtrations. This case of genus zero interaction structures is already of practical interest, is studied here in detail, and is found to be expressed by a multiple context-free grammar that extends the usual one for RNA secondary structures. We show that, in O(n(6)) time and O(n(4)) space complexity, this grammar for genus zero interaction structures provides not only minimum free energy solutions but also the complete partition function and base pairing probabilities.

Asunto(s)

Algoritmos , Conformación de Ácido Nucleico , ARN/química , Modelos Teóricos , Termodinámica

13.

Addendum: topology and prediction of RNA pseudoknots.

Reidys, Christian M; Huang, Fenix W D; Andersen, Jørgen E; Penner, Robert C; Stadler, Peter F; Nebel, Markus E.

Bioinformatics ; 28(2): 300, 2012 Jan 15.

Artículo en Inglés | MEDLINE | ID: mdl-22106334

Asunto(s)

Pliegue del ARN , ARN/química , Programas Informáticos , Algoritmos , Conformación de Ácido Nucleico , Análisis de Secuencia de ARN

14.

Topology and prediction of RNA pseudoknots.

Reidys, Christian M; Huang, Fenix W D; Andersen, Jørgen E; Penner, Robert C; Stadler, Peter F; Nebel, Markus E.

Bioinformatics ; 27(8): 1076-85, 2011 Apr 15.

Artículo en Inglés | MEDLINE | ID: mdl-21335320

RESUMEN

MOTIVATION: Several dynamic programming algorithms for predicting RNA structures with pseudoknots have been proposed that differ dramatically from one another in the classes of structures considered. RESULTS: Here, we use the natural topological classification of RNA structures in terms of irreducible components that are embeddable in the surfaces of fixed genus. We add to the conventional secondary structures four building blocks of genus one in order to construct certain structures of arbitrarily high genus. A corresponding unambiguous multiple context-free grammar provides an efficient dynamic programming approach for energy minimization, partition function and stochastic sampling. It admits a topology-dependent parametrization of pseudoknot penalties that increases the sensitivity and positive predictive value of predicted base pairs by 10-20% compared with earlier approaches. More general models based on building blocks of higher genus are also discussed. AVAILABILITY: The source code of gfold is freely available at http://www.combinatorics.cn/cbpc/gfold.tar.gz. CONTACT: duck@santafe.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

ARN/química , Algoritmos , Emparejamiento Base , Conformación de Ácido Nucleico , ARN/clasificación , Análisis de Secuencia de ARN , Programas Informáticos

15.

Target prediction and a statistical sampling algorithm for RNA-RNA interaction.

Huang, Fenix W D; Qin, Jing; Reidys, Christian M; Stadler, Peter F.

Bioinformatics ; 26(2): 175-81, 2010 Jan 15.

Artículo en Inglés | MEDLINE | ID: mdl-19910305

RESUMEN

MOTIVATION: It has been proven that the accessibility of the target sites has a critical influence on RNA-RNA binding, in general and the specificity and efficiency of miRNAs and siRNAs, in particular. Recently, O(N(6)) time and O(N(4)) space dynamic programming (DP) algorithms have become available that compute the partition function of RNA-RNA interaction complexes, thereby providing detailed insights into their thermodynamic properties. RESULTS: Modifications to the grammars underlying earlier approaches enables the calculation of interaction probabilities for any given interval on the target RNA. The computation of the 'hybrid probabilities' is complemented by a stochastic sampling algorithm that produces a Boltzmann weighted ensemble of RNA-RNA interaction structures. The sampling of k structures requires only negligible additional memory resources and runs in O(k.N(3)). AVAILABILITY: The algorithms described here are implemented in C as part of the rip package. The source code of rip2 can be downloaded from http://www.combinatorics.cn/cbpc/rip.html and http://www.bioinf.uni-leipzig.de/Software/rip.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Algoritmos , Modelos Estadísticos , ARN/química , Sitios de Unión , Biología Computacional/métodos , Bases de Datos Genéticas , MicroARNs/química , MicroARNs/metabolismo , Modelos Moleculares , Conformación de Ácido Nucleico , ARN/metabolismo , ARN Interferente Pequeño/química , ARN Interferente Pequeño/metabolismo , Programas Informáticos

16.

Folding 3-noncrossing RNA pseudoknot structures.

Huang, Fenix W D; Peng, Wade W J; Reidys, Christian M.

J Comput Biol ; 16(11): 1549-75, 2009 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-19958083

RESUMEN

In this article, we present the novel ab initio folding algorithm cross, which generates minimum free energy (mfe), 3-noncrossing, canonical RNA structures. Here an RNA structure is 3-noncrossing if it does not contain three or more mutually crossing arcs and canonical, if each of its stacks has size greater or equal than two. Our notion of mfe-structure is based on a specific concept of pseudoknots and respective loop-based energy parameters. The algorithm decomposes into three subroutines: first the inductive construction of motifs and their associated shadows, second the generation of the (rooted) skeleta-trees and third the saturation of the skeleta via context dependent dynamic programming routines.

Asunto(s)

Conformación de Ácido Nucleico , ARN/química , Algoritmos , ARN de Transferencia/química , Secuencias Reguladoras de Ácido Ribonucleico/genética

17.

Partition function and base pairing probabilities for RNA-RNA interaction prediction.

Huang, Fenix W D; Qin, Jing; Reidys, Christian M; Stadler, Peter F.

Bioinformatics ; 25(20): 2646-54, 2009 Oct 15.

Artículo en Inglés | MEDLINE | ID: mdl-19671692

RESUMEN

MOTIVATION: The RNA-RNA interaction problem (RIP) consists in finding the energetically optimal structure of two RNA molecules that bind to each other. The standard model allows secondary structures in both partners as well as additional base pairs between the two RNAs subject to certain restrictions that ensure that RIP is solvabale by a polynomial time dynamic programming algorithm. RNA-RNA binding, like RNA folding, is typically not dominated by the ground state structure. Instead, a large ensemble of alternative structures contributes to the interaction thermodynamics. RESULTS: We present here an O(N(6)) time and O(N(4)) dynamics programming algorithm for computing the full partition function for RIP which is based on the combinatorial notion of 'tight structures'. Albeit equivalent to recent work by H. Chitsaz and collaborators, our approach in addition provides a full-fledged computation of the base pairing probabilities, which relies on the notion of a decomposition tree for joint structures. In practise, our implementation is efficient enough to investigate, for instance, the interactions of small bacterial RNAs and their target mRNAs. AVAILABILITY: The program rip is implemented in C. The source code is available for download from http://www.combinatorics.cn/cbpc/rip.html and http://www.bioinf.uni-leipzig.de/Software/rip.html.

Asunto(s)

Emparejamiento Base , Biología Computacional/métodos , ARN/química , Algoritmos , Bases de Datos Genéticas , Conformación de Ácido Nucleico , ARN/metabolismo , Termodinámica

18.

Sequence-structure relations of pseudoknot RNA.

Huang, Fenix W D; Li, Linda Y M; Reidys, Christian M.

BMC Bioinformatics ; 10 Suppl 1: S39, 2009 Jan 30.

Artículo en Inglés | MEDLINE | ID: mdl-19208140

RESUMEN

BACKGROUND: The analysis of sequence-structure relations of RNA is based on a specific notion and folding of RNA structure. The notion of coarse grained structure employed here is that of canonical RNA pseudoknot contact-structures with at most two mutually crossing bonds (3-noncrossing). These structures are folded by a novel, ab initio prediction algorithm, cross, capable of searching all 3-noncrossing RNA structures. The algorithm outputs the minimum free energy structure. RESULTS: After giving some background on RNA pseudoknot structures and providing an outline of the folding algorithm being employed, we present in this paper various, statistical results on the mapping from RNA sequences into 3-noncrossing RNA pseudoknot structures. We study properties, like the fraction of pseudoknot structures, the dominant pseudoknot-shapes, neutral walks, neutral neighbors and local connectivity. We then put our results into context of molecular evolution of RNA. CONCLUSION: Our results imply that, in analogy to RNA secondary structures, 3-noncrossing pseudoknot RNA represents a molecular phenotype that is well suited for molecular and in particular neutral evolution. We can conclude that extended, percolating neutral networks of pseudoknot RNA exist.

Asunto(s)

Algoritmos , ARN/química , Análisis de Secuencia de ARN/métodos , Conformación de Ácido Nucleico , Fenotipo , ARN/genética , Termodinámica

19.

Statistics of canonical RNA pseudoknot structures.

Huang, Fenix W D; Reidys, Christian M.

J Theor Biol ; 253(3): 570-8, 2008 Aug 07.

Artículo en Inglés | MEDLINE | ID: mdl-18511081

RESUMEN

In this paper we study canonical RNA pseudoknot structures. We prove central limit theorems for the distributions of the arc-numbers of k-noncrossing RNA structures with given minimum stack-size tau over n nucleotides. Furthermore we compare the space of all canonical structures with canonical minimum free energy pseudoknot structures. Our results generalize the analysis of Schuster et al. obtained for RNA secondary structures [Hofacker, I.L., Schuster, P., Stadler, P.F., 1998. Combinatorics of RNA secondary structures. Discrete Appl. Math. 88, 207-237; Jin, E.Y., Reidys, C.M., 2007b. Central and local limit theorems for RNA structures. J. Theor. Biol. 250 (2008), 547-559; 2007a. Asymptotic enumeration of RNA structures with pseudoknots. Bull. Math. Biol., 70 (4), 951-970] to k-noncrossing RNA structures. Here k2 and tau are arbitrary natural numbers. We compare canonical pseudoknot structures to arbitrary structures and show that canonical pseudoknot structures exhibit significantly smaller exponential growth rates. We then compute the asymptotic distribution of their arc-numbers. Finally, we analyze how the minimum stack-size and crossing number factor into the distributions.

Asunto(s)

Modelos Genéticos , Conformación de Ácido Nucleico , ARN/genética , Algoritmos , Animales , Modelos Moleculares

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA