ABSTRACT
The structure of an RNA, and even more so its interactions with other RNAs, provide valuable information about its function. Secondary structure-based tools for RNA-RNA interaction predictions provide a quick way to identify possible interaction targets and structures. However, these tools ignore the effect of steric hindrance on the tertiary (3D) structure level, and do not consider whether a suitable folding pathway exists to form the interaction. As a consequence, these tools often predict interactions that are unrealistically long and could be formed (in three dimensions) only by going through highly entangled intermediates. Here, we present a computational pipeline to assess whether a proposed secondary (2D) structure interaction is sterically feasible and reachable along a plausible folding pathway. To this end, we simulate the folding of a series of 3D structures along a given 2D folding path. To avoid the complexity of large-scale atomic resolution simulations, our pipeline uses coarse-grained 3D modeling and breaks up the folding path into small steps, each corresponding to the extension of the interaction by 1 or 2 bp. We apply our pipeline to analyze RNA-RNA interaction formation for three selected RNA-RNA complexes. We find that kissing hairpins, in contrast to interactions in the exterior loop, are difficult to extend and tend to get stuck at an interaction length of 6 bp. Our tool, including source code, documentation, and sample data, is available at www.github.com/irenekb/RRI-3D.
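The stepwise decomposition of a folding path described above can be illustrated with a minimal Python sketch. It assumes the interaction is given as an ordered list of intermolecular base pairs that grows from a short seed; the function name `extension_steps`, the seed size, and the tuple encoding are illustrative assumptions and are not taken from the RRI-3D code.

```python
from typing import List, Tuple

# Hypothetical encoding: (position in RNA 1, position in RNA 2)
BasePair = Tuple[int, int]


def extension_steps(pairs: List[BasePair], seed: int = 2,
                    max_step: int = 2) -> List[List[BasePair]]:
    """Split a full interaction into a series of growing intermediates.

    Starts from the first `seed` base pairs and extends by at most
    `max_step` base pairs per step until the full interaction is reached.
    """
    if seed < 1 or max_step < 1:
        raise ValueError("seed and max_step must be positive")
    intermediates = []
    length = min(seed, len(pairs))
    while True:
        intermediates.append(pairs[:length])
        if length >= len(pairs):
            break
        length = min(length + max_step, len(pairs))
    return intermediates


if __name__ == "__main__":
    # Toy 7 bp antiparallel interaction: position i pairs with 20 - i.
    interaction = [(i, 20 - i) for i in range(1, 8)]
    for step in extension_steps(interaction):
        print(len(step), "bp, last pair", step[-1])
```

In a pipeline of this kind, each intermediate would be handed to a coarse-grained 3D modelling step; the sketch only covers the bookkeeping of the 2D path.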
Subjects
RNA Folding , RNA , RNA/chemistry , Nucleic Acid Conformation , Feasibility Studies , Software
ABSTRACT
tRNA genes exist in multiple copies in the genome of all organisms across the three domains of life. Besides the sequence differences across tRNA copies, extensive post-transcriptional modification adds a further layer to tRNA diversification. Whilst the crucial role of tRNAs as adapter molecules in protein translation is well established, whether all tRNAs are actually expressed, and whether the differences across isodecoders play any regulatory role is only recently being uncovered. Here we built upon recent developments in the use of NGS-based methods for RNA modification detection and developed tRAM-seq, an experimental protocol and in silico analysis pipeline to investigate tRNA expression and modification. Using tRAM-seq, we analysed the full ensemble of nucleo-cytoplasmic and mitochondrial tRNAs during embryonic development of the model vertebrate zebrafish. We show that the repertoire of tRNAs changes during development, with an apparent major switch in tRNA isodecoder expression and modification profile taking place around the start of gastrulation. Taken together, our findings suggest the existence of a general reprogramming of the expressed tRNA pool, possibly gearing the translational machinery for distinct stages of the delicate and crucial process of embryo development.
Subjects
Embryonic Development , Transfer RNA , Zebrafish , Animals , Zebrafish/genetics , Zebrafish/embryology , Transfer RNA/metabolism , Transfer RNA/genetics , Embryonic Development/genetics , Post-Transcriptional RNA Processing , Developmental Gene Expression Regulation , Nonmammalian Embryo/metabolism , Mitochondrial RNA/genetics , Mitochondrial RNA/metabolism
ABSTRACT
The determination of the three-dimensional structure of large RNA macromolecules in solution is a challenging task that often requires the use of several experimental and computational techniques. Small-angle X-ray scattering (SAXS) can provide insight into some geometrical properties of the probed molecule, but these data must be properly interpreted in order to generate a three-dimensional model. Here, we propose a multiscale pipeline which introduces SAXS data into modelling the global shape of RNA in solution, which can be hierarchically refined until reaching atomistic precision in explicit solvent. The low-resolution helix model (Ernwin) deals with the exploration of the huge conformational space making use of the SAXS data, while a nucleotide-level model (SPQR) removes clashes and disentangles the proposed structures, leading the structure to an all-atom representation in explicit water. We apply the procedure to four different known PDB structures of up to 159 nucleotides with promising results. Additionally, we predict an all-atom structure for the Plasmodium falciparum signal recognition particle ALU RNA based on SAXS data deposited in the SASBDB, which has an alternate conformation and better fit to the SAXS data than the previously published structure based on the same data but other modelling methods.
Subjects
Molecular Models , Nucleic Acid Conformation , RNA , Small Angle Scattering , X-Ray Diffraction , X-Ray Diffraction/methods , RNA/chemistry
ABSTRACT
Numerous viruses, notably flaviviruses, rely on essential long-range RNA-RNA genome interactions. Using Japanese encephalitis virus (JEV) as a model system, we computationally predicted and then biophysically validated and characterized its long-range RNA-RNA genomic interaction. Using multiple RNA computation assessment programs, we determine the primary RNA-RNA interacting site among JEV isolates and numerous related viruses. Following in vitro transcription of RNA, we provide, for the first time, characterization of an RNA-RNA interaction using size-exclusion chromatography coupled with multi-angle light scattering and analytical ultracentrifugation. Next, we demonstrate that the 5' and 3' terminal regions of JEV interact with nM affinity using microscale thermophoresis, and this affinity is significantly reduced when the conserved cyclization sequence is not present. Furthermore, we perform computational kinetic analyses validating the cyclization sequence as the primary driver of this RNA-RNA interaction. Finally, we examined the 3D structure of the interaction using small-angle X-ray scattering, revealing a flexible yet stable interaction. This pathway can be adapted and utilized to study various viral and human long non-coding RNA-RNA interactions and determine their binding affinities, a critical pharmacological property for designing potential therapeutics.
Subjects
Japanese Encephalitis Virus (Species) , Viral RNA , Humans , Viral RNA/chemistry , Long Noncoding RNA/chemistry
ABSTRACT
MOTIVATION: In living organisms, many RNA molecules are modified post-transcriptionally. This turns the widely known four-letter RNA alphabet ACGU into a much larger one with currently more than 300 known distinct modified bases. The roles for the majority of modified bases remain uncertain, but many are already well-known for their ability to influence the preferred structures that an RNA may adopt. In fact, tRNAs sometimes require certain modifications to fold into their cloverleaf-shaped structure. However, predicting the structure of RNAs with base modifications is still difficult due to the lack of efficient algorithms that can deal with the extended sequence alphabet, as well as missing parameter sets that account for the changes in stability induced by the modified bases. RESULTS: We present an approach to include sparse energy parameter data for modified bases into the ViennaRNA Package. Our method does not require any changes to the underlying efficient algorithms but instead uses a set of plug-in constraints that adapt the predictions in terms of loop evaluation at runtime. These adaptations are efficient in the sense that they are only performed for loops for which additional parameters are actually available. In addition, our approach also facilitates the inclusion of more modified bases as soon as further parameters become available. AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available at https://www.tbi.univie.ac.at/RNA.
Subjects
RNA , Software , Nucleic Acid Conformation , RNA/chemistry , Algorithms , RNA Folding
ABSTRACT
MOTIVATION: Folding during transcription can have an important influence on the structure and function of RNA molecules, as regions closer to the 5' end can fold into metastable structures before potentially stronger interactions with the 3' end become available. Thermodynamic RNA folding models are not suitable to predict structures that result from cotranscriptional folding, as they can only calculate properties of the equilibrium distribution. Other software packages that simulate the kinetic process of RNA folding during transcription exist, but they are mostly applicable for short sequences. RESULTS: We present a new algorithm that tracks changes to the RNA secondary structure ensemble during transcription. At every transcription step, new representative local minima are identified, a neighborhood relation is defined and transition rates are estimated for kinetic simulations. After every simulation, a part of the ensemble is removed and the remainder is used to search for new representative structures. The presented algorithm is deterministic (up to numeric instabilities of simulations), fast (in comparison with existing methods), and it is capable of folding RNAs much longer than 200 nucleotides. AVAILABILITY AND IMPLEMENTATION: This software is open-source and available at https://github.com/ViennaRNA/drtransformer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subjects
Heuristics , RNA Folding , Nucleic Acid Conformation , RNA/chemistry , Software , Algorithms
ABSTRACT
MOTIVATION: Predicting the folding dynamics of RNAs is a computationally difficult problem, first and foremost due to the combinatorial explosion of alternative structures in the folding space. Abstractions are therefore needed to simplify downstream analyses, and thus make them computationally tractable. This can be achieved by various structure sampling algorithms. However, current sampling methods are still time consuming and frequently fail to represent key elements of the folding space. METHOD: We introduce RNAxplorer, a novel adaptive sampling method to efficiently explore the structure space of RNAs. RNAxplorer uses dynamic programming to perform an efficient Boltzmann sampling in the presence of guiding potentials, which are accumulated into pseudo-energy terms and reflect similarity to already well-sampled structures. This way, we effectively steer sampling toward underrepresented or unexplored regions of the structure space. RESULTS: We developed and applied different measures to benchmark our sampling method against its competitors. Most of the measures show that RNAxplorer produces more diverse structure samples, yields rare conformations that may be inaccessible to other sampling methods and is better at finding the most relevant kinetic traps in the landscape. Thus, it produces a more representative coarse graining of the landscape, which is well suited to subsequently compute better approximations of RNA folding kinetics. AVAILABILITY AND IMPLEMENTATION: https://github.com/ViennaRNA/RNAxplorer/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
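The idea of steering Boltzmann sampling with accumulated pseudo-energies can be illustrated on a toy scale. The sketch below is not the RNAxplorer algorithm: it replaces dynamic programming with an explicit list of candidate structures, uses made-up energies, and penalizes each base pair by how often it has already been sampled. All names (`guided_weights`, `adaptive_sample`) and the `penalty` value are assumptions made for illustration.

```python
import math
import random
from typing import Dict, FrozenSet, List, Optional, Tuple

Pair = Tuple[int, int]
Structure = FrozenSet[Pair]  # a structure as a set of base pairs

RT = 0.61632  # kcal/mol at 37 degrees Celsius


def guided_weights(energies: Dict[Structure, float],
                   pair_counts: Dict[Pair, int],
                   penalty: float) -> Dict[Structure, float]:
    """Boltzmann weights after adding a pseudo-energy for well-sampled pairs."""
    weights = {}
    for structure, energy in energies.items():
        # Each already-sampled base pair raises the effective energy.
        pseudo = penalty * sum(pair_counts.get(p, 0) for p in structure)
        weights[structure] = math.exp(-(energy + pseudo) / RT)
    return weights


def adaptive_sample(energies: Dict[Structure, float], n_samples: int,
                    penalty: float = 0.5,
                    rng: Optional[random.Random] = None) -> List[Structure]:
    """Draw structures one by one, updating the guiding potential each time."""
    rng = rng or random.Random(0)
    structures = list(energies)
    pair_counts: Dict[Pair, int] = {}
    samples = []
    for _ in range(n_samples):
        w = guided_weights(energies, pair_counts, penalty)
        chosen = rng.choices(structures, weights=[w[s] for s in structures], k=1)[0]
        samples.append(chosen)
        for p in chosen:
            pair_counts[p] = pair_counts.get(p, 0) + 1
    return samples


if __name__ == "__main__":
    # Hypothetical energies (kcal/mol) for three toy structures.
    toy = {
        frozenset({(1, 10), (2, 9)}): -3.0,
        frozenset({(3, 8)}): -1.0,
        frozenset(): 0.0,
    }
    for s in adaptive_sample(toy, 10):
        print(sorted(s))
```

With `penalty=0` the loop reduces to plain Boltzmann sampling over the listed structures; a positive penalty progressively shifts probability toward structures whose base pairs have not yet been seen.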
ABSTRACT
The internal ribosome entry site (IRES) RNA of bovine viral diarrhoea virus (BVDV), an economically significant Pestivirus, is required for the cap-independent translation of viral genomic RNA. Thus, it is essential for viral replication and pathogenesis. We applied a combination of high-throughput biochemical RNA structure probing (SHAPE-MaP) and in silico modelling approaches to gain insight into the secondary and tertiary structures of BVDV IRES RNA. Our study demonstrated that BVDV IRES RNA in solution forms a modular architecture composed of three distinct structural domains (I-III). Two regions within domain III are represented in tertiary interactions to form an H-type pseudoknot. Computational modelling of the pseudoknot motif provided a fine-grained picture of the tertiary structure and local arrangement of helices in the BVDV IRES. Furthermore, comparative genomics and consensus structure predictions revealed that the pseudoknot is evolutionarily conserved among many Pestivirus species. These studies provide detailed insight into the structural arrangement of the BVDV IRES RNA H-type pseudoknot and encompassing motifs that likely contribute to the optimal functionality of the viral cap-independent translation element.
Subjects
Bovine Viral Diarrhea Virus , Internal Ribosome Entry Sites , Diarrhea , Bovine Viral Diarrhea Virus/genetics , Humans , Nucleic Acid Conformation , Viral RNA/chemistry , Viral RNA/genetics , Virus Replication
ABSTRACT
Cis-Natural Antisense Transcripts (cis-NATs), which overlap protein coding genes and are transcribed from the opposite DNA strand, constitute an important group of noncoding RNAs. Whereas several examples of cis-NATs regulating the expression of their cognate sense gene are known, most cis-NATs function by altering the steady-state level or structure of mRNA via changes in transcription, mRNA stability, or splicing, and very few cases involve the regulation of sense mRNA translation. This study was designed to systematically search for cis-NATs influencing cognate sense mRNA translation in Arabidopsis (Arabidopsis thaliana). Establishment of a pipeline relying on sequencing of total polyA+ and polysomal RNA from Arabidopsis grown under various conditions (i.e. nutrient deprivation and phytohormone treatments) allowed the identification of 14 cis-NATs whose expression correlated either positively or negatively with cognate sense mRNA translation. With use of a combination of cis-NAT stable over-expression in transgenic plants and transient expression in protoplasts, the impact of cis-NAT expression on mRNA translation was confirmed for 4 out of 5 tested cis-NAT:sense mRNA pairs. These results expand the number of cis-NATs known to regulate cognate sense mRNA translation and provide a foundation for future studies of their mode of action. Moreover, this study highlights the role of this class of noncoding RNAs in translation regulation.
Subjects
Arabidopsis/genetics , Protein Biosynthesis , Antisense RNA/genetics , Arabidopsis Proteins/genetics , DNA-Binding Proteins/genetics , Plant Gene Expression Regulation , Genetically Modified Plants , Messenger RNA/genetics , Plant RNA , Reproducibility of Results , RNA Sequence Analysis , Transcription Factors/genetics
ABSTRACT
Chemical modifications of RNA nucleotides change their identity and characteristics and thus alter genetic and structural information encoded in the genomic DNA. tRNA and rRNA are probably the most heavily modified genes, and often depend on derivatization or isomerization of their nucleobases in order to correctly fold into their functional structures. Recent RNomics studies, however, report transcriptome-wide RNA modification and suggest a more general regulation of structuredness of RNAs by this so-called epitranscriptome. Modification seems to require specific substrate structures, which in turn are stabilized or destabilized and thus promote or inhibit refolding events of regulatory RNA structures. In this review, we revisit RNA modifications and the related structures from a computational point of view. We discuss known substrate structures, their properties such as sub-motifs as well as consequences of modifications on base pairing patterns and possible refolding events. Given that efficient RNA structure prediction methods for canonical base pairs were established several decades ago, we review to what extent these methods allow the inclusion of modified nucleotides to model and study epitranscriptomic effects on RNA structures.
Subjects
Adenosine/metabolism , Inosine/metabolism , Post-Transcriptional RNA Processing , RNA Sequence Analysis/methods , Transcriptome , Animals , Base Pairing , Base Sequence , Humans , Methylation , MicroRNAs/genetics , MicroRNAs/metabolism , Nucleic Acid Conformation , Messenger RNA/genetics , Messenger RNA/metabolism , Ribosomal RNA/genetics , Ribosomal RNA/metabolism , Small Nuclear RNA/genetics , Small Nuclear RNA/metabolism , Transfer RNA/genetics , Transfer RNA/metabolism
ABSTRACT
Summary: A standard method for the identification of novel RNAs or proteins is homology search via probabilistic models. One approach relies on the definition of families, which can be encoded as covariance models (CMs) or Hidden Markov Models (HMMs). While being powerful tools, their complexity makes it tedious to investigate them in their (default) tabulated form. This specifically applies to the interpretation of comparisons between multiple models as in family clans. The Covariance model visualization tools (CMV) visualize CMs or HMMs to: I) Obtain an easily interpretable representation of HMMs and CMs; II) Put them in context with the structural sequence alignments they have been created from; III) Investigate results of model comparisons and highlight regions of interest. Availability and implementation: Source code (http://www.github.com/eggzilla/cmv), web-service (http://rna.informatik.uni-freiburg.de/CMVS). Supplementary information: Supplementary data are available at Bioinformatics online.
Subjects
Nucleic Acid Conformation , Protein Conformation , Sequence Alignment/methods , Protein Sequence Analysis/methods , RNA Sequence Analysis/methods , Software , RNA/chemistry , RNA/metabolism
ABSTRACT
Riboswitches form an abundant class of cis-regulatory RNA elements that mediate gene expression by binding a small metabolite. For synthetic biology applications, they are becoming cheap and accessible systems for selectively triggering transcription or translation of downstream genes. Many riboswitches are kinetically controlled, hence knowledge of their co-transcriptional mechanisms is essential. We present here an efficient implementation for analyzing co-transcriptional RNA-ligand interaction dynamics. This approach makes it possible, for the first time, to model concentration-dependent metabolite binding/unbinding kinetics. We exemplify this novel approach by means of the recently studied I-A 2'-deoxyguanosine (2'dG)-sensing riboswitch from Mesoplasma florum.
Subjects
Computational Biology/methods , Nucleic Acid Conformation , Bacterial RNA/genetics , Riboswitch/genetics , Genetic Transcription , Binding Sites/genetics , Entomoplasmataceae/genetics , Kinetics , Ligands , Biological Models , RNA Folding , Bacterial RNA/chemistry , Bacterial RNA/metabolism
ABSTRACT
This contribution sketches a workflow to design an RNA switch that is able to adopt two structural conformations in a ligand-dependent way. A well-characterized RNA aptamer, i.e., one with known Kd and adaptive structural features, is an essential ingredient of the described design process. We exemplify the principles using the well-known theophylline aptamer throughout this work. The aptamer in its ligand-binding competent structure represents one structural conformation of the switch while an alternative fold that disrupts the binding-competent structure forms the other conformation. To keep it simple we do not incorporate any regulatory mechanism to control transcription or translation. We elucidate a commonly used design process by explicitly dissecting and explaining the necessary steps in detail. We developed a novel objective function which specifies the mechanism of this simple, ligand-triggered riboswitch and describe an extensive in silico analysis pipeline to evaluate important kinetic properties of the designed sequences. This protocol and the developed software can be easily extended or adapted to fit novel design scenarios and thus can serve as a template for future needs.
Subjects
Nucleotide Aptamers/chemical synthesis , Computational Biology/methods , Nucleic Acid Conformation , Riboswitch/genetics , Nucleotide Aptamers/genetics , Computational Biology/instrumentation , Kinetics , Ligands , RNA Folding , Software
ABSTRACT
Intermolecular interactions of ncRNAs are at the core of gene regulation events, and identifying the full map of these interactions bears crucial importance for ncRNA functional studies. It is known that RNA-RNA interactions are built up by complementary base pairings between interacting RNAs and that a high level of complementarity between two RNA sequences is a powerful predictor of such interactions. Here, we present RIsearch2, a large-scale RNA-RNA interaction prediction tool that enables quick localization of potential near-complementary RNA-RNA interactions between given query and target sequences. In contrast to previous heuristics which either search for exact matches while including G-U wobble pairs or employ simplified energy models, we present a novel approach using a single integrated seed-and-extend framework based on suffix arrays. RIsearch2 enables fast discovery of candidate RNA-RNA interactions on a genome/transcriptome-wide scale. We furthermore present an siRNA off-target discovery pipeline that not only predicts the off-target transcripts but also computes the off-targeting potential of a given siRNA. This is achieved by combining genome-wide RIsearch2 predictions with target site accessibilities and transcript abundance estimates. We show that this pipeline accurately predicts siRNA off-target interactions and enables off-targeting potential comparisons between different siRNA designs. RIsearch2 and the siRNA off-target discovery pipeline are available as stand-alone software packages from http://rth.dk/resources/risearch.
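The seed-and-extend idea can be sketched in a few lines of Python. This is a deliberately simplified illustration and not the RIsearch2 implementation: an exhaustive scan stands in for the suffix-array lookup, extension is ungapped, and no energy model is applied. The function names and the `seed_len` default are assumptions.

```python
from typing import List, Tuple

# Watson-Crick pairs plus G-U wobble pairs
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def can_pair(a: str, b: str) -> bool:
    """True if the two nucleotides can form a canonical or wobble pair."""
    return (a, b) in PAIRS


def seed_and_extend(query: str, target: str,
                    seed_len: int = 4) -> List[Tuple[int, int, int]]:
    """Return maximal ungapped duplexes as (query_start, target_end, length).

    The strands are antiparallel: query[query_start + k] pairs with
    target[target_end - k] for every k < length.
    """
    hits = set()
    n, m = len(query), len(target)
    for i in range(n - seed_len + 1):
        for j in range(seed_len - 1, m):
            # Seed: seed_len consecutive pairs on one antiparallel diagonal.
            if not all(can_pair(query[i + k], target[j - k]) for k in range(seed_len)):
                continue
            # Extend toward the query 5' end (target 3' end).
            qi, tj = i, j
            while qi > 0 and tj < m - 1 and can_pair(query[qi - 1], target[tj + 1]):
                qi -= 1
                tj += 1
            # Extend toward the query 3' end (target 5' end).
            length = (i - qi) + seed_len
            while (qi + length < n and tj - length >= 0
                   and can_pair(query[qi + length], target[tj - length])):
                length += 1
            hits.add((qi, tj, length))
    return sorted(hits)


if __name__ == "__main__":
    for q_start, t_end, length in seed_and_extend("AACCCCAA", "AAGGGGAA"):
        print(q_start, t_end, length)
```

Collecting hits in a set means that overlapping seeds on the same diagonal collapse into a single maximal duplex. A genome-scale tool would replace the double loop with an index over the target.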
Subjects
Statistical Models , Small Interfering RNA/genetics , Untranslated RNA/genetics , Software , Transcriptome , Algorithms , Base Pairing , Base Sequence , Tumor Cell Line , Humans , Genetic Models , Small Interfering RNA/metabolism , Untranslated RNA/metabolism
ABSTRACT
MOTIVATION: Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. RESULTS: We developed a C++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows the solution space to be specified and explored in a well-defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaptation of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. AVAILABILITY AND IMPLEMENTATION: RNAblueprint, Python implementations and benchmark datasets are available on GitHub: https://github.com/ViennaRNA. CONTACT: s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subjects
Molecular Models , RNA/chemistry , Software , Synthetic Biology/methods , Nucleic Acid Conformation
ABSTRACT
AREsite2 represents an update for AREsite, an on-line resource for the investigation of AU-rich elements (ARE) in human and mouse mRNA 3'UTR sequences. The new updated and enhanced version allows detailed investigation of AU-, GU- and U-rich elements (ARE, GRE, URE) in the transcriptome of Homo sapiens, Mus musculus, Danio rerio, Caenorhabditis elegans and Drosophila melanogaster. It contains information on genomic location, genic context, RNA secondary structure context and conservation of annotated motifs. Improvements include annotation of motifs not only in 3'UTRs but in the whole gene body including introns, additional genomes, and locally stable secondary structures from genome-wide scans. Furthermore, we include data from CLIP-Seq experiments in order to highlight motifs with validated protein interaction. Additionally, we provide a REST interface for experienced users to interact with the database in a semi-automated manner. The database is publicly available at: http://rna.tbi.univie.ac.at/AREsite.
Subjects
3' Untranslated Regions , Nucleic Acid Databases , RNA/chemistry , Animals , Genomics , Humans , Mice , Molecular Sequence Annotation , Nucleic Acid Conformation , Nucleotide Motifs
ABSTRACT
Determining the function of a non-coding RNA requires costly and time-consuming wet-lab experiments. For this reason, computational methods which ascertain the homology of a sequence and thereby deduce functionality and family membership are often exploited. In this fashion, newly sequenced genomes can be annotated in a completely computational way. Covariance models are commonly used to assign novel RNA sequences to a known RNA family. However, to construct such models several examples of the family have to be already known. Moreover, model building is the work of experts who manually edit the necessary RNA alignment and consensus structure. Our method, RNAlien, starting from a single input sequence collects potential family member sequences by multiple iterations of homology search. RNA family models are fully automatically constructed for the found sequences. We have tested our method on a subset of the Rfam RNA family database. RNAlien models are a starting point to construct models of comparable sensitivity and specificity to manually curated ones from the Rfam database. RNAlien Tool and web server are available at http://rna.tbi.univie.ac.at/rnalien/.
Subjects
Algorithms , Molecular Models , RNA/chemistry , Base Pairing , Humans , Nucleic Acid Conformation , Nucleic Acid Sequence Homology , Time Factors
ABSTRACT
Gene repression induced by the formation of transcriptional terminators represents a prime example for the coupling of RNA synthesis, folding, and regulation. In this context, mapping the changes in available conformational space of transcription intermediates during RNA synthesis is important to understand riboswitch function. A majority of riboswitches, an important class of small metabolite-sensing regulatory RNAs, act as transcriptional regulators, but the dependence of ligand binding and the subsequent allosteric conformational switch on mRNA transcript length has not yet been investigated. We show a strict fine-tuning of binding and sequence-dependent alterations of conformational space by structural analysis of all relevant transcription intermediates at single-nucleotide resolution for the I-A type 2'dG-sensing riboswitch from Mesoplasma florum by NMR spectroscopy. Our results provide a general framework to dissect the coupling of synthesis and folding essential for riboswitch function, revealing the importance of metastable states for RNA-based gene regulation.
Subjects
Riboswitch , Allosteric Regulation , Binding Sites , Gene Expression , Ligands , Magnetic Resonance Spectroscopy , Biological Models , Nucleic Acid Conformation , RNA Folding , Genetic Transcription
ABSTRACT
A 3D model of RNA structure can provide information about its function and regulation that is not possible with just the sequence or secondary structure. Current models suffer from low accuracy and long running times and either neglect or presume knowledge of the long-range interactions which stabilize the tertiary structure. Our coarse-grained, helix-based, tertiary structure model operates with only a few degrees of freedom compared with all-atom models while preserving the ability to sample tertiary structures given a secondary structure. It strikes a balance between the precision of an all-atom tertiary structure model and the simplicity and effectiveness of a secondary structure representation. It provides a simplified tool for exploring global arrangements of helices and loops within RNA structures. We provide an example of a novel energy function relying only on the positions of stems and loops. We show that coupling our model to this energy function produces predictions as good as or better than the current state-of-the-art tools. We propose that given the wide range of conformational space that needs to be explored, a coarse-grained approach can explore more conformations in fewer iterations than an all-atom model coupled to a fine-grained energy function. Finally, we emphasize the overarching theme of providing an ensemble of predicted structures, something which our tool excels at, rather than providing a handful of the lowest energy structures.
Subjects
Computational Biology/methods , RNA/chemistry , Algorithms , Hydrogen Bonding , Molecular Models , Nucleic Acid Conformation
ABSTRACT
MOTIVATION: The function of an RNA molecule is not only linked to its native structure, which is usually taken to be the ground state of its folding landscape, but also in many cases crucially depends on the details of the folding pathways such as stable folding intermediates or the timing of the folding process itself. To model and understand these processes, it is necessary to go beyond ground state structures. The study of rugged RNA folding landscapes holds the key to answering these questions. Efficient coarse-graining methods are required to reduce the intractably vast energy landscapes into condensed representations such as barrier trees or basin hopping graphs (BHG) that convey an approximate but comprehensive picture of the folding kinetics. So far, exact and heuristic coarse-graining methods have been mostly restricted to pseudoknot-free secondary structures. Pseudoknots, which are common motifs and have been repeatedly hypothesized to play an important role in guiding folding trajectories, were usually excluded. RESULTS: We generalize the BHG framework to include pseudoknotted RNA structures and systematically study the differences in predicted folding behavior depending on whether pseudoknotted structures are allowed to occur as folding intermediates or not. We observe that RNAs with pseudoknotted ground state structures tend to have more pseudoknotted folding intermediates than RNAs with pseudoknot-free ground state structures. The occurrence and influence of pseudoknotted intermediates on the folding pathway, however, appear to depend very strongly on the individual RNAs so that no general rule can be inferred. AVAILABILITY AND IMPLEMENTATION: The algorithms described here are implemented in C++ as standalone programs. Their source code and supplemental material can be freely downloaded from http://www.tbi.univie.ac.at/bhg.html. CONTACT: qin@bioinf.uni-leipzig.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.