Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
1.
Bioinformatics ; 40(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38230755

RESUMO

MOTIVATION: The prediction of RNA structure canonical base pairs from a single sequence, especially pseudoknotted ones, remains challenging in a thermodynamic models that approximates the energy of the local 3D motifs joining canonical stems. It has become more and more apparent in recent years that the structural motifs in the loops, composed of noncanonical interactions, are essential for the final shape of the molecule enabling its multiple functions. Our capacity to predict accurate 3D structures is also limited when it comes to the organization of the large intricate network of interactions that form inside those loops. RESULTS: We previously developed the integer programming framework RNA Motifs over Integer Programming (RNAMoIP) to reconcile RNA secondary structure and local 3D motif information available in databases. We further develop our model to now simultaneously predict the canonical base pairs (with pseudoknots) from base pair probability matrices with or without alignment. We benchmarked our new method over the all nonredundant RNAs below 150 nucleotides. We show that the joined prediction of canonical base pairs structure and local conserved motifs (i) improves the ratio of well-predicted interactions in the secondary structure, (ii) predicts well canonical and Wobble pairs at the location where motifs are inserted, (iii) is greatly improved with evolutionary information, and (iv) noncanonical motifs at kink-turn locations. AVAILABILITY AND IMPLEMENTATION: The source code of the framework is available at https://gitlab.info.uqam.ca/cbe/RNAMoIP and an interactive web server at https://rnamoip.cbe.uqam.ca/.


Assuntos
Algoritmos , RNA , RNA/química , Conformação de Ácido Nucleico , Software , Motivos de Nucleotídeos
2.
PLoS Comput Biol ; 19(8): e1011309, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37535676

RESUMO

Hepatitis B virus (HBV) infection kinetics in immunodeficient mice reconstituted with humanized livers from inoculation to steady state is highly dynamic despite the absence of an adaptive immune response. To recapitulate the multiphasic viral kinetic patterns, we developed an agent-based model that includes intracellular virion production cycles reflecting the cyclic nature of each individual virus lifecycle. The model fits the data well predicting an increase in production cycles initially starting with a long production cycle of 1 virion per 20 hours that gradually reaches 1 virion per hour after approximately 3-4 days before virion production increases dramatically to reach to a steady state rate of 4 virions per hour per cell. Together, modeling suggests that it is the cyclic nature of the virus lifecycle combined with an initial slow but increasing rate of HBV production from each cell that plays a role in generating the observed multiphasic HBV kinetic patterns in humanized mice.


Assuntos
Hepatite B , Replicação Viral , Animais , Camundongos , Cinética , DNA Viral , Vírus da Hepatite B/genética , Vírion/fisiologia
3.
Phys Rev Lett ; 131(21): 218401, 2023 Nov 24.
Artigo em Inglês | MEDLINE | ID: mdl-38072605

RESUMO

AlphaFold2 (AF) is a promising tool, but is it accurate enough to predict single mutation effects? Here, we report that the localized structural deformation between protein pairs differing by only 1-3 mutations-as measured by the effective strain-is correlated across 3901 experimental and AF-predicted structures. Furthermore, analysis of ∼11 000 proteins shows that the local structural change correlates with various phenotypic changes. These findings suggest that AF can predict the range and magnitude of single-mutation effects on average, and we propose a method to improve precision of AF predictions and to indicate when predictions are unreliable.


Assuntos
Mutação , Proteínas , Software , Proteínas/genética
4.
RNA ; 26(11): 1530-1540, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-32747608

RESUMO

Chaperone proteins-the most disordered among all protein groups-help RNAs fold into their functional structure by destabilizing misfolded configurations or stabilizing the functional ones. But disentangling the mechanism underlying RNA chaperoning is challenging, mostly because of inherent disorder of the chaperones and the transient nature of their interactions with RNA. In particular, it is unclear how specific the interactions are and what role is played by amino acid charge and polarity patterns. Here, we address these questions in the RNA chaperone StpA. We adapted direct coupling analysis (DCA) into the αßDCA method that can treat in tandem sequences written in two alphabets, nucleotides and amino acids. With αßDCA, we could analyze StpA-RNA interactions and show consistency with a previously proposed two-pronged mechanism: StpA disrupts specific positions in the group I intron while globally and loosely binding to the entire structure. Moreover, the interactions are strongly associated with the charge pattern: Negatively charged regions in the destabilizing StpA amino-terminal affect a few specific positions in the RNA, located in stems and in the pseudoknot. In contrast, positive regions in the carboxy-terminal contain strongly coupled amino acids that promote nonspecific or weakly specific binding to the RNA. The present study opens new avenues to examine the functions of disordered proteins and to design disruptive proteins based on their charge patterns.


Assuntos
Proteínas de Ligação a DNA/química , Proteínas de Ligação a DNA/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Escherichia coli/metabolismo , Chaperonas Moleculares/química , Chaperonas Moleculares/metabolismo , RNA/metabolismo , Algoritmos , Sequência de Aminoácidos , Sequência de Bases , Proteínas de Ligação a DNA/genética , Escherichia coli/química , Proteínas de Escherichia coli/genética , Íntrons , Modelos Moleculares , Chaperonas Moleculares/genética , Conformação de Ácido Nucleico , Ligação Proteica , RNA/química , Dobramento de RNA
5.
J Virol ; 95(14): e0049220, 2021 06 24.
Artigo em Inglês | MEDLINE | ID: mdl-33910953

RESUMO

Whereas the mode of action of lamivudine (LAM) against hepatitis B virus (HBV) is well established, the inhibition mechanism(s) of interferon alpha (IFN-α) is less completely defined. To advance our understanding, we mathematically modeled HBV kinetics during 14-day pegylated IFN-α-2a (pegIFN), LAM, or pegIFN-plus-LAM (pegIFN+LAM) treatment of 39 chronically HBV-infected humanized uPA/SCID chimeric mice. Serum HBV DNA and intracellular HBV DNA were measured frequently. We developed a multicompartmental mathematical model and simultaneously fit it to the serum and intracellular HBV DNA data. Unexpectedly, even in the absence of an adaptive immune response, a biphasic decline in serum HBV DNA and intracellular HBV DNA was observed in response to all treatments. Kinetic analysis and modeling indicate that the first phase represents inhibition of intracellular HBV DNA synthesis and secretion, which was similar under all treatments with an overall mean efficacy of 98%. In contrast, there were distinct differences in HBV decline during the second phase, which was accounted for in the model by a time-dependent inhibition of intracellular HBV DNA synthesis, with the steepest decline observed during pegIFN+LAM treatment (1.28/day) and the slowest (0.1/day) during pegIFN monotherapy. Reminiscent of observations in patients treated with pegIFN and/or LAM, a biphasic HBV decline was observed in treated humanized mice in the absence of an adaptive immune response. Interestingly, combination treatment did not increase the initial inhibition of HBV production but rather enhanced second-phase decline, providing insight into the dynamics of HBV treatment response and the mode of action of IFN-α against HBV. IMPORTANCE Chronic hepatitis B virus (HBV) infection remains a global health care problem, as we lack sufficient curative treatment options. Elucidating the dynamics of HBV infection and treatment response at the molecular level could facilitate the development of novel, more effective HBV antivirals. Currently, the only well-established small animal HBV infection model available is the chimeric uPA/SCID mice with humanized livers; however, the HBV inhibition kinetics under pegylated IFN-α-2a (pegIFN) in this model system have not been determined in sufficient detail. In this study, viral kinetics in 39 humanized mice treated with pegIFN and/or lamivudine were monitored and analyzed using a mathematical modeling approach. We found that the main mode of action of IFN-α is blocking HBV DNA synthesis and that the majority of synthesized HBV DNA is secreted. Our study provides novel insights into HBV DNA dynamics within infected human hepatocytes.


Assuntos
Antivirais/farmacologia , Vírus da Hepatite B/fisiologia , Hepatite B/tratamento farmacológico , Hepatite B/virologia , Interferon-alfa/farmacologia , Animais , Pré-Escolar , DNA Viral/sangue , Modelos Animais de Doenças , Feminino , Vírus da Hepatite B/efeitos dos fármacos , Humanos , Lactente , Cinética , Lamivudina/farmacologia , Transplante de Fígado , Masculino , Camundongos SCID , Modelos Teóricos , Polietilenoglicóis/farmacologia , Proteínas Recombinantes/farmacologia , Albumina Sérica/metabolismo , Quimeras de Transplante
6.
PLoS Comput Biol ; 17(5): e1008990, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-34048427

RESUMO

RNA tertiary structure is crucial to its many non-coding molecular functions. RNA architecture is shaped by its secondary structure composed of stems, stacked canonical base pairs, enclosing loops. While stems are precisely captured by free-energy models, loops composed of non-canonical base pairs are not. Nor are distant interactions linking together those secondary structure elements (SSEs). Databases of conserved 3D geometries (a.k.a. modules) not captured by energetic models are leveraged for structure prediction and design, but the computational complexity has limited their study to local elements, loops. Representing the RNA structure as a graph has recently allowed to expend this work to pairs of SSEs, uncovering a hierarchical organization of these 3D modules, at great computational cost. Systematically capturing recurrent patterns on a large scale is a main challenge in the study of RNA structures. In this paper, we present an efficient algorithm to compute maximal isomorphisms in edge colored graphs. We extend this algorithm to a framework well suited to identify RNA modules, and fast enough to considerably generalize previous approaches. To exhibit the versatility of our framework, we first reproduce results identifying all common modules spanning more than 2 SSEs, in a few hours instead of weeks. The efficiency of our new algorithm is demonstrated by computing the maximal modules between any pair of entire RNA in the non-redundant corpus of known RNA 3D structures. We observe that the biggest modules our method uncovers compose large shared sub-structure spanning hundreds of nucleotides and base pairs between the ribosomes of Thermus thermophilus, Escherichia Coli, and Pseudomonas aeruginosa.


Assuntos
Conformação de Ácido Nucleico , RNA/química , Algoritmos , Pareamento de Bases , Biologia Computacional/métodos
7.
RNA Biol ; 19(1): 1208-1227, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-36384383

RESUMO

This study investigates the importance of the structural context in the formation of a type I/II A-minor motif. This very frequent structural motif has been shown to be important in the spatial folding of RNA molecules. We developed an automated method to classify A-minor motif occurrences according to their 3D context similarities, and we used a graph approach to represent both the structural A-minor motif occurrences and their classes at different scales. This approach leads us to uncover new subclasses of A-minor motif occurrences according to their local 3D similarities. The majority of classes are composed of homologous occurrences, but some of them are composed of non-homologous occurrences. The different classifications we obtain allow us to better understand the importance of the context in the formation of A-minor motifs. In a second step, we investigate how much knowledge of the context around an A-minor motif can help to infer its presence (and position). More specifically, we want to determine what kind of information, contained in the structural context, can be useful to characterize and predict A-minor motifs. We show that, for some A-minor motifs, the topology combined with a sequence signal is sufficient to predict the presence and the position of an A-minor motif occurrence. In most other cases, these signals are not sufficient for predicting the A-minor motif, however we show that they are good signals for this purpose. All the classification and prediction pipelines rely on automated processes, for which we describe the underlying algorithms and parameters.


Assuntos
Imageamento Tridimensional , RNA , Algoritmos , Valor Preditivo dos Testes , Humanos , RNA/química
8.
Nucleic Acids Res ; 48(14): 7690-7699, 2020 08 20.
Artigo em Inglês | MEDLINE | ID: mdl-32652015

RESUMO

RNA-small molecule binding is a key regulatory mechanism which can stabilize 3D structures and activate molecular functions. The discovery of RNA-targeting compounds is thus a current topic of interest for novel therapies. Our work is a first attempt at bringing the scalability and generalization abilities of machine learning methods to the problem of RNA drug discovery, as well as a step towards understanding the interactions which drive binding specificity. Our tool, RNAmigos, builds and encodes a network representation of RNA structures to predict likely ligands for novel binding sites. We subject ligand predictions to virtual screening and show that we are able to place the true ligand in the 71st-73rd percentile in two decoy libraries, showing a significant improvement over several baselines, and a state of the art method. Furthermore, we observe that augmenting structural networks with non-canonical base pairing data is the only representation able to uncover a significant signal, suggesting that such interactions are a necessary source of binding specificity. We also find that pre-training with an auxiliary graph representation learning task significantly boosts performance of ligand prediction. This finding can serve as a general principle for RNA structure-function prediction when data is scarce. RNAmigos shows that RNA binding data contains structural patterns with potential for drug discovery, and provides methodological insights for possible applications to other structure-function learning tasks. The source code, data and a Web server are freely available at http://rnamigos.cs.mcgill.ca.


Assuntos
RNA/química , Software , Pareamento de Bases , Sítios de Ligação , Ligantes , Conformação de Ácido Nucleico
9.
RNA ; 25(12): 1579-1591, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31467146

RESUMO

The RNA world hypothesis relies on the ability of ribonucleic acids to spontaneously acquire complex structures capable of supporting essential biological functions. Multiple sophisticated evolutionary models have been proposed for their emergence, but they often assume specific conditions. In this work, we explore a simple and parsimonious scenario describing the emergence of complex molecular structures at the early stages of life. We show that at specific GC content regimes, an undirected replication model is sufficient to explain the apparition of multibranched RNA secondary structures-a structural signature of many essential ribozymes. We ran a large-scale computational study to map energetically stable structures on complete mutational networks of 50-nt-long RNA sequences. Our results reveal that the sequence landscape with stable structures is enriched with multibranched structures at a length scale coinciding with the appearance of complex structures in RNA databases. A random replication mechanism preserving a 50% GC content may suffice to explain a natural enrichment of stable complex structures in populations of functional RNAs. In contrast, an evolutionary mechanism eliciting the most stable folds at each generation appears to help reaching multibranched structures at highest GC content.


Assuntos
Conformação de Ácido Nucleico , RNA/química , Composição de Bases , Sequência de Bases , Evolução Molecular , Mutação , RNA/genética , Dobramento de RNA , Estabilidade de RNA , Relação Estrutura-Atividade , Transcrição Gênica
10.
Bioinformatics ; 36(9): 2920-2922, 2020 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-31971575

RESUMO

SUMMARY: RNA design has conceptually evolved from the inverse RNA folding problem. In the classical inverse RNA problem, the user inputs an RNA secondary structure and receives an output RNA sequence that folds into it. Although modern RNA design methods are based on the same principle, a finer control over the resulting sequences is sought. As an important example, a substantial number of non-coding RNA families show high preservation in specific regions, while being more flexible in others and this information should be utilized in the design. By using the additional information, RNA design tools can help solve problems of practical interest in the growing fields of synthetic biology and nanotechnology. incaRNAfbinv 2.0 utilizes a fragment-based approach, enabling a control of specific RNA secondary structure motifs. The new version allows significantly more control over the general RNA shape, and also allows to express specific restrictions over each motif separately, in addition to other advanced features. AVAILABILITY AND IMPLEMENTATION: incaRNAfbinv 2.0 is available through a standalone package and a web-server at https://www.cs.bgu.ac.il/incaRNAfbinv. Source code, command-line and GUI wrappers can be found at https://github.com/matandro/RNAsfbinv. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
RNA , Software , Motivos de Nucleotídeos , RNA/genética , Dobramento de RNA , Análise de Sequência de RNA
11.
Nucleic Acids Res ; 47(7): 3321-3332, 2019 04 23.
Artigo em Inglês | MEDLINE | ID: mdl-30828711

RESUMO

RNA structures possess multiple levels of structural organization. A secondary structure, made of Watson-Crick helices connected by loops, forms a scaffold for the tertiary structure. The 3D structures adopted by these loops are therefore critical determinants shaping the global 3D architecture. Earlier studies showed that these local 3D structures can be described as conserved sets of ordered non-Watson-Crick base pairs called RNA structural modules. Unfortunately, the computational efficiency and scope of the current 3D module identification methods are too limited yet to benefit from all the knowledge accumulated in the module databases. We present BayesPairing, an automated, efficient and customizable tool for (i) building Bayesian networks representing RNA 3D modules and (ii) rapid identification of 3D modules in sequences. BayesPairing uses a flexible definition of RNA 3D modules that allows us to consider complex architectures such as multi-branched loops and features multiple algorithmic improvements. We benchmarked our methods using cross-validation techniques on 3409 RNA chains and show that BayesPairing achieves up to ∼70% identification accuracy on module positions and base pair interactions. BayesPairing can handle a broader range of motifs (versatility) and offers considerable running time improvements (efficiency), opening the door to a broad range of large-scale applications.


Assuntos
Pareamento de Bases , Teorema de Bayes , RNA/química , Automação , Bases de Dados Genéticas , Conjuntos de Dados como Assunto , Reprodutibilidade dos Testes , Fatores de Tempo
12.
Brief Bioinform ; 19(2): 350-358, 2018 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-28049135

RESUMO

Computational programs for predicting RNA sequences with desired folding properties have been extensively developed and expanded in the past several years. Given a secondary structure, these programs aim to predict sequences that fold into a target minimum free energy secondary structure, while considering various constraints. This procedure is called inverse RNA folding. Inverse RNA folding has been traditionally used to design optimized RNAs with favorable properties, an application that is expected to grow considerably in the future in light of advances in the expanding new fields of synthetic biology and RNA nanostructures. Moreover, it was recently demonstrated that inverse RNA folding can successfully be used as a valuable preprocessing step in computational detection of novel noncoding RNAs. This review describes the most popular freeware programs that have been developed for such purposes, starting from RNAinverse that was devised when formulating the inverse RNA folding problem. The most recently published ones that consider RNA secondary structure as input are antaRNA, RNAiFold and incaRNAfbinv, each having different features that could be beneficial to specific biological problems in practice. The various programs also use distinct approaches, ranging from ant colony optimization to constraint programming, in addition to adaptive walk, simulated annealing and Boltzmann sampling. This review compares between the various programs and provides a simple description of the various possibilities that would benefit practitioners in selecting the most suitable program. It is geared for specific tasks requiring RNA design based on input secondary structure, with an outlook toward the future of RNA design programs.


Assuntos
Algoritmos , Conformação de Ácido Nucleico , Dobramento de RNA , RNA/química , Software , Animais , Biologia Computacional/métodos , Humanos , Modelos Moleculares
13.
Nucleic Acids Res ; 46(8): 3841-3851, 2018 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-29608773

RESUMO

The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.


Assuntos
Bases de Dados de Ácidos Nucleicos/estatística & dados numéricos , Conformação de Ácido Nucleico , RNA/química , Algoritmos , Pareamento de Bases , Biologia Computacional/métodos , Mineração de Dados/métodos , Bases de Dados de Proteínas/estatística & dados numéricos , Modelos Moleculares , Dobramento de RNA , Software
14.
Bull Math Biol ; 81(10): 3675-3721, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31338739

RESUMO

Mathematical models that are based on differential equations require detailed knowledge about the parameters that are included in the equations. Some of the parameters can be measured experimentally while others need to be estimated. When the models become more sophisticated, such as in the case of multiscale models of hepatitis C virus dynamics that deal with partial differential equations (PDEs), several strategies can be tried. It is possible to use parameter estimation on an analytical approximation of the solution to the multiscale model equations, namely the long-term approximation, but this limits the scope of the parameter estimation method used and a long-term approximation needs to be derived for each model. It is possible to transform the PDE multiscale model to a system of ODEs, but this has an effect on the model parameters themselves and the transformation can become problematic for some models. Finally, it is possible to use numerical solutions for the multiscale model and then use canned methods for the parameter estimation, but the latter is making the user dependent on a black box without having full control over the method. The strategy developed here is to start by working directly on the multiscale model equations for preparing them toward the parameter estimation method that is fully coded and controlled by the user. It can also be adapted to multiscale models of other viruses. The new method is described, and illustrations are provided using a user-friendly simulator that incorporates the method.


Assuntos
Hepacivirus/fisiologia , Hepatite C Crônica/virologia , Modelos Biológicos , Antivirais/uso terapêutico , Simulação por Computador , Hepatite C Crônica/terapia , Humanos , Cinética , Conceitos Matemáticos
15.
Nucleic Acids Res ; 45(W1): W440-W444, 2017 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-28525607

RESUMO

RNA structures are hierarchically organized. The secondary structure is articulated around sophisticated local three-dimensional (3D) motifs shaping the full 3D architecture of the molecule. Recent contributions have identified and organized recurrent local 3D motifs, but applications of this knowledge for predictive purposes is still in its infancy. We recently developed a computational framework, named RNA-MoIP, to reconcile RNA secondary structure and local 3D motif information available in databases. In this paper, we introduce a web service using our software for predicting RNA hybrid 2D-3D structures from sequence data only. Optionally, it can be used for (i) local 3D motif prediction or (ii) the refinement of user-defined secondary structures. Importantly, our web server automatically generates a script for the MC-Sym software, which can be immediately used to quickly predict all-atom RNA 3D models. The web server is available at http://rnamoip.cs.mcgill.ca.


Assuntos
Motivos de Nucleotídeos , RNA/química , Software , Sequência de Bases , Internet , Modelos Moleculares , Conformação de Ácido Nucleico
16.
Nucleic Acids Res ; 44(11): e104, 2016 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-27095200

RESUMO

Systematic structure probing experiments (e.g. SHAPE) of RNA mutants such as the mutate-and-map (MaM) protocol give us a direct access into the genetic robustness of ncRNA structures. Comparative studies of homologous sequences provide a distinct, yet complementary, approach to analyze structural and functional properties of non-coding RNAs. In this paper, we introduce a formal framework to combine the biochemical signal collected from MaM experiments, with the evolutionary information available in multiple sequence alignments. We apply neutral theory principles to detect complex long-range dependencies between nucleotides of a single stranded RNA, and implement these ideas into a software called aRNhAck We illustrate the biological significance of this signal and show that the nucleotides networks calculated with aRNhAck are correlated with nucleotides located in RNA-RNA, RNA-protein, RNA-DNA and RNA-ligand interfaces. aRNhAck is freely available at http://csb.cs.mcgill.ca/arnhack.


Assuntos
Evolução Molecular , Mutação , Conformação de Ácido Nucleico , RNA/genética , Software , Algoritmos , Sítios de Ligação , Biologia Computacional/métodos , DNA/química , Modelos Moleculares , Ligação Proteica , Conformação Proteica , Proteínas/química , RNA/química , Navegador
17.
Nucleic Acids Res ; 44(W1): W308-14, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27185893

RESUMO

In recent years, new methods for computational RNA design have been developed and applied to various problems in synthetic biology and nanotechnology. Lately, there is considerable interest in incorporating essential biological information when solving the inverse RNA folding problem. Correspondingly, RNAfbinv aims at including biologically meaningful constraints and is the only program to-date that performs a fragment-based design of RNA sequences. In doing so it allows the design of sequences that do not necessarily exactly fold into the target, as long as the overall coarse-grained tree graph shape is preserved. Augmented by the weighted sampling algorithm of incaRNAtion, our web server called incaRNAfbinv implements the method devised in RNAfbinv and offers an interactive environment for the inverse folding of RNA using a fragment-based design approach. It takes as input: a target RNA secondary structure; optional sequence and motif constraints; optional target minimum free energy, neutrality and GC content. In addition to the design of synthetic regulatory sequences, it can be used as a pre-processing step for the detection of novel natural occurring RNAs. The two complementary methodologies RNAfbinv and incaRNAtion are merged together and fully implemented in our web server incaRNAfbinv, available at http://www.cs.bgu.ac.il/incaRNAfbinv.


Assuntos
Conformação de Ácido Nucleico , Dobramento de RNA , RNA/química , Software , Algoritmos , Composição de Bases , Pareamento de Bases , Sequência de Bases , Gráficos por Computador , Internet , Mutação , RNA/genética , Análise de Sequência de RNA , Termodinâmica
18.
BMC Genomics ; 17(Suppl 10): 862, 2016 11 11.
Artigo em Inglês | MEDLINE | ID: mdl-28185557

RESUMO

BACKGROUND: Secondary structures form the scaffold of multiple sequence alignment of non-coding RNA (ncRNA) families. An accurate reconstruction of ancestral ncRNAs must use this structural signal. However, the inference of ancestors of a single ncRNA family with a single consensus structure may bias the results towards sequences with high affinity to this structure, which are far from the true ancestors. METHODS: In this paper, we introduce achARNement, a maximum parsimony approach that, given two alignments of homologous ncRNA families with consensus secondary structures and a phylogenetic tree, simultaneously calculates ancestral RNA sequences for these two families. RESULTS: We test our methodology on simulated data sets, and show that achARNement outperforms classical maximum parsimony approaches in terms of accuracy, but also reduces by several orders of magnitude the number of candidate sequences. To conclude this study, we apply our algorithms on the Glm clan and the FinP-traJ clan from the Rfam database. CONCLUSIONS: Our results show that our methods reconstruct small sets of high-quality candidate ancestors with better agreement to the two target structures than with classical approaches. Our program is freely available at: http://csb.cs.mcgill.ca/acharnement .


Assuntos
Algoritmos , RNA não Traduzido/química , Evolução Biológica , Bases de Dados Genéticas , Internet , Conformação de Ácido Nucleico , Filogenia , RNA não Traduzido/classificação , RNA não Traduzido/genética , Alinhamento de Sequência , Interface Usuário-Computador
19.
Bioinformatics ; 29(13): i308-15, 2013 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-23812999

RESUMO

MOTIVATIONS: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. RESULTS: In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. AVAILABILITY: IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , RNA/química , Composição de Bases , Sequência de Bases , Modelos Estatísticos , Conformação de Ácido Nucleico , Nucleotídeos/análise , Análise de Sequência de RNA , Software , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA