Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
Nat Methods ; 19(10): 1234-1242, 2022 10.
Artículo en Inglés | MEDLINE | ID: mdl-36192461

RESUMEN

Despite the popularity of computer-aided study and design of RNA molecules, little is known about the accuracy of commonly used structure modeling packages in tasks sensitive to ensemble properties of RNA. Here, we demonstrate that the EternaBench dataset, a set of more than 20,000 synthetic RNA constructs designed on the RNA design platform Eterna, provides incisive discriminative power in evaluating current packages in ensemble-oriented structure prediction tasks. We find that CONTRAfold and RNAsoft, packages with parameters derived through statistical learning, achieve consistently higher accuracy than more widely used packages in their standard settings, which derive parameters primarily from thermodynamic experiments. We hypothesized that training a multitask model with the varied data types in EternaBench might improve inference on ensemble-based prediction tasks. Indeed, the resulting model, named EternaFold, demonstrated improved performance that generalizes to diverse external datasets including complete messenger RNAs, viral genomes probed in human cells and synthetic designs modeling mRNA vaccines.


Asunto(s)
Algoritmos , ARN , Humanos , Conformación de Ácido Nucleico , Estructura Secundaria de Proteína , ARN/genética , Termodinámica
2.
Trends Biochem Sci ; 39(11): 507-9, 2014 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-25300714

RESUMEN

Hypothesis-driven experimentation - the scientific method - can be subverted by fraud, irreproducibility, and lack of rigorous predictive tests. A robust solution to these problems may be the 'massive open laboratory' model, recently embodied in the internet-scale videogame EteRNA. Deploying similar platforms throughout biology could enforce the scientific method more broadly.


Asunto(s)
Biología Computacional/métodos , Pliegue del ARN , ARN/química , Interfaz Usuario-Computador , Juegos de Video , Biología Molecular/instrumentación , Biología Molecular/métodos , ARN/genética
3.
Proc Natl Acad Sci U S A ; 111(6): 2122-7, 2014 Feb 11.
Artículo en Inglés | MEDLINE | ID: mdl-24469816

RESUMEN

Self-assembling RNA molecules present compelling substrates for the rational interrogation and control of living systems. However, imperfect in silico models--even at the secondary structure level--hinder the design of new RNAs that function properly when synthesized. Here, we present a unique and potentially general approach to such empirical problems: the Massive Open Laboratory. The EteRNA project connects 37,000 enthusiasts to RNA design puzzles through an online interface. Uniquely, EteRNA participants not only manipulate simulated molecules but also control a remote experimental pipeline for high-throughput RNA synthesis and structure mapping. We show herein that the EteRNA community leveraged dozens of cycles of continuous wet laboratory feedback to learn strategies for solving in vitro RNA design problems on which automated methods fail. The top strategies--including several previously unrecognized negative design rules--were distilled by machine learning into an algorithm, EteRNABot. Over a rigorous 1-y testing phase, both the EteRNA community and EteRNABot significantly outperformed prior algorithms in a dozen RNA secondary structure design tests, including the creation of dendrimer-like structures and scaffolds for small molecule sensors. These results show that an online community can carry out large-scale experiments, hypothesis generation, and algorithm design to create practical advances in empirical science.


Asunto(s)
Laboratorios/organización & administración , ARN/química , Algoritmos , Conformación de Ácido Nucleico , Programas Informáticos , Interfaz Usuario-Computador
4.
Nature ; 466(7307): 756-60, 2010 Aug 05.
Artículo en Inglés | MEDLINE | ID: mdl-20686574

RESUMEN

People exert large amounts of problem-solving effort playing computer games. Simple image- and text-recognition tasks have been successfully 'crowd-sourced' through games, but it is not clear if more complex scientific problems can be solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search space. Here we describe Foldit, a multiplayer online game that engages non-scientists in solving hard prediction problems. Foldit players interact with protein structures using direct manipulation tools and user-friendly versions of algorithms from the Rosetta structure prediction methodology, while they compete and collaborate to optimize the computed energy. We show that top-ranked Foldit players excel at solving challenging structure refinement problems in which substantial backbone rearrangements are necessary to achieve the burial of hydrophobic residues. Players working collaboratively develop a rich assortment of new strategies and algorithms; unlike computational approaches, they explore not only the conformational space but also the space of possible search strategies. The integration of human visual problem-solving and strategy development capabilities with traditional computational algorithms through interactive multiplayer games is a powerful new approach to solving computationally-limited scientific problems.


Asunto(s)
Biología Computacional/métodos , Juegos Experimentales , Procesos de Grupo , Internet , Solución de Problemas , Pliegue de Proteína , Proteínas/química , Algoritmos , Gráficos por Computador , Simulación por Computador , Conducta Cooperativa , Señales (Psicología) , Humanos , Enlace de Hidrógeno , Interacciones Hidrofóbicas e Hidrofílicas , Imagenología Tridimensional , Actividades Recreativas , Modelos Moleculares , Resonancia Magnética Nuclear Biomolecular , Estimulación Luminosa , Conformación Proteica , Proteínas/metabolismo , Procesos Estocásticos , Termodinámica
5.
J Mol Biol ; 428(5 Pt A): 748-757, 2016 Feb 27.
Artículo en Inglés | MEDLINE | ID: mdl-26902426

RESUMEN

Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications.


Asunto(s)
Conformación de Ácido Nucleico , ARN/química , Análisis de Secuencia de ARN/métodos , Algoritmos , Biología Computacional , Humanos , Modelos Moleculares , Programas Informáticos
6.
Methods Enzymol ; 487: 545-74, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-21187238

RESUMEN

We have recently completed a full re-architecturing of the ROSETTA molecular modeling program, generalizing and expanding its existing functionality. The new architecture enables the rapid prototyping of novel protocols by providing easy-to-use interfaces to powerful tools for molecular modeling. The source code of this rearchitecturing has been released as ROSETTA3 and is freely available for academic use. At the time of its release, it contained 470,000 lines of code. Counting currently unpublished protocols at the time of this writing, the source includes 1,285,000 lines. Its rapid growth is a testament to its ease of use. This chapter describes the requirements for our new architecture, justifies the design decisions, sketches out central classes, and highlights a few of the common tasks that the new software can perform.


Asunto(s)
Simulación por Computador , Sustancias Macromoleculares/química , Modelos Moleculares , Programas Informáticos , ADN/química
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA