Your browser doesn't support javascript.
loading
Ribonanza: deep learning of RNA structure through dual crowdsourcing.
He, Shujun; Huang, Rui; Townley, Jill; Kretsch, Rachael C; Karagianes, Thomas G; Cox, David B T; Blair, Hamish; Penzar, Dmitry; Vyaltsev, Valeriy; Aristova, Elizaveta; Zinkevich, Arsenii; Bakulin, Artemy; Sohn, Hoyeol; Krstevski, Daniel; Fukui, Takaaki; Tatematsu, Fumiya; Uchida, Yusuke; Jang, Donghoon; Lee, Jun Seong; Shieh, Roger; Ma, Tom; Martynov, Eduard; Shugaev, Maxim V; Bukhari, Habib S T; Fujikawa, Kazuki; Onodera, Kazuki; Henkel, Christof; Ron, Shlomo; Romano, Jonathan; Nicol, John J; Nye, Grace P; Wu, Yuan; Choe, Christian; Reade, Walter; Das, Rhiju.
Afiliação
  • He S; Department of Chemical Engineering, Texas A&M University, TX, USA.
  • Huang R; Department of Biochemistry, Stanford CA, USA.
  • Townley J; Eterna Massive Open Laboratory.
  • Kretsch RC; Biophysics Program, Stanford CA, USA.
  • Karagianes TG; Department of Biochemistry, Stanford CA, USA.
  • Cox DBT; Department of Biochemistry, Stanford CA, USA.
  • Blair H; Department of Medicine, Division of Hematology, and Department of Biochemistry, Stanford CA, USA.
  • Penzar D; Department of Mathematics, Stanford CA, USA.
  • Vyaltsev V; AIRI, Moscow, Russia.
  • Aristova E; Vavilov Institute of General Genetics, Moscow 119991, Russia.
  • Zinkevich A; Institute of Translational Medicine, Pirogov Russian National Research Medical University, Moscow 117997, Russia.
  • Bakulin A; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russian Federation.
  • Sohn H; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russian Federation.
  • Krstevski D; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russian Federation.
  • Fukui T; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russian Federation.
  • Tatematsu F; Department of Chemical Engineering, Texas A&M University, TX, USA.
  • Uchida Y; Department of Biochemistry, Stanford CA, USA.
  • Jang D; Eterna Massive Open Laboratory.
  • Lee JS; Biophysics Program, Stanford CA, USA.
  • Shieh R; Department of Medicine, Division of Hematology, and Department of Biochemistry, Stanford CA, USA.
  • Ma T; Department of Mathematics, Stanford CA, USA.
  • Martynov E; AIRI, Moscow, Russia.
  • Shugaev MV; Vavilov Institute of General Genetics, Moscow 119991, Russia.
  • Bukhari HST; Institute of Translational Medicine, Pirogov Russian National Research Medical University, Moscow 117997, Russia.
  • Fujikawa K; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russian Federation.
  • Onodera K; GO Inc., Tokyo, Japan.
  • Henkel C; Department of Electrical and Computer Engineering, Inha University, Incheon, Republic of Korea.
  • Ron S; DeltaX, Seoul, Republic of Korea.
  • Romano J; Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, Russian Federation.
  • Nicol JJ; Department of Materials Science and Engineering, University of Virginia, Charlottesville, VA 22904-4745, USA.
  • Nye GP; Vergesense, CA.
  • Wu Y; DeNA, Tokyo, Japan.
  • Choe C; NVIDIA, Tokyo, Japan.
  • Reade W; NVIDIA, Munich.
  • Das R; Department of Bioengineering, Stanford CA, USA.
bioRxiv ; 2024 Jun 11.
Article em En | MEDLINE | ID: mdl-38464325
ABSTRACT
Prediction of RNA structure from sequence remains an unsolved problem, and progress has been slowed by a paucity of experimental data. Here, we present Ribonanza, a dataset of chemical mapping measurements on two million diverse RNA sequences collected through Eterna and other crowdsourced initiatives. Ribonanza measurements enabled solicitation, training, and prospective evaluation of diverse deep neural networks through a Kaggle challenge, followed by distillation into a single, self-contained model called RibonanzaNet. When fine tuned on auxiliary datasets, RibonanzaNet achieves state-of-the-art performance in modeling experimental sequence dropout, RNA hydrolytic degradation, and RNA secondary structure, with implications for modeling RNA tertiary structure.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article