Your browser doesn't support javascript.
loading
A community-powered search of machine learning strategy space to find NMR property prediction models.
Bratholm, Lars A; Gerrard, Will; Anderson, Brandon; Bai, Shaojie; Choi, Sunghwan; Dang, Lam; Hanchar, Pavel; Howard, Addison; Kim, Sanghoon; Kolter, Zico; Kondor, Risi; Kornbluth, Mordechai; Lee, Youhan; Lee, Youngsoo; Mailoa, Jonathan P; Nguyen, Thanh Tu; Popovic, Milos; Rakocevic, Goran; Reade, Walter; Song, Wonho; Stojanovic, Luka; Thiede, Erik H; Tijanic, Nebojsa; Torrubia, Andres; Willmott, Devin; Butts, Craig P; Glowacki, David R.
Afiliação
  • Bratholm LA; School of Chemistry, University of Bristol, Bristol, United Kingdom.
  • Gerrard W; School of Mathematics, University of Bristol, Bristol, United Kingdom.
  • Anderson B; School of Chemistry, University of Bristol, Bristol, United Kingdom.
  • Bai S; Department of Computer Science, The University of Chicago, Chicago, IL, United States of America.
  • Choi S; Department of Statistics, The University of Chicago, Chicago, IL, United States of America.
  • Dang L; Atomwise, San Francisco, CA, United States of America.
  • Hanchar P; Bosch Center for Artificial Intelligence, Pittsburgh, PA, United States of America.
  • Howard A; Carnegie Mellon University, Pittsburgh, PA, United States of America.
  • Kim S; National Institute of Supercomputing and Network, Korea Institute of Science and Technology Information, Yuseong-gu, Daejeon, Republic of Korea.
  • Kolter Z; BNP Paribas Cardif, Nanterre Cedex, France.
  • Kondor R; Fyusion, Inc., San Francisco, CA, United States of America.
  • Kornbluth M; Kaggle, Google Inc., Mountain View, CA, United States of America.
  • Lee Y; Ebay Korea, Gangnam Gu, Seoul, Republic of Korea.
  • Lee Y; Bosch Center for Artificial Intelligence, Pittsburgh, PA, United States of America.
  • Mailoa JP; Carnegie Mellon University, Pittsburgh, PA, United States of America.
  • Nguyen TT; Department of Computer Science, The University of Chicago, Chicago, IL, United States of America.
  • Popovic M; Department of Statistics, The University of Chicago, Chicago, IL, United States of America.
  • Rakocevic G; Center for Computational Mathematics, Flatiron Institute, New York, NY, United States of America.
  • Reade W; Bosch Research and Technology Center, Cambridge, MA, United States of America.
  • Song W; Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology, Yuseong-gu, Daejeon, Republic of Korea.
  • Stojanovic L; MINDS AND COMPANY, Gangnam-gu, Seoul, Republic of Korea.
  • Thiede EH; Bosch Research and Technology Center, Cambridge, MA, United States of America.
  • Tijanic N; BNP Paribas Cardif, Nanterre Cedex, France.
  • Torrubia A; Totient Inc, Belgrade, Serbia.
  • Willmott D; Totient Inc, Belgrade, Serbia.
  • Butts CP; Kaggle, Google Inc., Mountain View, CA, United States of America.
  • Glowacki DR; KAIST Web Security & Privacy Lab, Yuseong-gu, Daejeon, Republic of Korea.
PLoS One ; 16(7): e0253612, 2021.
Article em En | MEDLINE | ID: mdl-34283864
ABSTRACT
The rise of machine learning (ML) has created an explosion in the potential strategies for using data to make scientific predictions. For physical scientists wishing to apply ML strategies to a particular domain, it can be difficult to assess in advance what strategy to adopt within a vast space of possibilities. Here we outline the results of an online community-powered effort to swarm search the space of ML strategies and develop algorithms for predicting atomic-pairwise nuclear magnetic resonance (NMR) properties in molecules. Using an open-source dataset, we worked with Kaggle to design and host a 3-month competition which received 47,800 ML model predictions from 2,700 teams in 84 countries. Within 3 weeks, the Kaggle community produced models with comparable accuracy to our best previously published 'in-house' efforts. A meta-ensemble model constructed as a linear combination of the top predictions has a prediction accuracy which exceeds that of any individual model, 7-19x better than our previous state-of-the-art. The results highlight the potential of transformer architectures for predicting quantum mechanical (QM) molecular properties.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Previsões / Ciência do Cidadão Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Previsões / Ciência do Cidadão Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article