Your browser doesn't support javascript.
loading
Message-passing neural networks for high-throughput polymer screening.
St John, Peter C; Phillips, Caleb; Kemper, Travis W; Wilson, A Nolan; Guan, Yanfei; Crowley, Michael F; Nimlos, Mark R; Larsen, Ross E.
Afiliação
  • St John PC; Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
  • Phillips C; Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
  • Kemper TW; Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
  • Wilson AN; National Biaoenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
  • Guan Y; Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, USA.
  • Crowley MF; Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
  • Nimlos MR; National Biaoenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
  • Larsen RE; Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA.
J Chem Phys ; 150(23): 234111, 2019 Jun 21.
Article em En | MEDLINE | ID: mdl-31228909
ABSTRACT
Machine learning methods have shown promise in predicting molecular properties, and given sufficient training data, machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure and have consistently achieved the best performance on benchmark quantum chemical datasets. However, these models have typically required optimized 3D structural information for the molecule to achieve the highest accuracy. These 3D geometries are costly to compute for high levels of theory, limiting the applicability and practicality of machine learning methods in high-throughput screening applications. In this study, we present a new database of candidate molecules for organic photovoltaic applications, comprising approximately 91 000 unique chemical structures. Compared to existing datasets, this dataset contains substantially larger molecules (up to 200 atoms) as well as extrapolated properties for long polymer chains. We show that message-passing neural networks trained with and without 3D structural information for these molecules achieve similar accuracy, comparable to state-of-the-art methods on existing benchmark datasets. These results therefore emphasize that for larger molecules with practical applications, near-optimal prediction results can be obtained without using optimized 3D geometry as an input. We further show that learned molecular representations can be leveraged to reduce the training data required to transfer predictions to a new density functional theory functional.

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Prognostic_studies / Screening_studies Idioma: En Revista: J Chem Phys Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Prognostic_studies / Screening_studies Idioma: En Revista: J Chem Phys Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos