Your browser doesn't support javascript.
loading
Ensemble learning of coarse-grained molecular dynamics force fields with a kernel approach.
Wang, Jiang; Chmiela, Stefan; Müller, Klaus-Robert; Noé, Frank; Clementi, Cecilia.
Afiliação
  • Wang J; Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA.
  • Chmiela S; Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany.
  • Müller KR; Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany.
  • Noé F; Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA.
  • Clementi C; Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA.
J Chem Phys ; 152(19): 194106, 2020 May 21.
Article em En | MEDLINE | ID: mdl-33687259
ABSTRACT
Gradient-domain machine learning (GDML) is an accurate and efficient approach to learn a molecular potential and associated force field based on the kernel ridge regression algorithm. Here, we demonstrate its application to learn an effective coarse-grained (CG) model from all-atom simulation data in a sample efficient manner. The CG force field is learned by following the thermodynamic consistency principle, here by minimizing the error between the predicted CG force and the all-atom mean force in the CG coordinates. Solving this problem by GDML directly is impossible because coarse-graining requires averaging over many training data points, resulting in impractical memory requirements for storing the kernel matrices. In this work, we propose a data-efficient and memory-saving alternative. Using ensemble learning and stratified sampling, we propose a 2-layer training scheme that enables GDML to learn an effective CG model. We illustrate our method on a simple biomolecular system, alanine dipeptide, by reconstructing the free energy landscape of a CG variant of this molecule. Our novel GDML training scheme yields a smaller free energy error than neural networks when the training set is small, and a comparably high accuracy when the training set is sufficiently large.

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Revista: J Chem Phys Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Revista: J Chem Phys Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos