ABSTRACT
We present a Gaussian process regression (GPR) scheme with an adaptive regularization scheme applied to the QM7 and QM9 test set, several protonated water clusters and specifically to the problem of atomic hydrogen adsorption on graphene sheets. For the last system our goal is to achieve good predictive accuracy with only a few training points. Therefore, we assess for these systems a self-correcting multilayer GPR model, in which the prediction is corrected by a chain of additional GPR models. In our adaptive regularization scheme, we impose no noise on the training data, but use an approach based on the data itself to account for its impurity. The strength of this strategy is that the data points are treated differently based on their importance and that the regularization can still be controlled by a single parameter. We assess how the accuracy of the prediction depends on this parameter. We can show that the new regularization scheme as well as the multilayer approach results in more robust predictors. Furthermore, we demonstrate that the predictor can be in good agreement with the density-functional theory results.
ABSTRACT
We present an automatized workflow which, starting from molecular dynamics simulations, identifies reaction events, filters them, and prepares them for accurate quantum chemical calculations using, for example, Density Functional Theory (DFT) or Coupled Cluster methods. The capabilities of the automatized workflow are demonstrated by the example of simulations for the combustion of some polycyclic aromatic hydrocarbons (PAHs). It is shown how key elementary reaction candidates are filtered out of a much larger set of redundant reactions and refined further. The molecular species in question are optimized using DFT and reaction energies, barrier heights, and reaction rates are calculated. The setup is general enough to include at this stage configurational sampling, which can be exploited in the future. Using the introduced machinery, we investigate how the observed reaction types depend on the gas atmosphere used in the molecular dynamics simulation. For the re-optimization on the DFT level, we show how the additional information needed to switch from reactive force-field to electronic structure calculations can be filled in and study how well ReaxFF and DFT agree with each other and shine light on the perspective of using more accurate semi-empirical methods in the MD simulation.