Your browser doesn't support javascript.
loading
Size-independent neural networks based first-principles method for accurate prediction of heat of formation of fuels.
Yang, GuanYa; Wu, Jiang; Chen, ShuGuang; Zhou, WeiJun; Sun, Jian; Chen, GuanHua.
Afiliação
  • Yang G; Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
  • Wu J; Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
  • Chen S; Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
  • Zhou W; Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
  • Sun J; Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
  • Chen G; Department of Chemistry, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
J Chem Phys ; 148(24): 241738, 2018 Jun 28.
Article em En | MEDLINE | ID: mdl-29960359
Neural network-based first-principles method for predicting heat of formation (HOF) was previously demonstrated to be able to achieve chemical accuracy in a broad spectrum of target molecules [L. H. Hu et al., J. Chem. Phys. 119, 11501 (2003)]. However, its accuracy deteriorates with the increase in molecular size. A closer inspection reveals a systematic correlation between the prediction error and the molecular size, which appears correctable by further statistical analysis, calling for a more sophisticated machine learning algorithm. Despite the apparent difference between simple and complex molecules, all the essential physical information is already present in a carefully selected set of small molecule representatives. A model that can capture the fundamental physics would be able to predict large and complex molecules from information extracted only from a small molecules database. To this end, a size-independent, multi-step multi-variable linear regression-neural network-B3LYP method is developed in this work, which successfully improves the overall prediction accuracy by training with smaller molecules only. And in particular, the calculation errors for larger molecules are drastically reduced to the same magnitudes as those of the smaller molecules. Specifically, the method is based on a 164-molecule database that consists of molecules made of hydrogen and carbon elements. 4 molecular descriptors were selected to encode molecule's characteristics, among which raw HOF calculated from B3LYP and the molecular size are also included. Upon the size-independent machine learning correction, the mean absolute deviation (MAD) of the B3LYP/6-311+G(3df,2p)-calculated HOF is reduced from 16.58 to 1.43 kcal/mol and from 17.33 to 1.69 kcal/mol for the training and testing sets (small molecules), respectively. Furthermore, the MAD of the testing set (large molecules) is reduced from 28.75 to 1.67 kcal/mol.

Texto completo: 1 Bases de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Chem Phys Ano de publicação: 2018 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Chem Phys Ano de publicação: 2018 Tipo de documento: Article País de afiliação: China