Your browser doesn't support javascript.
loading
What Does the Machine Learn? Knowledge Representations of Chemical Reactivity.
Kammeraad, Joshua A; Goetz, Jack; Walker, Eric A; Tewari, Ambuj; Zimmerman, Paul M.
Affiliation
  • Kammeraad JA; Department of Chemistry, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States.
  • Goetz J; Department of Statistics, University of Michigan, 1085 South University Avenue, Ann Arbor, Michigan 48109, United States.
  • Walker EA; Department of Chemistry, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States.
  • Tewari A; Department of Statistics, University of Michigan, 1085 South University Avenue, Ann Arbor, Michigan 48109, United States.
  • Zimmerman PM; Department of Chemistry, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States.
J Chem Inf Model ; 60(3): 1290-1301, 2020 03 23.
Article in En | MEDLINE | ID: mdl-32091880
In a departure from conventional chemical approaches, data-driven models of chemical reactions have recently been shown to be statistically successful using machine learning. These models, however, are largely black box in character and have not provided the kind of chemical insights that historically advanced the field of chemistry. To examine the knowledgebase of machine-learning models-what does the machine learn-this article deconstructs black-box machine-learning models of a diverse chemical reaction data set. Through experimentation with chemical representations and modeling techniques, the analysis provides insights into the nature of how statistical accuracy can arise, even when the model lacks informative physical principles. By peeling back the layers of these complicated models we arrive at a minimal, chemically intuitive model (and no machine learning involved). This model is based on systematic reaction-type classification and Evans-Polanyi relationships within reaction types which are easily visualized and interpreted. Through exploring this simple model, we gain deeper understanding of the data set and uncover a means for expert interactions to improve the model's reliability.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Machine Learning Type of study: Prognostic_studies Language: En Journal: J Chem Inf Model Journal subject: INFORMATICA MEDICA / QUIMICA Year: 2020 Document type: Article Affiliation country: United States Country of publication: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Machine Learning Type of study: Prognostic_studies Language: En Journal: J Chem Inf Model Journal subject: INFORMATICA MEDICA / QUIMICA Year: 2020 Document type: Article Affiliation country: United States Country of publication: United States