Your browser doesn't support javascript.
loading
Chemistry-informed molecular graph as reaction descriptor for machine-learned retrosynthesis planning.
Zhang, Baicheng; Zhang, Xiaolong; Du, Wenjie; Song, Zhaokun; Zhang, Guozhen; Zhang, Guoqing; Wang, Yang; Chen, Xin; Jiang, Jun; Luo, Yi.
Afiliación
  • Zhang B; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Zhang X; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Du W; School of Software Engineering, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Song Z; Hefei JiShu Quantum Technology Co. Ltd., Hefei, Anhui 230026, China.
  • Zhang G; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Zhang G; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Wang Y; Hefei National Laboratory, University of Science and Technology of China, Hefei 230088, China.
  • Chen X; School of Software Engineering, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Jiang J; Gusu Laboratory of Materials, Suzhou, Jiangsu 215123, China.
  • Luo Y; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, China.
Proc Natl Acad Sci U S A ; 119(41): e2212711119, 2022 10 11.
Article en En | MEDLINE | ID: mdl-36191228
ABSTRACT
Infusing "chemical wisdom" should improve the data-driven approaches that rely exclusively on historical synthetic data for automatic retrosynthesis planning. For this purpose, we designed a chemistry-informed molecular graph (CIMG) to describe chemical reactions. A collection of key information that is most relevant to chemical reactions is integrated in CIMGNMR chemical shifts as vertex features, bond dissociation energies as edge features, and solvent/catalyst information as global features. For any given compound as a target, a product CIMG is generated and exploited by a graph neural network (GNN) model to choose reaction template(s) leading to this product. A reactant CIMG is then inferred and used in two GNN models to select appropriate catalyst and solvent, respectively. Finally, a fourth GNN model compares the two CIMG descriptors to check the plausibility of the proposed reaction. A reaction vector is obtained for every molecule in training these models. The chemical wisdom of reaction propensity contained in the pretrained reaction vectors is exploited to autocategorize molecules/reactions and to accelerate Monte Carlo tree search (MCTS) for multistep retrosynthesis planning. Full synthetic routes with recommended catalysts/solvents are predicted efficiently using this CIMG-based approach.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Redes Neurales de la Computación / Aprendizaje Automático Tipo de estudio: Health_economic_evaluation / Prognostic_studies Idioma: En Revista: Proc Natl Acad Sci U S A Año: 2022 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Redes Neurales de la Computación / Aprendizaje Automático Tipo de estudio: Health_economic_evaluation / Prognostic_studies Idioma: En Revista: Proc Natl Acad Sci U S A Año: 2022 Tipo del documento: Article País de afiliación: China