Transformer-based multitask learning for reaction prediction under low-resource circumstances.
RSC Adv; 12(49): 32020-32026, 2022 Nov 03.
Article in English | MEDLINE | ID: mdl-36380947
Recently, effective and rapid deep-learning methods for predicting chemical reactions have significantly aided research and development in organic chemistry and drug discovery. However, because relevant chemical reaction data are often scarce, computer-assisted predictions built on low-resource chemical datasets generally have low accuracy, despite the exceptional ability of deep learning in retrosynthesis and synthesis. To address this issue, we introduce two types of multitask models: the retro-forward reaction prediction transformer (RFRPT) and the multiforward reaction prediction transformer (MFRPT). These models integrate multitask learning with the transformer model to predict low-resource reactions in forward reaction prediction and retrosynthesis. Our results demonstrate that introducing multitask learning significantly improves the average top-1 accuracy: RFRPT (76.9%) and MFRPT (79.8%) outperform the transformer baseline model (69.9%). These results also show that a multitask framework can capture sufficient chemical knowledge and effectively mitigate the impact of data scarcity in reaction prediction tasks. Both RFRPT and MFRPT significantly improve the predictive performance of transformer models, making them powerful approaches for overcoming the restrictions of limited training data.
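The following is a minimal sketch, not the authors' code, of the general multitask idea the abstract describes: a single shared sequence-to-sequence transformer trained jointly on forward (reactants to product) and retro (product to reactants) prediction, with a task token in the source sequence selecting the direction. The vocabulary, task-token names, toy SMILES, and hyperparameters are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Hypothetical character-level vocabulary over SMILES plus special/task tokens.
VOCAB = ["<pad>", "<bos>", "<eos>", "<forward>", "<retro>",
         "C", "c", "O", "N", "(", ")", "=", "1", "2", "."]
STOI = {t: i for i, t in enumerate(VOCAB)}
PAD = STOI["<pad>"]

def encode(tokens, max_len=16):
    """Map a token list to fixed-length id tensor input, padding with <pad>."""
    ids = [STOI[t] for t in tokens][:max_len]
    return ids + [PAD] * (max_len - len(ids))

class MultitaskReactionTransformer(nn.Module):
    """One shared encoder-decoder; the task token in the source sequence
    tells the model whether to predict products (forward) or reactants (retro)."""
    def __init__(self, vocab_size, d_model=64, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model, padding_idx=PAD)
        self.transformer = nn.Transformer(
            d_model=d_model, nhead=nhead,
            num_encoder_layers=num_layers, num_decoder_layers=num_layers,
            batch_first=True)
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, src, tgt):
        # Causal mask so each target position only attends to earlier positions.
        tgt_mask = self.transformer.generate_square_subsequent_mask(tgt.size(1))
        h = self.transformer(self.embed(src), self.embed(tgt), tgt_mask=tgt_mask,
                             src_key_padding_mask=(src == PAD),
                             tgt_key_padding_mask=(tgt == PAD))
        return self.out(h)

# Toy batch mixing both tasks in one update; real training would use reaction
# SMILES from the low-resource dataset (and any auxiliary-task data).
src = torch.tensor([encode(["<forward>", "C", "C", "O", ".", "O"]),
                    encode(["<retro>", "C", "C", "=", "O"])])
tgt = torch.tensor([encode(["<bos>", "C", "C", "=", "O", "<eos>"]),
                    encode(["<bos>", "C", "C", "O", ".", "O", "<eos>"])])

model = MultitaskReactionTransformer(len(VOCAB))
logits = model(src, tgt[:, :-1])                      # teacher forcing
loss = nn.CrossEntropyLoss(ignore_index=PAD)(
    logits.reshape(-1, len(VOCAB)), tgt[:, 1:].reshape(-1))
loss.backward()  # both tasks update the same shared parameters
print(float(loss))
```

Because both directions share every parameter, gradients from each task regularize the other, which is one plausible way a multitask setup can compensate for limited per-task data.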
Full text: 1
Collections: 01-internacional
Database: MEDLINE
Study type: Prognostic_studies / Risk_factors_studies
Language: En
Publication year: 2022
Document type: Article