Transformer-based multitask learning for reaction prediction under low-resource circumstances.

Qiao, Haoran; Wu, Yejian; Zhang, Yun; Zhang, Chengyun; Wu, Xinyi; Wu, Zhipeng; Zhao, Qingjie; Wang, Xinqiao; Li, Huiyu; Duan, Hongliang

Qiao, Haoran; Wu, Yejian; Zhang, Yun; Zhang, Chengyun; Wu, Xinyi; Wu, Zhipeng; Zhao, Qingjie; Wang, Xinqiao; Li, Huiyu; Duan, Hongliang.

Affiliation

Qiao H; College of Mathematics and Physics, Shanghai University of Electric Power Shanghai 200090 China huiyuli@shiep.edu.cn.
Wu Y; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.
Zhang Y; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.
Zhang C; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.
Wu X; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.
Wu Z; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.
Zhao Q; Innovation Research Institute of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine Shanghai 201203 China.
Wang X; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.
Li H; College of Mathematics and Physics, Shanghai University of Electric Power Shanghai 200090 China huiyuli@shiep.edu.cn.
Duan H; Artificial Intelligence Aided Drug Discovery Institute, College of Pharmaceutical Sciences, Zhejiang University of Technology Hangzhou 310014 China hduan@zjut.edu.cn.

RSC Adv ; 12(49): 32020-32026, 2022 Nov 03.

Article in En | MEDLINE | ID: mdl-36380947

ABSTRACT

ABSTRACT

Recently, effective and rapid deep-learning methods for predicting chemical reactions have significantly aided the research and development of organic chemistry and drug discovery. Owing to the insufficiency of related chemical reaction data, computer-assisted predictions based on low-resource chemical datasets generally have low accuracy despite the exceptional ability of deep learning in retrosynthesis and synthesis. To address this issue, we introduce two types of multitask models retro-forward reaction prediction transformer (RFRPT) and multiforward reaction prediction transformer (MFRPT). These models integrate multitask learning with the transformer model to predict low-resource reactions in forward reaction prediction and retrosynthesis. Our results demonstrate that introducing multitask learning significantly improves the average top-1 accuracy, and the RFRPT (76.9%) and MFRPT (79.8%) outperform the transformer baseline model (69.9%). These results also demonstrate that a multitask framework can capture sufficient chemical knowledge and effectively mitigate the impact of the deficiency of low-resource data in processing reaction prediction tasks. Both RFRPT and MFRPT methods significantly improve the predictive performance of transformer models, which are powerful methods for eliminating the restriction of limited training data.

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: RSC Adv Year: 2022 Document type: Article

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google