Your browser doesn't support javascript.
loading
ALipSol: An Attention-Driven Mixture-of-Experts Model for Lipophilicity and Solubility Prediction.
Wu, Jialu; Wang, Junmei; Wu, Zhenxing; Zhang, Shengyu; Deng, Yafeng; Kang, Yu; Cao, Dongsheng; Hsieh, Chang-Yu; Hou, Tingjun.
Afiliação
  • Wu J; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China.
  • Wang J; CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018Zhejiang, P. R. China.
  • Wu Z; Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, Pennsylvania15261, United States.
  • Zhang S; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China.
  • Deng Y; CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018Zhejiang, P. R. China.
  • Kang Y; Tencent Quantum Laboratory, Tencent, Shenzhen, 518057Guangdong, P. R. China.
  • Cao D; CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018Zhejiang, P. R. China.
  • Hsieh CY; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China.
  • Hou T; Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, 410004Hunan, P. R. China.
J Chem Inf Model ; 62(23): 5975-5987, 2022 Dec 12.
Article em En | MEDLINE | ID: mdl-36417544
ABSTRACT
Lipophilicity (logD) and aqueous solubility (logSw) play a central role in drug development. The accurate prediction of these properties remains to be solved due to data scarcity. Current methodologies neglect the intrinsic relationships between physicochemical properties and usually ignore the ionization effects. Here, we propose an attention-driven mixture-of-experts (MoE) model named ALipSol, which explicitly reproduces the hierarchy of task relationships. We adopt the principle of divide-and-conquer by breaking down the complex end point (logD or logSw) into simpler ones (acidic pKa, basic pKa, and logP) and allocating a specific expert network for each subproblem. Subsequently, we implement transfer learning to extract knowledge from related tasks, thus alleviating the dilemma of limited data. Additionally, we substitute the gating network with an attention mechanism to better capture the dynamic task relationships on a per-example basis. We adopt local fine-tuning and consensus prediction to further boost model performance. Extensive evaluation experiments verify the success of the ALipSol model, which achieves RMSE improvement of 8.04%, 2.49%, 8.57%, 12.8%, and 8.60% on the Lipop, ESOL, AqSolDB, external logD, and external logS data sets, respectively, compared with Attentive FP and the state-of-the-art in silico tools. In particular, our model yields more significant advantages (Welch's t-test) for small training data, implying its high robustness and generalizability. The interpretability analysis proves that the atom contributions learned by ALipSol are more reasonable compared with the vanilla Attentive FP, and the substitution effects in benzene derivatives agreed well with empirical constants, revealing the potential of our model to extract useful patterns from data and provide guidance for lead optimization.
Assuntos

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Água Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Chem Inf Model Assunto da revista: INFORMATICA MEDICA / QUIMICA Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Água Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: J Chem Inf Model Assunto da revista: INFORMATICA MEDICA / QUIMICA Ano de publicação: 2022 Tipo de documento: Article