Your browser doesn't support javascript.
loading
Breaking the barriers of data scarcity in drug-target affinity prediction.
Pei, Qizhi; Wu, Lijun; Zhu, Jinhua; Xia, Yingce; Xie, Shufang; Qin, Tao; Liu, Haiguang; Liu, Tie-Yan; Yan, Rui.
Afiliação
  • Pei Q; Gaoling School of Artificial Intelligence, Renmin University of China, No.59, Zhong Guan Cun Avenue, Haidian District, 100872, Beijing, China.
  • Wu L; Microsoft Research AI4Science, No.5, Dan Ling Street, Haidian District, 100080, Beijing, China.
  • Zhu J; CAS Key Laboratory of GIPAS, EEIS Department, University of Science and Technology of China, No.96, JinZhai Road, Baohe District, 230026, Hefei, Anhui Province, China.
  • Xia Y; Microsoft Research AI4Science, No.5, Dan Ling Street, Haidian District, 100080, Beijing, China.
  • Xie S; Gaoling School of Artificial Intelligence, Renmin University of China, No.59, Zhong Guan Cun Avenue, Haidian District, 100872, Beijing, China.
  • Qin T; Engineering Research Center of Next-Generation Intelligent Search and Recommendation, Ministry of Education.
  • Liu H; Microsoft Research AI4Science, No.5, Dan Ling Street, Haidian District, 100080, Beijing, China.
  • Liu TY; Microsoft Research AI4Science, No.5, Dan Ling Street, Haidian District, 100080, Beijing, China.
  • Yan R; Beijing Key Laboratory of Big Data Management and Analysis Methods.
Brief Bioinform ; 24(6)2023 09 22.
Article em En | MEDLINE | ID: mdl-37903413
ABSTRACT
Accurate prediction of drug-target affinity (DTA) is of vital importance in early-stage drug discovery, facilitating the identification of drugs that can effectively interact with specific targets and regulate their activities. While wet experiments remain the most reliable method, they are time-consuming and resource-intensive, resulting in limited data availability that poses challenges for deep learning approaches. Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue. To overcome this challenge, we present the Semi-Supervised Multi-task training (SSM) framework for DTA prediction, which incorporates three simple yet highly effective strategies (1) A multi-task training approach that combines DTA prediction with masked language modeling using paired drug-target data. (2) A semi-supervised training method that leverages large-scale unpaired molecules and proteins to enhance drug and target representations. This approach differs from previous methods that only employed molecules or proteins in pre-training. (3) The integration of a lightweight cross-attention module to improve the interaction between drugs and targets, further enhancing prediction accuracy. Through extensive experiments on benchmark datasets such as BindingDB, DAVIS and KIBA, we demonstrate the superior performance of our framework. Additionally, we conduct case studies on specific drug-target binding activities, virtual screening experiments, drug feature visualizations and real-world applications, all of which showcase the significant potential of our work. In conclusion, our proposed SSM-DTA framework addresses the data limitation challenge in DTA prediction and yields promising results, paving the way for more efficient and accurate drug discovery processes.
Assuntos
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Benchmarking / Descoberta de Drogas Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Benchmarking / Descoberta de Drogas Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China