Your browser doesn't support javascript.
loading
Structure-Aware Multimodal Deep Learning for Drug-Protein Interaction Prediction.
Wang, Penglei; Zheng, Shuangjia; Jiang, Yize; Li, Chengtao; Liu, Junhong; Wen, Chang; Patronov, Atanas; Qian, Dahong; Chen, Hongming; Yang, Yuedong.
Affiliation
  • Wang P; School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Zheng S; School of Data and Computer Science, Sun Yat-Sen Universit, Guangzhou 510275, China.
  • Jiang Y; Galixir, Beijing 100080, China.
  • Li C; Galixir, Beijing 100080, China.
  • Liu J; Galixir, Beijing 100080, China.
  • Wen C; Guangzhou Laboratory, Guangzhou 510000, China.
  • Patronov A; MolecularAI, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Gothenburg 405 30, Sweden.
  • Qian D; School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Chen H; Guangzhou Laboratory, Guangzhou 510000, China.
  • Yang Y; School of Data and Computer Science, Sun Yat-Sen Universit, Guangzhou 510275, China.
J Chem Inf Model ; 62(5): 1308-1317, 2022 03 14.
Article in En | MEDLINE | ID: mdl-35200015
ABSTRACT
Identifying drug-protein interactions (DPIs) is crucial in drug discovery, and a number of machine learning methods have been developed to predict DPIs. Existing methods usually use unrealistic data sets with hidden bias, which will limit the accuracy of virtual screening methods. Meanwhile, most DPI prediction methods pay more attention to molecular representation but lack effective research on protein representation and high-level associations between different instances. To this end, we present the novel structure-aware multimodal deep DPI prediction model, STAMP-DPI, which was trained on a curated industry-scale benchmark data set. We built a high-quality benchmark data set named GalaxyDB for DPI prediction. This industry-scale data set along with an unbiased training procedure resulted in a more robust benchmark study. For informative protein representation, we constructed a structure-aware graph neural network method from the protein sequence by combining predicted contact maps and graph neural networks. Through further integration of structure-based representation and high-level pretrained embeddings for molecules and proteins, our model effectively captures the feature representation of the interactions between them. As a result, STAMP-DPI outperformed state-of-the-art DPI prediction methods by decreasing 7.00% mean square error (MSE) in the Davis data set and improving 8.89% area under the curve (AUC) in the GalaxyDB data set. Moreover, our model is an interpretable model with the transformer-based interaction mechanism, which can accurately reveal the binding sites between molecules and proteins.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Deep Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: J Chem Inf Model Journal subject: INFORMATICA MEDICA / QUIMICA Year: 2022 Document type: Article Affiliation country: China Publication country: EEUU / ESTADOS UNIDOS / ESTADOS UNIDOS DA AMERICA / EUA / UNITED STATES / UNITED STATES OF AMERICA / US / USA

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Deep Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: J Chem Inf Model Journal subject: INFORMATICA MEDICA / QUIMICA Year: 2022 Document type: Article Affiliation country: China Publication country: EEUU / ESTADOS UNIDOS / ESTADOS UNIDOS DA AMERICA / EUA / UNITED STATES / UNITED STATES OF AMERICA / US / USA