Pesquisa | BVS Doenças Infecciosas e Parasitárias

1.

Gram matrix: an efficient representation of molecular conformation and learning objective for molecular pretraining.

Xiang, Wenkai; Zhong, Feisheng; Ni, Lin; Zheng, Mingyue; Li, Xutong; Shi, Qian; Wang, Dingyan.

Brief Bioinform ; 25(4)2024 May 23.

Artigo em Inglês | MEDLINE | ID: mdl-38990515

RESUMO

Accurate prediction of molecular properties is fundamental in drug discovery and development, providing crucial guidance for effective drug design. A critical factor in achieving accurate molecular property prediction lies in the appropriate representation of molecular structures. Presently, prevalent deep learning-based molecular representations rely on 2D structure information as the primary molecular representation, often overlooking essential three-dimensional (3D) conformational information due to the inherent limitations of 2D structures in conveying atomic spatial relationships. In this study, we propose employing the Gram matrix as a condensed representation of 3D molecular structures and for efficient pretraining objectives. Subsequently, we leverage this matrix to construct a novel molecular representation model, Pre-GTM, which inherently encapsulates 3D information. The model accurately predicts the 3D structure of a molecule by estimating the Gram matrix. Our findings demonstrate that Pre-GTM model outperforms the baseline Graphormer model and other pretrained models in the QM9 and MoleculeNet quantitative property prediction task. The integration of the Gram matrix as a condensed representation of 3D molecular structure, incorporated into the Pre-GTM model, opens up promising avenues for its potential application across various domains of molecular research, including drug design, materials science, and chemical engineering.

Assuntos

Conformação Molecular , Modelos Moleculares , Desenho de Fármacos , Aprendizado Profundo , Descoberta de Drogas , Algoritmos

2.

KinomeMETA: meta-learning enhanced kinome-wide polypharmacology profiling.

Ren, Qun; Qu, Ning; Sun, Jingjing; Zhou, Jingyi; Liu, Jin; Ni, Lin; Tong, Xiaochu; Zhang, Zimei; Kong, Xiangtai; Wen, Yiming; Wang, Yitian; Wang, Dingyan; Luo, Xiaomin; Zhang, Sulin; Zheng, Mingyue; Li, Xutong.

Brief Bioinform ; 25(1)2023 11 22.

Artigo em Inglês | MEDLINE | ID: mdl-38113075

RESUMO

Kinase inhibitors are crucial in cancer treatment, but drug resistance and side effects hinder the development of effective drugs. To address these challenges, it is essential to analyze the polypharmacology of kinase inhibitor and identify compound with high selectivity profile. This study presents KinomeMETA, a framework for profiling the activity of small molecule kinase inhibitors across a panel of 661 kinases. By training a meta-learner based on a graph neural network and fine-tuning it to create kinase-specific learners, KinomeMETA outperforms benchmark multi-task models and other kinase profiling models. It provides higher accuracy for understudied kinases with limited known data and broader coverage of kinase types, including important mutant kinases. Case studies on the discovery of new scaffold inhibitors for membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase and selective inhibitors for fibroblast growth factor receptors demonstrate the role of KinomeMETA in virtual screening and kinome-wide activity profiling. Overall, KinomeMETA has the potential to accelerate kinase drug discovery by more effectively exploring the kinase polypharmacology landscape.

Assuntos

Antineoplásicos , Polifarmacologia , Proteínas Serina-Treonina Quinases , Descoberta de Drogas

3.

An inductive graph neural network model for compound-protein interaction prediction based on a homogeneous graph.

Wan, Xiaozhe; Wu, Xiaolong; Wang, Dingyan; Tan, Xiaoqin; Liu, Xiaohong; Fu, Zunyun; Jiang, Hualiang; Zheng, Mingyue; Li, Xutong.

Brief Bioinform ; 23(3)2022 05 13.

Artigo em Inglês | MEDLINE | ID: mdl-35275993

RESUMO

Identifying the potential compound-protein interactions (CPIs) plays an essential role in drug development. The computational approaches for CPI prediction can reduce time and costs of experimental methods and have benefited from the continuously improved graph representation learning. However, most of the network-based methods use heterogeneous graphs, which is challenging due to their complex structures and heterogeneous attributes. Therefore, in this work, we transformed the compound-protein heterogeneous graph to a homogeneous graph by integrating the ligand-based protein representations and overall similarity associations. We then proposed an Inductive Graph AggrEgator-based framework, named CPI-IGAE, for CPI prediction. CPI-IGAE learns the low-dimensional representations of compounds and proteins from the homogeneous graph in an end-to-end manner. The results show that CPI-IGAE performs better than some state-of-the-art methods. Further ablation study and visualization of embeddings reveal the advantages of the model architecture and its role in feature extraction, and some of the top ranked CPIs by CPI-IGAE have been validated by a review of recent literature. The data and source codes are available at https://github.com/wanxiaozhe/CPI-IGAE.

Assuntos

Desenvolvimento de Medicamentos , Redes Neurais de Computação , Mapas de Interação de Proteínas , Proteínas , Mapeamento de Interação de Proteínas , Proteínas/química , Software

4.

Drug repurposing against breast cancer by integrating drug-exposure expression profiles and drug-drug links based on graph neural network.

Cui, Chen; Ding, Xiaoyu; Wang, Dingyan; Chen, Lifan; Xiao, Fu; Xu, Tingyang; Zheng, Mingyue; Luo, Xiaomin; Jiang, Hualiang; Chen, Kaixian.

Bioinformatics ; 37(18): 2930-2937, 2021 09 29.

Artigo em Inglês | MEDLINE | ID: mdl-33739367

RESUMO

MOTIVATION: Breast cancer is one of the leading causes of cancer deaths among women worldwide. It is necessary to develop new breast cancer drugs because of the shortcomings of existing therapies. The traditional discovery process is time-consuming and expensive. Repositioning of clinically approved drugs has emerged as a novel approach for breast cancer therapy. However, serendipitous or experiential repurposing cannot be used as a routine method. RESULTS: In this study, we proposed a graph neural network model GraphRepur based on GraphSAGE for drug repurposing against breast cancer. GraphRepur integrated two major classes of computational methods, drug network-based and drug signature-based. The differentially expressed genes of disease, drug-exposure gene expression data and the drug-drug links information were collected. By extracting the drug signatures and topological structure information contained in the drug relationships, GraphRepur can predict new drugs for breast cancer, outperforming previous state-of-the-art approaches and some classic machine learning methods. The high-ranked drugs have indeed been reported as new uses for breast cancer treatment recently. AVAILABILITYAND IMPLEMENTATION: The source code of our model and datasets are available at: https://github.com/cckamy/GraphRepur and https://figshare.com/articles/software/GraphRepur_Breast_Cancer_Drug_Repurposing/14220050. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Neoplasias da Mama , Feminino , Humanos , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Reposicionamento de Medicamentos/métodos , Software , Redes Neurais de Computação , Aprendizado de Máquina

5.

TransformerCPI: improving compound-protein interaction prediction by sequence-based deep learning with self-attention mechanism and label reversal experiments.

Chen, Lifan; Tan, Xiaoqin; Wang, Dingyan; Zhong, Feisheng; Liu, Xiaohong; Yang, Tianbiao; Luo, Xiaomin; Chen, Kaixian; Jiang, Hualiang; Zheng, Mingyue.

Bioinformatics ; 36(16): 4406-4414, 2020 08 15.

Artigo em Inglês | MEDLINE | ID: mdl-32428219

RESUMO

MOTIVATION: Identifying compound-protein interaction (CPI) is a crucial task in drug discovery and chemogenomics studies, and proteins without three-dimensional structure account for a large part of potential biological targets, which requires developing methods using only protein sequence information to predict CPI. However, sequence-based CPI models may face some specific pitfalls, including using inappropriate datasets, hidden ligand bias and splitting datasets inappropriately, resulting in overestimation of their prediction performance. RESULTS: To address these issues, we here constructed new datasets specific for CPI prediction, proposed a novel transformer neural network named TransformerCPI, and introduced a more rigorous label reversal experiment to test whether a model learns true interaction features. TransformerCPI achieved much improved performance on the new experiments, and it can be deconvolved to highlight important interacting regions of protein sequences and compound atoms, which may contribute chemical biology studies with useful guidance for further ligand structural optimization. AVAILABILITY AND IMPLEMENTATION: https://github.com/lifanchen-simm/transformerCPI.

Assuntos

Aprendizado Profundo , Sequência de Aminoácidos , Ligantes , Redes Neurais de Computação , Proteínas/genética

6.

Distinctive Roles and Mechanisms of Human Neutrophil Peptides in Experimental Sepsis and Acute Respiratory Distress Syndrome.

Wu, Jianfeng; Han, Bing; Fanelli, Vito; Wen, Xiaoyan; Huang, Yongbo; Luo, Alice; Ghazarian, Mirna; Wang, Dingyan; Khang, Julie; Morriello, Florence; Liaw, Patricia C; Marshall, John; Zhong, Nanshan; Guan, Xiangdong; Slutsky, Arthur S; Li, Yimin; Zhang, Haibo.

Crit Care Med ; 46(9): e921-e927, 2018 09.

Artigo em Inglês | MEDLINE | ID: mdl-29979223

RESUMO

OBJECTIVES: To examine the effects and mechanisms of human neutrophil peptides in systemic infection and noninfectious inflammatory lung injury. DESIGN: Prospective experimental study. SETTING: University hospital-based research laboratory. SUBJECTS: In vitro human cells and in vivo mouse models. INTERVENTIONS: Wild-type (Friend virus B-type) and conditional leukocyte human neutrophil peptides transgenic mice were subjected to either sepsis induced by cecal ligation and puncture or acute lung injury by intratracheal instillation of hydrochloric acid followed by mechanical ventilation. Using human neutrophil peptides as bait, the basal cell adhesion molecule (CD239) and the purinergic P2Y purinoceptor 6 receptor were identified as the putative human neutrophil peptides receptor complex in human lung epithelial cells. MEASUREMENTS AND MAIN RESULTS: In the cecal ligation and puncture sepsis model, Friend virus B-type mice exhibited higher systemic bacterial load, cytokine production, and lung injury than human neutrophil peptides transgenic mice. Conversely, an increased lung cytokine production was seen in Friend virus B-type mice, which was further enhanced in human neutrophil peptides transgenic mice in response to two-hit lung injury induced by hydrochloric acid and mechanical ventilation. The human neutrophil peptides-mediated inflammatory response was mediated through the basal cell adhesion molecule-P2Y purinoceptor 6 receptor signal pathway in human lung epithelial cells. CONCLUSIONS: Human neutrophil peptides are critical in host defense against infectious sepsis by their cationic antimicrobial properties but may exacerbate tissue injury when neutrophil-mediated inflammatory responses are excessive in noninfectious lung injury. Targeting the basal cell adhesion molecule/P2Y purinoceptor 6 signaling pathway may serve as a novel approach to attenuate the neutrophil-mediated inflammatory responses and injury while maintaining the antimicrobial function of human neutrophil peptides in critical illness.

Assuntos

Síndrome do Desconforto Respiratório/imunologia , Sepse/imunologia , alfa-Defensinas/fisiologia , Células Epiteliais Alveolares , Animais , Células Cultivadas , Modelos Animais de Doenças , Células Epiteliais , Humanos , Camundongos

7.

Mechanical Stress and the Induction of Lung Fibrosis via the Midkine Signaling Pathway.

Zhang, Rong; Pan, Ying; Fanelli, Vito; Wu, Sulong; Luo, Alice Aili; Islam, Diana; Han, Bing; Mao, Pu; Ghazarian, Mirna; Zeng, Wenmei; Spieth, Peter M; Wang, Dingyan; Khang, Julie; Mo, Hongyin; Liu, Xiaoqing; Uhlig, Stefan; Liu, Mingyao; Laffey, John; Slutsky, Arthur S; Li, Yimin; Zhang, Haibo.

Am J Respir Crit Care Med ; 192(3): 315-23, 2015 Aug 01.

Artigo em Inglês | MEDLINE | ID: mdl-25945397

RESUMO

RATIONALE: Lung-protective ventilatory strategies have been widely used in patients with acute respiratory distress syndrome (ARDS), but the ARDS mortality rate remains unacceptably high and there is no proven pharmacologic therapy. OBJECTIVES: Mechanical ventilation can induce oxidative stress and lung fibrosis, which may contribute to high dependency on ventilator support and increased ARDS mortality. We hypothesized that the novel cytokine, midkine (MK), which can be up-regulated in oxidative stress, plays a key role in the pathogenesis of ARDS-associated lung fibrosis. METHODS: Blood samples were collected from 17 patients with ARDS and 10 healthy donors. Human lung epithelial cells were challenged with hydrogen chloride followed by mechanical stretch for 72 hours. Wild-type and MK gene-deficient (MK(-/-)) mice received two-hit injury of acid aspiration and mechanical ventilation, and were monitored for 14 days. MEASUREMENTS AND MAIN RESULTS: Plasma concentrations of MK were higher in patients with ARDS than in healthy volunteers. Exposure to mechanical stretch of lung epithelial cells led to an epithelial-mesenchymal transition profile associated with increased expression of angiotensin-converting enzyme, which was attenuated by silencing MK, its receptor Notch2, or NADP reduced oxidase 1. An increase in collagen deposition and hydroxyproline level and a decrease in lung tissue compliance seen in wild-type mice were largely attenuated in MK(-/-) mice. CONCLUSIONS: Mechanical stretch can induce an epithelial-mesenchymal transition phenotype mediated by the MK-Notch2-angiotensin-converting enzyme signaling pathway, contributing to lung remodeling. The MK pathway is a potential therapeutic target in the context of ARDS-associated lung fibrosis.

Assuntos

Citocinas/sangue , Fibrose Pulmonar/fisiopatologia , Respiração Artificial , Síndrome do Desconforto Respiratório/fisiopatologia , Transdução de Sinais/fisiologia , Estresse Mecânico , Animais , Células Cultivadas , Modelos Animais de Doenças , Transição Epitelial-Mesenquimal/fisiologia , Feminino , Humanos , Masculino , Camundongos , Pessoa de Meia-Idade , Midkina , Fibrose Pulmonar/sangue , Síndrome do Desconforto Respiratório/sangue

8.

Application of ensemble learning for predicting GABA_A receptor agonists.

Xiao, Fu; Ding, Xiaoyu; Shi, Yan; Wang, Dingyan; Wang, Yitian; Cui, Chen; Zhu, Tingfei; Chen, Kaixian; Xiang, Ping; Luo, Xiaomin.

Comput Biol Med ; 169: 107958, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-38194778

RESUMO

BACKGROUND: Over the past few decades, agonists binding to the benzodiazepine site of the GABAA receptor have been successfully developed as clinical drugs. Different modulators (agonist, antagonist, and reverse agonist) bound to benzodiazepine sites exhibit different or even opposite pharmacological effects, however, their structures are so similar that it is difficult to distinguish them based solely on molecular skeleton. This study aims to develop classification models for predicting the agonists. METHODS: 306 agonists or non-agonists were collected from literature. Six machine learning algorithms including RF, XGBoost, AdaBoost, GBoost, SVM, and ANN algorithms were employed for model development. Using six descriptors including 1D/2D Descriptors, ECFP4, 2D-Pharmacophore, MACCS, PubChem, and Estate fingerprint to characterize chemical structures. The model interpretability was explored by SHAP method. RESULTS: The best model demonstrated an AUC value of 0.905 and an MCC value of 0.808 for the test set. The PubMac-based model (PubMac-GB) achieved best AUC values of 0.935 for test set. The SHAP analysis results emphasized that MaccsFP62, ECFP_624, ECFP_724, and PubchemFP213 were the crucial molecular features. Applicability domain analysis was also performed to determine reliable prediction boundaries for the model. The PubMac-GB model was applied to virtual screening for potential GABAA agonists and the top 100 compounds were given. CONCLUSION: Overall, our ensemble learning-based model (PubMac-GB) achieved comparable performance and would be helpful in effectively identifying agonists of GABAA receptors.

Assuntos

Agonistas de Receptores de GABA-A , Receptores de GABA-A , Receptores de GABA-A/metabolismo , Benzodiazepinas , Aprendizado de Máquina , Ácido gama-Aminobutírico

9.

Reducing overconfident errors in molecular property classification using Posterior Network.

Fan, Zhehuan; Yu, Jie; Zhang, Xiang; Chen, Yijie; Sun, Shihui; Zhang, Yuanyuan; Chen, Mingan; Xiao, Fu; Wu, Wenyong; Li, Xutong; Zheng, Mingyue; Luo, Xiaomin; Wang, Dingyan.

Patterns (N Y) ; 5(6): 100991, 2024 Jun 14.

Artigo em Inglês | MEDLINE | ID: mdl-39005492

RESUMO

Deep-learning-based classification models are increasingly used for predicting molecular properties in drug development. However, traditional classification models using the Softmax function often give overconfident mispredictions for out-of-distribution samples, highlighting a critical lack of accurate uncertainty estimation. Such limitations can result in substantial costs and should be avoided during drug development. Inspired by advances in evidential deep learning and Posterior Network, we replaced the Softmax function with a normalizing flow to enhance the uncertainty estimation ability of the model in molecular property classification. The proposed strategy was evaluated across diverse scenarios, including simulated experiments based on a synthetic dataset, ADMET predictions, and ligand-based virtual screening. The results demonstrate that compared with the vanilla model, the proposed strategy effectively alleviates the problem of giving overconfident but incorrect predictions. Our findings support the promising application of evidential deep learning in drug development and offer a valuable framework for further research.

10.

Highly accurate carbohydrate-binding site prediction with DeepGlycanSite.

He, Xinheng; Zhao, Lifen; Tian, Yinping; Li, Rui; Chu, Qinyu; Gu, Zhiyong; Zheng, Mingyue; Wang, Yusong; Li, Shaoning; Jiang, Hualiang; Jiang, Yi; Wen, Liuqing; Wang, Dingyan; Cheng, Xi.

Nat Commun ; 15(1): 5163, 2024 Jun 17.

Artigo em Inglês | MEDLINE | ID: mdl-38886381

RESUMO

As the most abundant organic substances in nature, carbohydrates are essential for life. Understanding how carbohydrates regulate proteins in the physiological and pathological processes presents opportunities to address crucial biological problems and develop new therapeutics. However, the diversity and complexity of carbohydrates pose a challenge in experimentally identifying the sites where carbohydrates bind to and act on proteins. Here, we introduce a deep learning model, DeepGlycanSite, capable of accurately predicting carbohydrate-binding sites on a given protein structure. Incorporating geometric and evolutionary features of proteins into a deep equivariant graph neural network with the transformer architecture, DeepGlycanSite remarkably outperforms previous state-of-the-art methods and effectively predicts binding sites for diverse carbohydrates. Integrating with a mutagenesis study, DeepGlycanSite reveals the guanosine-5'-diphosphate-sugar-recognition site of an important G-protein coupled receptor. These findings demonstrate DeepGlycanSite is invaluable for carbohydrate-binding site prediction and could provide insights into molecular mechanisms underlying carbohydrate-regulation of therapeutically important proteins.

Assuntos

Aprendizado Profundo , Sítios de Ligação , Carboidratos/química , Ligação Proteica , Redes Neurais de Computação , Humanos , Proteínas/metabolismo , Proteínas/química , Modelos Moleculares

11.

Targeted Degradation of SOS1 Exhibits Potent Anticancer Activity and Overcomes Resistance in KRAS-Mutant Tumors and BCR-ABL-Positive Leukemia.

Luo, Ziwei; Lin, Chencen; Yu, Chuwei; Yuan, Changxian; Wu, Wenyong; Xu, Xiaowei; Sun, Renhong; Jia, Yan; Wang, Yafang; Shen, Jie; Wang, Dingyan; Wang, Sinan; Jiang, Hualiang; Jiang, Biao; Yang, Xiaobao; Xie, Chengying.

Cancer Res ; 2024 Oct 22.

Artigo em Inglês | MEDLINE | ID: mdl-39437162

RESUMO

SOS1 is an essential guanine nucleotide exchange factor for RAS that also plays a critical role in the activation of the small GTPase RAC mediated by BCR-ABL in leukemogenesis. Despite this, small molecule inhibitors targeting SOS1 have shown limited efficacy in clinical trials for KRAS mutant cancers, and their potential as a therapeutic approach for chronic myeloid leukemia (CML) remains largely unexplored. In this study, we developed a potent SOS1 PROTAC SIAIS562055, which was designed by connecting a CRBN ligand to an analogue of the SOS1 inhibitor BI-3406. SIAIS562055 exhibited sustained degradation of SOS1 and inhibition of downstream ERK pathways, resulting in superior anti-proliferative activity compared to small molecule inhibitors. SIAIS562055 also potentiated the activity of both KRAS inhibitors in KRAS-mutant cancers and ABL inhibitors in BCR-ABL+ CML. In KRAS-mutant xenografts, SIAIS562055 displayed promising antitumor potency as a monotherapy and enhanced ERK inhibition and tumor regression when combined with KRAS inhibitors, overcoming acquired resistance. In CML cells, SIAIS562055 promoted the active uptake of BCR-ABL inhibitors by upregulating the carnitine/organic cation transporter SLC22A4. SIAIS562055 and BCR-ABL inhibitors synergistically enhanced inhibition of ABL phosphorylation and downstream signaling, demonstrating robust antitumor activities in both mouse xenografts and primary CML patient samples. In summary, this study suggests that PROTAC-mediated SOS1 degradation represents an eï¬ective therapeutic strategy for treating not only KRAS-mutant cancers but also BCR-ABL-harboring leukemia.

12.

Learning protein fitness landscapes with deep mutational scanning data from multiple sources.

Chen, Lin; Zhang, Zehong; Li, Zhenghao; Li, Rui; Huo, Ruifeng; Chen, Lifan; Wang, Dingyan; Luo, Xiaomin; Chen, Kaixian; Liao, Cangsong; Zheng, Mingyue.

Cell Syst ; 14(8): 706-721.e5, 2023 08 16.

Artigo em Inglês | MEDLINE | ID: mdl-37591206

RESUMO

One of the key points of machine learning-assisted directed evolution (MLDE) is the accurate learning of the fitness landscape, a conceptual mapping from sequence variants to the desired function. Here, we describe a multi-protein training scheme that leverages the existing deep mutational scanning data from diverse proteins to aid in understanding the fitness landscape of a new protein. Proof-of-concept trials are designed to validate this training scheme in three aspects: random and positional extrapolation for single-variant effects, zero-shot fitness predictions for new proteins, and extrapolation for higher-order variant effects from single-variant effects. Moreover, our study identified previously overlooked strong baselines, and their unexpectedly good performance brings our attention to the pitfalls of MLDE. Overall, these results may improve our understanding of the association between different protein fitness profiles and shed light on developing better machine learning-assisted approaches to the directed evolution of proteins. A record of this paper's transparent peer review process is included in the supplemental information.

Assuntos

Aprendizado de Máquina , Revisão por Pares , Mutação/genética

13.

LogD7.4 prediction enhanced by transferring knowledge from chromatographic retention time, microscopic pKa and logP.

Wang, Yitian; Xiong, Jiacheng; Xiao, Fu; Zhang, Wei; Cheng, Kaiyang; Rao, Jingxin; Niu, Buying; Tong, Xiaochu; Qu, Ning; Zhang, Runze; Wang, Dingyan; Chen, Kaixian; Li, Xutong; Zheng, Mingyue.

J Cheminform ; 15(1): 76, 2023 Sep 05.

Artigo em Inglês | MEDLINE | ID: mdl-37670374

RESUMO

Lipophilicity is a fundamental physical property that significantly affects various aspects of drug behavior, including solubility, permeability, metabolism, distribution, protein binding, and toxicity. Accurate prediction of lipophilicity, measured by the logD7.4 value (the distribution coefficient between n-octanol and buffer at physiological pH 7.4), is crucial for successful drug discovery and design. However, the limited availability of data for logD modeling poses a significant challenge to achieving satisfactory generalization capability. To address this challenge, we have developed a novel logD7.4 prediction model called RTlogD, which leverages knowledge from multiple sources. RTlogD combines pre-training on a chromatographic retention time (RT) dataset since the RT is influenced by lipophilicity. Additionally, microscopic pKa values are incorporated as atomic features, providing valuable insights into ionizable sites and ionization capacity. Furthermore, logP is integrated as an auxiliary task within a multitask learning framework. We conducted ablation studies and presented a detailed analysis, showcasing the effectiveness and interpretability of RT, pKa, and logP in the RTlogD model. Notably, our RTlogD model demonstrated superior performance compared to commonly used algorithms and prediction tools. These results underscore the potential of the RTlogD model to improve the accuracy and generalization of logD prediction in drug discovery and design. In summary, the RTlogD model addresses the challenge of limited data availability in logD modeling by leveraging knowledge from RT, microscopic pKa, and logP. Incorporating these factors enhances the predictive capabilities of our model, and it holds promise for real-world applications in drug discovery and design scenarios.

14.

Tora3D: an autoregressive torsion angle prediction model for molecular 3D conformation generation.

Zhang, Zimei; Wang, Gang; Li, Rui; Ni, Lin; Zhang, RunZe; Cheng, Kaiyang; Ren, Qun; Kong, Xiangtai; Ni, Shengkun; Tong, Xiaochu; Luo, Li; Wang, Dingyan; Lu, Xiaojie; Zheng, Mingyue; Li, Xutong.

J Cheminform ; 15(1): 57, 2023 Jun 07.

Artigo em Inglês | MEDLINE | ID: mdl-37287071

RESUMO

Three-dimensional (3D) conformations of a small molecule profoundly affect its binding to the target of interest, the resulting biological effects, and its disposition in living organisms, but it is challenging to accurately characterize the conformational ensemble experimentally. Here, we proposed an autoregressive torsion angle prediction model Tora3D for molecular 3D conformer generation. Rather than directly predicting the conformations in an end-to-end way, Tora3D predicts a set of torsion angles of rotatable bonds by an interpretable autoregressive method and reconstructs the 3D conformations from them, which keeps structural validity during reconstruction. Another advancement of our method over other conformational generation methods is the ability to use energy to guide the conformation generation. In addition, we propose a new message-passing mechanism that applies the Transformer to the graph to solve the difficulty of remote message passing. Tora3D shows superior performance to prior computational models in the trade-off between accuracy and efficiency, and ensures conformational validity, accuracy, and diversity in an interpretable way. Overall, Tora3D can be used for the quick generation of diverse molecular conformations and 3D-based molecular representation, contributing to a wide range of downstream drug design tasks.

15.

Computing the relative binding affinity of ligands based on a pairwise binding comparison network.

Yu, Jie; Li, Zhaojun; Chen, Geng; Kong, Xiangtai; Hu, Jie; Wang, Dingyan; Cao, Duanhua; Li, Yanbei; Huo, Ruifeng; Wang, Gang; Liu, Xiaohong; Jiang, Hualiang; Li, Xutong; Luo, Xiaomin; Zheng, Mingyue.

Nat Comput Sci ; 3(10): 860-872, 2023 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-38177766

RESUMO

Structure-based lead optimization is an open challenge in drug discovery, which is still largely driven by hypotheses and depends on the experience of medicinal chemists. Here we propose a pairwise binding comparison network (PBCNet) based on a physics-informed graph attention mechanism, specifically tailored for ranking the relative binding affinity among congeneric ligands. Benchmarking on two held-out sets (provided by Schrödinger and Merck) containing over 460 ligands and 16 targets, PBCNet demonstrated substantial advantages in terms of both prediction accuracy and computational efficiency. Equipped with a fine-tuning operation, the performance of PBCNet reaches that of Schrödinger's FEP+, which is much more computationally intensive and requires substantial expert intervention. A further simulation-based experiment showed that active learning-optimized PBCNet may accelerate lead optimization campaigns by 473%. Finally, for the convenience of users, a web service for PBCNet is established to facilitate complex relative binding affinity prediction through an easy-to-operate graphical interface.

Assuntos

Descoberta de Drogas , Simulação de Dinâmica Molecular , Ligação Proteica , Simulação de Acoplamento Molecular , Ligantes

16.

Uncertainty quantification: Can we trust artificial intelligence in drug discovery?

Yu, Jie; Wang, Dingyan; Zheng, Mingyue.

iScience ; 25(8): 104814, 2022 Aug 19.

Artigo em Inglês | MEDLINE | ID: mdl-35996575

RESUMO

The problem of human trust is one of the most fundamental problems in applied artificial intelligence in drug discovery. In silico models have been widely used to accelerate the process of drug discovery in recent years. However, most of these models can only give reliable predictions within a limited chemical space that the training set covers (applicability domain). Predictions of samples falling outside the applicability domain are unreliable and sometimes dangerous for the drug-design decision-making process. Uncertainty quantification accordingly has drawn great attention to enable autonomous drug designing. By quantifying the confidence level of model predictions, the reliability of the predictions can be quantitatively represented to assist researchers in their molecular reasoning and experimental design. Here we summarize the state-of-the-art approaches to uncertainty quantification and underline how they can be used for drug design and discovery projects. Furthermore, we also outline four representative application scenarios of uncertainty quantification in drug discovery.

17.

Graph neural network approaches for drug-target interactions.

Zhang, Zehong; Chen, Lifan; Zhong, Feisheng; Wang, Dingyan; Jiang, Jiaxin; Zhang, Sulin; Jiang, Hualiang; Zheng, Mingyue; Li, Xutong.

Curr Opin Struct Biol ; 73: 102327, 2022 04.

Artigo em Inglês | MEDLINE | ID: mdl-35074533

RESUMO

Developing new drugs remains prohibitively expensive, time-consuming, and often involves safety issues. Accurate prediction of drug-target interactions (DTIs) can guide the drug discovery process and thus facilitate drug development. Non-Euclidian data such as drug-like molecule structures, key pocket residue structures, and protein interaction networks can be represented effectively using graphs. Therefore, the emerging graph neural network has been rapidly applied to predict DTIs, and proved effective in finding repositioning drugs and accelerating drug discovery. In this review, we provide a brief overview of deep neural networks used in DTI models. Then, we summarize the database required for DTI prediction, followed by a comprehensive introduction of applications of graph neural networks for DTI prediction. We also highlight current challenges and future directions to guide the further development of this field.

Assuntos

Desenvolvimento de Medicamentos , Redes Neurais de Computação , Descoberta de Drogas , Interações Medicamentosas , Estrutura Molecular

18.

Cocrystal Prediction of Bexarotene by Graph Convolution Network and Bioavailability Improvement.

Xiao, Fu; Cheng, Yinxiang; Wang, Jian-Rong; Wang, Dingyan; Zhang, Yuanyuan; Chen, Kaixian; Mei, Xuefeng; Luo, Xiaomin.

Pharmaceutics ; 14(10)2022 Oct 16.

Artigo em Inglês | MEDLINE | ID: mdl-36297633

RESUMO

Bexarotene (BEX) was approved by the FDA in 1999 for the treatment of cutaneous T-cell lymphoma (CTCL). The poor aqueous solubility causes the low bioavailability of the drug and thereby limits the clinical application. In this study, we developed a GCN-based deep learning model (CocrystalGCN) for in-silico screening of the cocrystals of BEX. The results show that our model obtained high performance relative to baseline models. The top 30 of 109 coformer candidates were scored by CocrystalGCN and then validated experimentally. Finally, cocrystals of BEX-pyrazine, BEX-2,5-dimethylpyrazine, BEX-methyl isonicotinate, and BEX-ethyl isonicotinate were successfully obtained. The crystal structures were determined by single-crystal X-ray diffraction. Powder X-ray diffraction, differential scanning calorimetry, and thermogravimetric analysis were utilized to characterize these multi-component forms. All cocrystals present superior solubility and dissolution over the parent drug. The pharmacokinetic studies show that the plasma exposures (AUC0-8h) of BEX-pyrazine and BEX-2,5-dimethylpyrazine are 1.7 and 1.8 times that of the commercially available BEX powder, respectively. This work sets a good example for integrating virtual prediction and experimental screening to discover the new cocrystals of water-insoluble drugs.

19.

Facing small and biased data dilemma in drug discovery with enhanced federated learning approaches.

Xiong, Zhaoping; Cheng, Ziqiang; Lin, Xinyuan; Xu, Chi; Liu, Xiaohong; Wang, Dingyan; Luo, Xiaomin; Zhang, Yong; Jiang, Hualiang; Qiao, Nan; Zheng, Mingyue.

Sci China Life Sci ; 65(3): 529-539, 2022 03.

Artigo em Inglês | MEDLINE | ID: mdl-34319533

RESUMO

Artificial intelligence (AI) models usually require large amounts of high-quality training data, which is in striking contrast to the situation of small and biased data faced by current drug discovery pipelines. The concept of federated learning has been proposed to utilize distributed data from different sources without leaking sensitive information of the data. This emerging decentralized machine learning paradigm is expected to dramatically improve the success rate of AI-powered drug discovery. Here, we simulated the federated learning process with different property and activity datasets from different sources, among which overlapping molecules with high or low biases exist in the recorded values. Beyond the benefit of gaining more data, we also demonstrated that federated training has a regularization effect superior to centralized training on the pooled datasets with high biases. Moreover, different network architectures for clients and aggregation algorithms for coordinators have been compared on the performance of federated learning, where personalized federated learning shows promising results. Our work demonstrates the applicability of federated learning in predicting drug-related properties and highlights its promising role in addressing the small and biased data dilemma in drug discovery.

Assuntos

Inteligência Artificial , Descoberta de Drogas , Algoritmos , Conjuntos de Dados como Assunto , Canal de Potássio ERG1/antagonistas & inibidores

20.

Blood-brain barrier penetration prediction enhanced by uncertainty estimation.

Tong, Xiaochu; Wang, Dingyan; Ding, Xiaoyu; Tan, Xiaoqin; Ren, Qun; Chen, Geng; Rong, Yu; Xu, Tingyang; Huang, Junzhou; Jiang, Hualiang; Zheng, Mingyue; Li, Xutong.

J Cheminform ; 14(1): 44, 2022 Jul 07.

Artigo em Inglês | MEDLINE | ID: mdl-35799215

RESUMO

Blood-brain barrier is a pivotal factor to be considered in the process of central nervous system (CNS) drug development, and it is of great significance to rapidly explore the blood-brain barrier permeability (BBBp) of compounds in silico in early drug discovery process. Here, we focus on whether and how uncertainty estimation methods improve in silico BBBp models. We briefly surveyed the current state of in silico BBBp prediction and uncertainty estimation methods of deep learning models, and curated an independent dataset to determine the reliability of the state-of-the-art algorithms. The results exhibit that, despite the comparable performance on BBBp prediction between graph neural networks-based deep learning models and conventional physicochemical-based machine learning models, the GROVER-BBBp model shows greatly improvement when using uncertainty estimations. In particular, the strategy combined Entropy and MC-dropout can increase the accuracy of distinguishing BBB + from BBB - to above 99% by extracting predictions with high confidence level (uncertainty score < 0.1). Case studies on preclinical/clinical drugs for Alzheimer' s disease and marketed antitumor drugs that verified by literature proved the application value of uncertainty estimation enhanced BBBp prediction model, that may facilitate the drug discovery in the field of CNS diseases and metastatic brain tumors.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA