Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 211
Filter
1.
BMC Bioinformatics ; 25(1): 275, 2024 Aug 23.
Article in English | MEDLINE | ID: mdl-39179993

ABSTRACT

BACKGROUND: The rise of network pharmacology has led to the widespread use of network-based computational methods in predicting drug target interaction (DTI). However, existing DTI prediction models typically rely on a limited amount of data to extract drug and target features, potentially affecting the comprehensiveness and robustness of features. In addition, although multiple networks are used for DTI prediction, the integration of heterogeneous information often involves simplistic aggregation and attention mechanisms, which may impose certain limitations. RESULTS: MSH-DTI, a deep learning model for predicting drug-target interactions, is proposed in this paper. The model uses self-supervised learning methods to obtain drug and target structure features. A Heterogeneous Interaction-enhanced Feature Fusion Module is designed for multi-graph construction, and the graph convolutional networks are used to extract node features. With the help of an attention mechanism, the model focuses on the important parts of different features for prediction. Experimental results show that the AUROC and AUPR of MSH-DTI are 0.9620 and 0.9605 respectively, outperforming other models on the DTINet dataset. CONCLUSION: The proposed MSH-DTI is a helpful tool to discover drug-target interactions, which is also validated through case studies in predicting new DTIs.


Subject(s)
Deep Learning , Supervised Machine Learning , Computational Biology/methods , Network Pharmacology/methods
2.
Fundam Res ; 4(4): 715-737, 2024 Jul.
Article in English | MEDLINE | ID: mdl-39156568

ABSTRACT

Drug discovery is costly and time consuming, and modern drug discovery endeavors are progressively reliant on computational methodologies, aiming to mitigate temporal and financial expenditures associated with the process. In particular, the time required for vaccine and drug discovery is prolonged during emergency situations such as the coronavirus 2019 pandemic. Recently, the performance of deep learning methods in drug virtual screening has been particularly prominent. It has become a concern for researchers how to summarize the existing deep learning in drug virtual screening, select different models for different drug screening problems, exploit the advantages of deep learning models, and further improve the capability of deep learning in drug virtual screening. This review first introduces the basic concepts of drug virtual screening, common datasets, and data representation methods. Then, large numbers of common deep learning methods for drug virtual screening are compared and analyzed. In addition, a dataset of different sizes is constructed independently to evaluate the performance of each deep learning model for the difficult problem of large-scale ligand virtual screening. Finally, the existing challenges and future directions in the field of virtual screening are presented.

3.
Sci Rep ; 14(1): 18104, 2024 08 05.
Article in English | MEDLINE | ID: mdl-39103483

ABSTRACT

The study of drug-target interaction plays an important role in the process of drug development. The subject of DTI forecasting has advanced significantly in the last several years, yielding numerous significant research findings and methodologies. Heterogeneous data sources provide richer information and comprehensive perspectives for drug-target interaction prediction, so many existing methods rely on heterogeneous networks, and graph embedding technology becomes an important technology to extract information from heterogeneous networks. These approaches, however, are less concerned with potential noisy information in heterogeneous networks and more focused on the extent of information extraction in those networks. Based on this, a potential DTI predictive network model called FBRWPC is proposed in this paper. It uses a fine-grained similarity selection program to first integrate similarity on similar networks and then a bidirectional random walk graph embedding learning method with restart to obtain an updated drug target interaction matrix. Through the use of similarity selection and fine-grained selection similarity integration, the framework can effectively filter out the noise present in heterogeneous networks and enhance the model's prediction performance. The experimental findings demonstrate that, even after being split up into four distinct types of data sets, FBRWPC can still retain great prediction performance, a sign of the model's resilience and good generalization.


Subject(s)
Algorithms , Humans , Drug Development/methods , Pharmaceutical Preparations
4.
BMC Biol ; 22(1): 156, 2024 Jul 18.
Article in English | MEDLINE | ID: mdl-39020316

ABSTRACT

BACKGROUND: Identification of potential drug-target interactions (DTIs) with high accuracy is a key step in drug discovery and repositioning, especially concerning specific drug targets. Traditional experimental methods for identifying the DTIs are arduous, time-intensive, and financially burdensome. In addition, robust computational methods have been developed for predicting the DTIs and are widely applied in drug discovery research. However, advancing more precise algorithms for predicting DTIs is essential to meet the stringent standards demanded by drug discovery. RESULTS: We proposed a novel method called GSRF-DTI, which integrates networks with a deep learning algorithm to identify DTIs. Firstly, GSRF-DTI learned the embedding representation of drugs and targets by integrating multiple drug association information and target association information, respectively. Then, GSRF-DTI considered the influence of drug-target pair (DTP) association on DTI prediction to construct a drug-target pair network (DTP-NET). Next, we utilized GraphSAGE on DTP-NET to learn the potential features of the network and applied random forest (RF) to predict the DTIs. Furthermore, we conducted ablation experiments to validate the necessity of integrating different types of network features for identifying DTIs. It is worth noting that GSRF-DTI proposed three novel DTIs. CONCLUSIONS: GSRF-DTI not only considered the influence of the interaction relationship between drug and target but also considered the impact of DTP association relationship on DTI prediction. We initially use GraphSAGE to aggregate the neighbor information of nodes for better identification. Experimental analysis on Luo's dataset and the newly constructed dataset revealed that the GSRF-DTI framework outperformed several state-of-the-art methods significantly.


Subject(s)
Drug Discovery , Drug Discovery/methods , Deep Learning , Computational Biology/methods , Algorithms , Pharmaceutical Preparations
5.
AIMS Neurosci ; 11(2): 203-211, 2024.
Article in English | MEDLINE | ID: mdl-38988885

ABSTRACT

Obsessive-compulsive disorder (OCD) is a chronic psychiatric disease in which patients suffer from obsessions compelling them to engage in specific rituals as a temporary measure to alleviate stress. In this study, deep learning-based methods were used to build three models which predict the likelihood of a molecule interacting with three biological targets relevant to OCD, SERT, D2, and NMDA. Then, an ensemble model based on those models was created which underwent external validation on a large drug database using random sampling. Finally, case studies of molecules exhibiting high scores underwent bibliographic validation showcasing that good performance in the ensemble model can indicate connection with OCD pathophysiology, suggesting that it can be used to screen molecule databases for drug-repurposing purposes.

6.
Int J Mol Sci ; 25(11)2024 May 21.
Article in English | MEDLINE | ID: mdl-38891780

ABSTRACT

The kinetics and mechanism of drug binding to its target are critical to pharmacological efficacy. A high throughput (HTS) screen often results in hundreds of hits, of which usually only simple IC50 values are determined during reconfirmation. However, kinetic parameters such as residence time for reversible inhibitors and the kinact/KI ratio, which is the critical measure for evaluating covalent inactivators, are early predictive measures to assess the chances of success of the hits in the clinic. Using the promising cancer target human histone deacetylase 8 as an example, we present a robust method that calculates concentration-dependent apparent rate constants for the inhibition or inactivation of HDAC8 from dose-response curves recorded after different pre-incubation times. With these data, hit compounds can be classified according to their mechanism of action, and the relevant kinetic parameters can be calculated in a highly parallel fashion. HDAC8 inhibitors with known modes of action were correctly assigned to their mechanism, and the binding mechanisms of some hits from an internal HDAC8 screening campaign were newly determined. The oxonitriles SVE04 and SVE27 were classified as fast reversible HDAC8 inhibitors with moderate time-constant IC50 values of 4.2 and 2.6 µM, respectively. The hit compound TJ-19-24 and SAH03 behave like slow two-step inactivators or reversible inhibitors, with a very low reverse isomerization rate.


Subject(s)
Histone Deacetylase Inhibitors , Histone Deacetylases , Repressor Proteins , Humans , Histone Deacetylases/metabolism , Histone Deacetylases/chemistry , Histone Deacetylase Inhibitors/pharmacology , Histone Deacetylase Inhibitors/chemistry , Kinetics , Repressor Proteins/metabolism , Repressor Proteins/antagonists & inhibitors , Repressor Proteins/chemistry , Protein Binding , High-Throughput Screening Assays/methods
7.
Drug Discov Today ; 29(7): 104055, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38852835

ABSTRACT

Metal-based drugs hold promise as potent anticancer agents owing to their unique interactions with cellular targets. This review discusses recent advances in our understanding of the intricate molecular interactions of metal-based anticancer compounds with specific therapeutic targets in cancer cells. Advanced computational and experimental methodologies delineate the binding mechanisms, structural dynamics and functional outcomes of these interactions. In addition, the review sheds light on the precise modes of action of these drugs, their efficacy and the potential avenues for further optimization in cancer-treatment strategies and the development of targeted and effective metal-based therapies for combating various forms of cancer.


Subject(s)
Antineoplastic Agents , DNA , Neoplasms , RNA , Humans , Antineoplastic Agents/pharmacology , Antineoplastic Agents/chemistry , Antineoplastic Agents/therapeutic use , Neoplasms/drug therapy , Metals/chemistry , Animals
8.
Technol Health Care ; 32(S1): 49-64, 2024.
Article in English | MEDLINE | ID: mdl-38759038

ABSTRACT

BACKGROUND: Drug repositioning (DR) refers to a method used to find new targets for existing drugs. This method can effectively reduce the development cost of drugs, save time on drug development, and reduce the risks of drug design. The traditional experimental methods related to DR are time-consuming, expensive, and have a high failure rate. Several computational methods have been developed with the increase in data volume and computing power. In the last decade, matrix factorization (MF) methods have been widely used in DR issues. However, these methods still have some challenges. (1) The model easily falls into a bad local optimal solution due to the high noise and high missing rate in the data. (2) Single similarity information makes the learning power of the model insufficient in terms of identifying the potential associations accurately. OBJECTIVE: We proposed self-paced learning with dual similarity information and MF (SPLDMF), which introduced the self-paced learning method and more information related to drugs and targets into the model to improve prediction performance. METHODS: Combining self-paced learning first can effectively alleviate the model prone to fall into a bad local optimal solution because of the high noise and high data missing rate. Then, we incorporated more data into the model to improve the model's capacity for learning. RESULTS: Our model achieved the best results on each dataset tested. For example, the area under the receiver operating characteristic curve and the precision-recall curve of SPLDMF was 0.982 and 0.815, respectively, outperforming the state-of-the-art methods. CONCLUSION: The experimental results on five benchmark datasets and two extended datasets demonstrated the effectiveness of our approach in predicting drug-target interactions.


Subject(s)
Drug Repositioning , Humans , Drug Repositioning/methods , Machine Learning , Algorithms
9.
Brief Bioinform ; 25(3)2024 Mar 27.
Article in English | MEDLINE | ID: mdl-38762789

ABSTRACT

Identifying drug-target interactions (DTIs) holds significant importance in drug discovery and development, playing a crucial role in various areas such as virtual screening, drug repurposing and identification of potential drug side effects. However, existing methods commonly exploit only a single type of feature from drugs and targets, suffering from miscellaneous challenges such as high sparsity and cold-start problems. We propose a novel framework called MSI-DTI (Multi-Source Information-based Drug-Target Interaction Prediction) to enhance prediction performance, which obtains feature representations from different views by integrating biometric features and knowledge graph representations from multi-source information. Our approach involves constructing a Drug-Target Knowledge Graph (DTKG), obtaining multiple feature representations from diverse information sources for SMILES sequences and amino acid sequences, incorporating network features from DTKG and performing an effective multi-source information fusion. Subsequently, we employ a multi-head self-attention mechanism coupled with residual connections to capture higher-order interaction information between sparse features while preserving lower-order information. Experimental results on DTKG and two benchmark datasets demonstrate that our MSI-DTI outperforms several state-of-the-art DTIs prediction methods, yielding more accurate and robust predictions. The source codes and datasets are publicly accessible at https://github.com/KEAML-JLU/MSI-DTI.


Subject(s)
Drug Discovery , Computational Biology/methods , Algorithms , Humans
10.
BMC Bioinformatics ; 25(1): 141, 2024 Apr 02.
Article in English | MEDLINE | ID: mdl-38566002

ABSTRACT

Accurate and efficient prediction of drug-target interaction (DTI) is critical to advance drug development and reduce the cost of drug discovery. Recently, the employment of deep learning methods has enhanced DTI prediction precision and efficacy, but it still encounters several challenges. The first challenge lies in the efficient learning of drug and protein feature representations alongside their interaction features to enhance DTI prediction. Another important challenge is to improve the generalization capability of the DTI model within real-world scenarios. To address these challenges, we propose CAT-DTI, a model based on cross-attention and Transformer, possessing domain adaptation capability. CAT-DTI effectively captures the drug-target interactions while adapting to out-of-distribution data. Specifically, we use a convolution neural network combined with a Transformer to encode the distance relationship between amino acids within protein sequences and employ a cross-attention module to capture the drug-target interaction features. Generalization to new DTI prediction scenarios is achieved by leveraging a conditional domain adversarial network, aligning DTI representations under diverse distributions. Experimental results within in-domain and cross-domain scenarios demonstrate that CAT-DTI model overall improves DTI prediction performance compared with previous methods.


Subject(s)
Drug Development , Drug Discovery , Drug Interactions , Amino Acid Sequence , Amino Acids
11.
BMC Bioinformatics ; 25(1): 156, 2024 Apr 20.
Article in English | MEDLINE | ID: mdl-38641811

ABSTRACT

BACKGROUND: Accurately identifying drug-target interaction (DTI), affinity (DTA), and binding sites (DTS) is crucial for drug screening, repositioning, and design, as well as for understanding the functions of target. Although there are a few online platforms based on deep learning for drug-target interaction, affinity, and binding sites identification, there is currently no integrated online platforms for all three aspects. RESULTS: Our solution, the novel integrated online platform Drug-Online, has been developed to facilitate drug screening, target identification, and understanding the functions of target in a progressive manner of "interaction-affinity-binding sites". Drug-Online platform consists of three parts: the first part uses the drug-target interaction identification method MGraphDTA, based on graph neural networks (GNN) and convolutional neural networks (CNN), to identify whether there is a drug-target interaction. If an interaction is identified, the second part employs the drug-target affinity identification method MMDTA, also based on GNN and CNN, to calculate the strength of drug-target interaction, i.e., affinity. Finally, the third part identifies drug-target binding sites, i.e., pockets. The method pt-lm-gnn used in this part is also based on GNN. CONCLUSIONS: Drug-Online is a reliable online platform that integrates drug-target interaction, affinity, and binding sites identification. It is freely available via the Internet at http://39.106.7.26:8000/Drug-Online/ .


Subject(s)
Deep Learning , Drug Interactions , Binding Sites , Drug Delivery Systems , Drug Evaluation, Preclinical
12.
Comput Biol Chem ; 110: 108058, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38593480

ABSTRACT

Exploring the relationship between proteins and drugs plays a significant role in discovering new synthetic drugs. The Drug-Target Interaction (DTI) prediction is a fundamental task in the relationship between proteins and drugs. Unlike encoding proteins by amino acids, we use amino acid subsequence to encode proteins, which simulates the biological process of DTI better. For this research purpose, we proposed a novel deep learning framework based on Bidirectional Encoder Representation from Transformers (BERT), which integrates high-frequency subsequence embedding and transfer learning methods to complete the DTI prediction task. As the first key module, subsequence embedding allows to explore the functional interaction units from drug and protein sequences and then contribute to finding DTI modules. As the second key module, transfer learning promotes the model learn the common DTI features from protein and drug sequences in a large dataset. Overall, the BERT-based model can learn two kinds features through the multi-head self-attention mechanism: internal features of sequence and interaction features of both proteins and drugs, respectively. Compared with other methods, BERT-based methods enable more DTI-related features to be discovered by means of attention scores which associated with tokenized protein/drug subsequences. We conducted extensive experiments for the DTI prediction task on three different benchmark datasets. The experimental results show that the model achieves an average prediction metrics higher than most baseline methods. In order to verify the importance of transfer learning, we conducted an ablation study on datasets, and the results show the superiority of transfer learning. In addition, we test the scalability of the model on the dataset in unseen drugs and proteins, and the results of the experiments show that it is acceptable in scalability.


Subject(s)
Deep Learning , Proteins , Proteins/chemistry , Proteins/metabolism , Pharmaceutical Preparations/chemistry , Computational Biology
13.
Methods ; 226: 21-27, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38608849

ABSTRACT

Knowledge graph intent graph attention mechanism Predicting drug-target interactions (DTIs) plays a crucial role in drug discovery and drug development. Considering the high cost and risk of biological experiments, developing computational approaches to explore the interactions between drugs and targets can effectively reduce the time and cost of drug development. Recently, many methods have made significant progress in predicting DTIs. However, existing approaches still suffer from the high sparsity of DTI datasets and the cold start problem. In this paper, we develop a new model to predict drug-target interactions via a knowledge graph and intent graph named DTKGIN. Our method can effectively capture biological environment information for targets and drugs by mining their associated relations in the knowledge graph and considering drug-target interactions at a fine-grained level in the intent graph. DTKGIN learns the representation of drugs and targets from the knowledge graph and the intent graph. Then the probabilities of interactions between drugs and targets are obtained through the inner product of the representation of drugs and targets. Experimental results show that our proposed method outperforms other state-of-the-art methods in 10-fold cross-validation, especially in cold-start experimental settings. Furthermore, the case studies demonstrate the effectiveness of DTKGIN in predicting potential drug-target interactions. The code is available on GitHub: https://github.com/Royluoyi123/DTKGIN.


Subject(s)
Drug Discovery , Drug Discovery/methods , Humans , Algorithms , Computational Biology/methods , Drug Development/methods
14.
J Cell Mol Med ; 28(7): e18224, 2024 04.
Article in English | MEDLINE | ID: mdl-38509739

ABSTRACT

Drug-target interaction (DTI) prediction is essential for new drug design and development. Constructing heterogeneous network based on diverse information about drugs, proteins and diseases provides new opportunities for DTI prediction. However, the inherent complexity, high dimensionality and noise of such a network prevent us from taking full advantage of these network characteristics. This article proposes a novel method, NGCN, to predict drug-target interactions from an integrated heterogeneous network, from which to extract relevant biological properties and association information while maintaining the topology information. It focuses on learning the topology representation of drugs and targets to improve the performance of DTI prediction. Unlike traditional methods, it focuses on learning the low-dimensional topology representation of drugs and targets via graph-based convolutional neural network. NGCN achieves substantial performance improvements over other state-of-the-art methods, such as a nearly 1.0% increase in AUPR value. Moreover, we verify the robustness of NGCN through benchmark tests, and the experimental results demonstrate it is an extensible framework capable of combining heterogeneous information for DTI prediction.


Subject(s)
Drug Design , Neural Networks, Computer
15.
Int J Mol Sci ; 25(5)2024 Feb 26.
Article in English | MEDLINE | ID: mdl-38473928

ABSTRACT

Rheumatoid arthritis (RA) is a systemic autoimmune disease characterized by swelling in at least one joint. Owing to an overactive immune response, extra-articular manifestations are observed in certain cases, with interstitial lung disease (ILD) being the most common. Rheumatoid arthritis-associated interstitial lung disease (RA-ILD) is characterized by chronic inflammation of the interstitial space, which causes fibrosis and the scarring of lung tissue. Controlling inflammation and pulmonary fibrosis in RA-ILD is important because they are associated with high morbidity and mortality. Pirfenidone and nintedanib are specific drugs against idiopathic pulmonary fibrosis and showed efficacy against RA-ILD in several clinical trials. Immunosuppressants and disease-modifying antirheumatic drugs (DMARDs) with anti-fibrotic effects have also been used to treat RA-ILD. Immunosuppressants moderate the overexpression of cytokines and immune cells to reduce pulmonary damage and slow the progression of fibrosis. DMARDs with mild anti-fibrotic effects target specific fibrotic pathways to regulate fibrogenic cellular activity, extracellular matrix homeostasis, and oxidative stress levels. Therefore, specific medications are required to effectively treat RA-ILD. In this review, the commonly used RA-ILD treatments are discussed based on their molecular mechanisms and clinical trial results. In addition, a computational approach is proposed to develop specific drugs for RA-ILD.


Subject(s)
Antirheumatic Agents , Arthritis, Rheumatoid , Idiopathic Pulmonary Fibrosis , Lung Diseases, Interstitial , Humans , Lung Diseases, Interstitial/etiology , Arthritis, Rheumatoid/drug therapy , Idiopathic Pulmonary Fibrosis/drug therapy , Inflammation/drug therapy , Antirheumatic Agents/therapeutic use , Immunosuppressive Agents/therapeutic use
16.
Structure ; 32(5): 611-620.e4, 2024 May 02.
Article in English | MEDLINE | ID: mdl-38447575

ABSTRACT

Identifying binding compounds against a target protein is crucial for large-scale virtual screening in drug development. Recently, network-based methods have been developed for compound-protein interaction (CPI) prediction. However, they are difficult to be applied to unseen (i.e., never-seen-before) proteins and compounds. In this study, we propose SgCPI to incorporate local known interacting networks to predict CPI interactions. SgCPI randomly samples the local CPI network of the query compound-protein pair as a subgraph and applies a heterogeneous graph neural network (HGNN) to embed the active/inactive message of the subgraph. For unseen compounds and proteins, SgCPI-KD takes SgCPI as the teacher model to distillate its knowledge by estimating the potential neighbors. Experimental results indicate: (1) the sampled subgraphs of the CPI network introduce efficient knowledge for unseen molecular prediction with the HGNNs, and (2) the knowledge distillation strategy is beneficial to the double-blind interaction prediction by estimating molecular neighbors and distilling knowledge.


Subject(s)
Neural Networks, Computer , Proteins , Proteins/chemistry , Proteins/metabolism , Protein Binding , Humans
17.
Interdiscip Sci ; 2024 Mar 14.
Article in English | MEDLINE | ID: mdl-38483753

ABSTRACT

Recognizing drug-target interactions (DTI) stands as a pivotal element in the expansive field of drug discovery. Traditional biological wet experiments, although valuable, are time-consuming and costly as methods. Recently, computational methods grounded in network learning have demonstrated great advantages by effective topological feature extraction and attracted extensive research attention. However, most existing network-based learning methods only consider the low-order binary correlation between individual drug and target, neglecting the potential higher-order correlation information derived from multiple drugs and targets. High-order information, as an essential component, exhibits complementarity with low-order information. Hence, the incorporation of higher-order associations between drugs and targets, while adequately integrating them with the existing lower-order information, could potentially yield substantial breakthroughs in predicting drug-target interactions. We propose a novel dual channels network-based learning model CHL-DTI that converges high-order information from hypergraphs and low-order information from ordinary graph for drug-target interaction prediction. The convergence of high-low order information in CHL-DTI is manifested in two key aspects. First, during the feature extraction stage, the model integrates both high-level semantic information and low-level topological information by combining hypergraphs and ordinary graph. Second, CHL-DTI fully fuse the innovative introduced drug-protein pairs (DPP) hypergraph network structure with ordinary topological network structure information. Extensive experimentation conducted on three public datasets showcases the superior performance of CHL-DTI in DTI prediction tasks when compared to SOTA methods. The source code of CHL-DTI is available at https://github.com/UPCLyy/CHL-DTI .

18.
Artif Intell Med ; 149: 102778, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38462280

ABSTRACT

Many computational methods have been proposed to identify potential drug-target interactions (DTIs) to expedite drug development. Graph neural network (GNN) methods are considered to be one of the most effective approaches. However, shallow GNN methods can only aggregate local information from nodes. Also, deep GNN methods may result in over-smoothing while obtaining long-distance neighbourhood information. As a result, existing GNN methods struggle to extract the complete features of the graph. Additionally, the number of known DTIs is insufficient, and there are far more unknown drug-target pairs than known DTIs, leading to class imbalance. This article proposes a model that combines graph autoencoder and self-supervised learning to accurately encode multilevel features of graphs using only a small number of labelled samples. We introduce a positive sample compensation coefficient to the objective function to mitigate the impact of class imbalance. Experiments on two datasets demonstrated that our model outperforms the four baseline methods, and the new DTIs predicted by the SSLDTI model were verified by the DrugBank database.


Subject(s)
Drug Development , Neural Networks, Computer , Databases, Factual , Supervised Machine Learning
19.
Math Biosci Eng ; 21(2): 2608-2625, 2024 Jan 18.
Article in English | MEDLINE | ID: mdl-38454698

ABSTRACT

In the drug discovery process, time and costs are the most typical problems resulting from the experimental screening of drug-target interactions (DTIs). To address these limitations, many computational methods have been developed to achieve more accurate predictions. However, identifying DTIs mostly rely on separate learning tasks with drug and target features that neglect interaction representation between drugs and target. In addition, the lack of these relationships may lead to a greatly impaired performance on the prediction of DTIs. Aiming at capturing comprehensive drug-target representations and simplifying the network structure, we propose an integrative approach with a convolution broad learning system for the DTI prediction (ConvBLS-DTI) to reduce the impact of the data sparsity and incompleteness. First, given the lack of known interactions for the drug and target, the weighted K-nearest known neighbors (WKNKN) method was used as a preprocessing strategy for unknown drug-target pairs. Second, a neighborhood regularized logistic matrix factorization (NRLMF) was applied to extract features of updated drug-target interaction information, which focused more on the known interaction pair parties. Then, a broad learning network incorporating a convolutional neural network was established to predict DTIs, which can make classification more effective using a different perspective. Finally, based on the four benchmark datasets in three scenarios, the ConvBLS-DTI's overall performance out-performed some mainstream methods. The test results demonstrate that our model achieves improved prediction effect on the area under the receiver operating characteristic curve and the precision-recall curve.


Subject(s)
Drug Discovery , Neural Networks, Computer , Drug Discovery/methods , ROC Curve
20.
Comput Biol Med ; 173: 108339, 2024 May.
Article in English | MEDLINE | ID: mdl-38547658

ABSTRACT

The application of Artificial Intelligence (AI) to screen drug molecules with potential therapeutic effects has revolutionized the drug discovery process, with significantly lower economic cost and time consumption than the traditional drug discovery pipeline. With the great power of AI, it is possible to rapidly search the vast chemical space for potential drug-target interactions (DTIs) between candidate drug molecules and disease protein targets. However, only a small proportion of molecules have labelled DTIs, consequently limiting the performance of AI-based drug screening. To solve this problem, a machine learning-based approach with great ability to generalize DTI prediction across molecules is desirable. Many existing machine learning approaches for DTI identification failed to exploit the full information with respect to the topological structures of candidate molecules. To develop a better approach for DTI prediction, we propose GraphormerDTI, which employs the powerful Graph Transformer neural network to model molecular structures. GraphormerDTI embeds molecular graphs into vector-format representations through iterative Transformer-based message passing, which encodes molecules' structural characteristics by node centrality encoding, node spatial encoding and edge encoding. With a strong structural inductive bias, the proposed GraphormerDTI approach can effectively infer informative representations for out-of-sample molecules and as such, it is capable of predicting DTIs across molecules with an exceptional performance. GraphormerDTI integrates the Graph Transformer neural network with a 1-dimensional Convolutional Neural Network (1D-CNN) to extract the drugs' and target proteins' representations and leverages an attention mechanism to model the interactions between them. To examine GraphormerDTI's performance for DTI prediction, we conduct experiments on three benchmark datasets, where GraphormerDTI achieves a superior performance than five state-of-the-art baselines for out-of-molecule DTI prediction, including GNN-CPI, GNN-PT, DeepEmbedding-DTI, MolTrans and HyperAttentionDTI, and is on a par with the best baseline for transductive DTI prediction. The source codes and datasets are publicly accessible at https://github.com/mengmeng34/GraphormerDTI.


Subject(s)
Artificial Intelligence , Drug Discovery , Drug Evaluation, Preclinical , Neural Networks, Computer , Benchmarking
SELECTION OF CITATIONS
SEARCH DETAIL