Search | VHL Regional Portal

1.

Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures.

Yang, Ziduo; Zhong, Weihe; Lv, Qiujie; Dong, Tiejun; Chen, Guanxing; Chen, Calvin Yu-Chian.

IEEE Trans Pattern Anal Mach Intell ; PP, 2024 May 13.

Article in English | MEDLINE | ID: mdl-38739515

ABSTRACT

Inductive bias in machine learning (ML) is the set of assumptions describing how a model makes predictions. Different ML-based methods for protein-ligand binding affinity (PLA) prediction have different inductive biases, leading to different levels of generalization capability and interpretability. Intuitively, the inductive bias of an ML-based model for PLA prediction should fit in with biological mechanisms relevant for binding to achieve good predictions with meaningful reasons. To this end, we propose an interaction-based inductive bias to restrict neural networks to functions relevant for binding with two assumptions: (1) A protein-ligand complex can be naturally expressed as a heterogeneous graph with covalent and non-covalent interactions; (2) The predicted PLA is the sum of pairwise atom-atom affinities determined by non-covalent interactions. The interaction-based inductive bias is embodied by an explainable heterogeneous interaction graph neural network (EHIGN) for explicitly modeling pairwise atom-atom interactions to predict PLA from 3D structures. Extensive experiments demonstrate that EHIGN achieves better generalization capability than other state-of-the-art ML-based baselines in PLA prediction and structure-based virtual screening. More importantly, comprehensive analyses of distance-affinity, pose-affinity, and substructure-affinity relations suggest that the interaction-based inductive bias can guide the model to learn atomic interactions that are consistent with physical reality. As a case study to demonstrate practical usefulness, our method is tested for predicting the efficacy of Nirmatrelvir against SARS-CoV-2 variants. EHIGN successfully recognizes the changes in the efficacy of Nirmatrelvir for different SARS-CoV-2 variants with meaningful reasons.
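Assumption (2) in the abstract above, that the predicted affinity is a sum of pairwise atom-atom terms, can be illustrated with a minimal Python sketch. The scoring function `pair_affinity`, the 5 Å cutoff, and the toy coordinates are hypothetical placeholders chosen for illustration; they are not EHIGN's learned network, which derives the pairwise terms from a heterogeneous graph neural network.

```python
import math

def pair_affinity(distance, cutoff=5.0):
    """Hypothetical stand-in for a learned pairwise score.

    Contacts closer than the cutoff contribute more; pairs beyond it contribute 0.
    """
    if distance >= cutoff:
        return 0.0
    return 1.0 - distance / cutoff

def predicted_affinity(ligand_atoms, protein_atoms, cutoff=5.0):
    """Sum the pairwise scores over every ligand-protein atom pair."""
    total = 0.0
    for ligand_atom in ligand_atoms:
        for protein_atom in protein_atoms:
            total += pair_affinity(math.dist(ligand_atom, protein_atom), cutoff)
    return total

# Toy 3D coordinates in angstroms (made up for illustration).
ligand = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]
protein = [(0.0, 3.0, 0.0), (10.0, 10.0, 10.0)]
print(predicted_affinity(ligand, protein))
```

Because the prediction is an explicit sum, each pair's contribution can be inspected on its own, which is the kind of interpretability the abstract attributes to the interaction-based inductive bias.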

2.

MFTrans: A multi-feature transformer network for protein secondary structure prediction.

Chen, Yifu; Chen, Guanxing; Chen, Calvin Yu-Chian.

Int J Biol Macromol ; 267(Pt 1): 131311, 2024 May.

Article in English | MEDLINE | ID: mdl-38599417

ABSTRACT

In the rapidly evolving field of computational biology, accurate prediction of protein secondary structures is crucial for understanding protein functions, facilitating drug discovery, and advancing disease diagnostics. In this paper, we propose MFTrans, a deep learning-based multi-feature fusion network aimed at enhancing the precision and efficiency of Protein Secondary Structure Prediction (PSSP). This model employs a Multiple Sequence Alignment (MSA) Transformer in combination with a multi-view deep learning architecture to effectively capture both global and local features of protein sequences. MFTrans integrates diverse features generated by protein sequences, including MSA, sequence information, evolutionary information, and hidden state information, using a multi-feature fusion strategy. The MSA Transformer is utilized to interleave row and column attention across the input MSA, while a Transformer encoder and decoder are introduced to enhance the extracted high-level features. A hybrid network architecture, combining a convolutional neural network with a bidirectional Gated Recurrent Unit (BiGRU) network, is used to further extract high-level features after feature fusion. In independent tests, our experimental results show that MFTrans has superior generalization ability, outperforming other state-of-the-art PSSP models by 3 % on average on public benchmarks including CASP12, CASP13, CASP14, TEST2016, TEST2018, and CB513. Case studies further highlight its advanced performance in predicting mutation sites. MFTrans contributes significantly to the protein science field, opening new avenues for drug discovery, disease diagnosis, and protein.

Subject(s)

Computational Biology; Protein Structure, Secondary; Proteins; Proteins/chemistry; Computational Biology/methods; Deep Learning; Neural Networks, Computer; Algorithms; Sequence Alignment; Sequence Analysis, Protein/methods

3.

PSSP-MFFNet: A Multifeature Fusion Network for Protein Secondary Structure Prediction.

Chen, Yifu; Chen, Guanxing; Chen, Calvin Yu-Chian.

ACS Omega ; 9(5): 5985-5994, 2024 Feb 06.

Article in English | MEDLINE | ID: mdl-38343972

ABSTRACT

Protein secondary structure prediction (PSSP) is a fundamental task in modern bioinformatics research and is particularly important for uncovering the functional mechanisms of proteins. To improve the accuracy of PSSP, various general and essential features generated from amino acid sequences are often used for predicting the secondary structure. In this paper, we propose PSSP-MFFNet, a deep learning-based multi-feature fusion network for PSSP, which incorporates a multi-view deep learning architecture with the multiple sequence alignment (MSA) Transformer to efficiently capture global and local features of protein sequences. In practice, PSSP-MFFNet adopts a multi-feature fusion strategy, integrating different features generated from protein sequences, including MSA, sequence information, evolutionary information, and hidden state information. Moreover, we employ the MSA Transformer to interleave row and column attention across the input MSA. A hybrid network architecture of convolutional neural networks and long short-term memory networks is applied to extract high-level features after feature fusion. Furthermore, we introduce a transformer encoder to enhance the extracted high-level features. Comparative experimental results on independent tests demonstrate that PSSP-MFFNet has excellent generalization ability, outperforming other state-of-the-art PSSP models by an average of 1% on public benchmarks, including CASP12, CASP13, CASP14, TEST2018, and CB513. Our method can contribute to a better understanding of the biological functions of proteins, which has significant implications for drug discovery, disease diagnosis, and protein engineering.

4.

MRI-based automated machine learning model for preoperative identification of variant histology in muscle-invasive bladder carcinoma.

Huang, Jingwen; Chen, Guanxing; Liu, Haiqing; Jiang, Wei; Mai, Siyao; Zhang, Lingli; Zeng, Hong; Wu, Shaoxu; Chen, Calvin Yu-Chian; Wu, Zhuo.

Eur Radiol ; 34(3): 1804-1815, 2024 Mar.

Article in English | MEDLINE | ID: mdl-37658139

ABSTRACT

OBJECTIVES: It is essential yet highly challenging to preoperatively diagnose variant histologies such as urothelial carcinoma with squamous differentiation (UC w/SD) from pure UC in patients with muscle-invasive bladder carcinoma (MIBC), as their treatment strategy varies significantly. We developed a non-invasive automated machine learning (AutoML) model to preoperatively differentiate UC w/SD from pure UC in patients with MIBC. METHODS: A total of 119 MIBC patients who underwent baseline bladder MRI were enrolled in this study, including 38 patients with UC w/SD and 81 patients with pure UC. These patients were randomly assigned to a training set or a test set (3:1). An AutoML model was built from the training set, using 13 selected radiomic features from T2-weighted imaging, semantic features (ADC values), and clinical features (tumor length, tumor stage, lymph node metastasis status), and subsequent ten-fold cross-validation was performed. A test set was used to validate the proposed model. The AUC of the ROC curve was then calculated for the model. RESULTS: This AutoML model enabled robust differentiation of UC w/SD and pure UC in patients with MIBC in both training set (ten-fold cross-validation AUC = 0.955, 95% confidence interval [CI]: 0.944-0.965) and test set (AUC = 0.932, 95% CI: 0.812-1.000). CONCLUSION: The presented AutoML model, that incorporates the radiomic, semantic, and clinical features from baseline MRI, could be useful for preoperative differentiation of UC w/SD and pure UC. CLINICAL RELEVANCE STATEMENT: This MRI-based automated machine learning (AutoML) study provides a non-invasive and low-cost preoperative prediction tool to identify the muscle-invasive bladder cancer patients with variant histology, which may serve as a useful tool for clinical decision-making. 
KEY POINTS: • It is important to preoperatively diagnose variant histology from urothelial carcinoma in patients with muscle-invasive bladder carcinoma (MIBC), as their treatment strategy varies significantly. • An automated machine learning (AutoML) model based on baseline bladder MRI can identify the variant histology (squamous differentiation) from urothelial carcinoma preoperatively in patients with MIBC. • The developed AutoML model is a non-invasive and low-cost preoperative prediction tool, which may be useful for clinical decision-making.

Subject(s)

Carcinoma, Squamous Cell; Carcinoma, Transitional Cell; Urinary Bladder Neoplasms; Humans; Carcinoma, Squamous Cell/pathology; Machine Learning; Magnetic Resonance Imaging; Muscles/pathology; Retrospective Studies; Urinary Bladder/diagnostic imaging; Urinary Bladder/surgery; Urinary Bladder/pathology; Urinary Bladder Neoplasms/diagnostic imaging; Urinary Bladder Neoplasms/surgery; Urinary Bladder Neoplasms/pathology
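The AUC values reported in the abstract for this entry summarize how well the model ranks UC w/SD cases above pure UC cases. As a minimal sketch of what that metric measures, the Python function below computes AUC as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case. The labels and scores are made-up toy values, not study data, and `roc_auc` is an illustrative helper rather than the study's software.

```python
def roc_auc(labels, scores):
    """AUC as the chance that a random positive outranks a random negative.

    Tied scores count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Toy example: 1 = UC w/SD, 0 = pure UC (invented scores for illustration).
labels = [1, 1, 0, 0, 0]
scores = [0.9, 0.4, 0.5, 0.2, 0.1]
print(roc_auc(labels, scores))
```

With only a few dozen test cases, a single misranked pair shifts this value noticeably, which is consistent with the wide confidence interval the abstract reports for the test set.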

5.

LDCNet: Lightweight dynamic convolution network for laparoscopic procedures image segmentation.

Yin, Yiyang; Luo, Shuangling; Zhou, Jun; Kang, Liang; Chen, Calvin Yu-Chian.

Neural Netw ; 170: 441-452, 2024 Feb.

Article in English | MEDLINE | ID: mdl-38039682

ABSTRACT

Medical image segmentation is fundamental to modern healthcare systems, especially for reducing surgical risk and supporting treatment planning. Transanal total mesorectal excision (TaTME) has emerged as a recent focal point in laparoscopic research and is a pivotal treatment modality for colon and rectal cancers. Real-time instance segmentation of surgical imagery during TaTME procedures can serve as an invaluable tool for assisting surgeons, ultimately reducing surgical risks. The dynamic variations in the size and shape of anatomical structures within intraoperative images make precise instance segmentation of TaTME images a highly complex task. Deep learning has proven effective in medical image segmentation. However, existing models struggle to achieve satisfactory accuracy while keeping computational complexity manageable on TaTME data. To address this, we propose a lightweight dynamic convolution network (LDCNet) that matches the segmentation performance of state-of-the-art (SOTA) medical image segmentation networks while running at the speed of a lightweight convolutional neural network. Experimental results demonstrate the promising performance of LDCNet, which consistently exceeds previous SOTA approaches. Code is available at github.com/yinyiyang416/LDCNet.

Subject(s)

Colorectal Neoplasms; Laparoscopy; Humans; Rectum/surgery; Laparoscopy/methods; Neural Networks, Computer; Image Processing, Computer-Assisted/methods

6.

Discovery of Kinetin in inhibiting colorectal cancer progression via enhancing PSMB1-mediated RAB34 degradation.

Jiang, Xuefei; Yang, Lanlan; Chen, Guanxing; Feng, Xingzhi; Liu, Yiting; Gao, Qianling; Mai, Mingru; Chen, Calvin Yu-Chian; Ye, Shubiao; Yang, Zihuan.

Cancer Lett ; 584: 216600, 2024 Mar 01.

Article in English | MEDLINE | ID: mdl-38159835

ABSTRACT

Colorectal cancer (CRC) is one of the most prevalent malignancies worldwide. Understanding the underlying mechanism driving CRC progression and identifying potential therapeutic drug targets are of utmost urgency. We previously utilized LC-MS-based proteomic profiling to identify proteins associated with postoperative progression in stage II/III CRC. Here, we revealed that proteasome subunit beta type-1 (PSMB1) is an independent predictor for postoperative progression in stage II/III CRC. Mechanistically, PSMB1 binds directly to onco-protein RAB34 and promotes its proteasome-dependent degradation, potentially leading to the inactivation of the MEK/ERK signaling pathway and inhibition of CRC progression. To further identify potential anticancer drugs, we screened a library of 2509 FDA-approved drugs using computer-aided drug design (CADD) and identified Kinetin as a potentiating agent for PSMB1. Functional assays confirmed that Kinetin enhanced the interaction between PSMB1 and RAB34, hence facilitated the degradation of RAB34 protein and decreased the MEK/ERK phosphorylation. Kinetin suppresses CRC progression in patient-derived xenograft (PDX) and liver metastasis models. Conclusively, our study identifies PSMB1 as a potential biomarker and therapeutic target for CRC, and Kinetin as an anticancer drug by enhancing proteasome-dependent onco-protein degradation.

Subject(s)

Colorectal Neoplasms; Proteasome Endopeptidase Complex; Humans; Proteasome Endopeptidase Complex/metabolism; Kinetin; Proteomics; Colorectal Neoplasms/drug therapy; Colorectal Neoplasms/genetics; Colorectal Neoplasms/metabolism; Mitogen-Activated Protein Kinase Kinases; Cell Line, Tumor

7.

Equivariant Flexible Modeling of the Protein-Ligand Binding Pose with Geometric Deep Learning.

Dong, Tiejun; Yang, Ziduo; Zhou, Jun; Chen, Calvin Yu-Chian.

J Chem Theory Comput ; 19(22): 8446-8459, 2023 Nov 28.

Article in English | MEDLINE | ID: mdl-37938978

ABSTRACT

Flexible modeling of the protein-ligand complex structure is a fundamental challenge for in silico drug development. Recent studies have improved commonly used docking tools by incorporating extra deep learning-based steps. However, such strategies are limited in accuracy and efficiency because they retain massive sampling pressure and do not account for flexible biomolecular changes. In this study, we propose FlexPose, a geometric graph network capable of direct flexible modeling of complex structures in Euclidean space without following conventional sampling and scoring strategies. Our model adopts two key designs, a scalar-vector dual feature representation and an SE(3)-equivariant network, to manage dynamic structural changes, as well as two strategies, conformation-aware pretraining and weakly supervised learning, to boost model generalizability in unseen chemical space. Benefiting from these paradigms, our model dramatically outperforms all tested popular docking tools and recently advanced deep learning methods, especially in tasks involving protein conformation changes. We further investigate the impact of protein and ligand similarity on the model performance with two conformation-aware strategies. Moreover, FlexPose provides an affinity estimation and model confidence for postanalysis.

Subject(s)

Deep Learning; Ligands; Molecular Docking Simulation; Proteins/chemistry; Protein Conformation; Protein Binding

8.

TCMBank: bridges between the largest herbal medicines, chemical ingredients, target proteins, and associated diseases with intelligence text mining.

Lv, Qiujie; Chen, Guanxing; He, Haohuai; Yang, Ziduo; Zhao, Lu; Chen, Hsin-Yi; Chen, Calvin Yu-Chian.

Chem Sci ; 14(39): 10684-10701, 2023 Oct 11.

Article in English | MEDLINE | ID: mdl-37829020

ABSTRACT

Traditional Chinese Medicine (TCM) has long been viewed as a precious source of modern drug discovery. AI-assisted drug discovery (AIDD) has been investigated extensively. However, there are still two challenges in applying AIDD to guide TCM drug discovery: the lack of a large amount of standardized TCM-related information, and the tendency of AIDD toward pathological failures on out-of-domain data. We released TCM Database@Taiwan in 2011, and it has been widely disseminated and used. We have now developed TCMBank, the largest systematic free TCM database, which is an extension of TCM Database@Taiwan. TCMBank contains 9192 herbs, 61,966 ingredients (unduplicated), 15,179 targets, 32,529 diseases, and their pairwise relationships. By integrating multiple data sources, TCMBank provides 3D structure information of ingredients and provides a standard list and detailed information on herbs, ingredients, targets and diseases. TCMBank has an intelligent document identification module that continuously adds TCM-related information retrieved from the literature in PubChem. In addition, driven by TCMBank big data, we developed an ensemble learning-based drug discovery protocol for identifying potential leads and drug repurposing. We take colorectal cancer and Alzheimer's disease as examples to demonstrate how to accelerate drug discovery by artificial intelligence. Using TCMBank, researchers can view literature-driven relationship mapping between herbs/ingredients and genes/diseases, allowing the understanding of molecular action mechanisms for ingredients and identification of new potentially effective treatments. TCMBank is available at https://TCMBank.CN/.

9.

Hybrid neural network approaches to predict drug-target binding affinity for drug repurposing: screening for potential leads for Alzheimer's disease.

Wu, Xialin; Li, Zhuojian; Chen, Guanxing; Yin, Yiyang; Chen, Calvin Yu-Chian.

Front Mol Biosci ; 10: 1227371, 2023.

Article in English | MEDLINE | ID: mdl-37441162

ABSTRACT

Alzheimer's disease (AD) is a neurodegenerative disease that primarily affects elderly individuals. Recent studies have found that sigma-1 receptor (S1R) agonists can maintain endoplasmic reticulum stress homeostasis, reduce neuronal apoptosis, and enhance mitochondrial function and autophagy, making S1R a target for AD therapy. Traditional experimental methods are costly and inefficient, and rapid and accurate prediction methods need to be developed, while drug repurposing provides new ways and options for AD treatment. In this paper, we propose HNNDTA, a hybrid neural network for drug-target affinity (DTA) prediction, to facilitate drug repurposing for AD treatment. The study combines protein-protein interaction (PPI) network analysis, the HNNDTA model, and molecular docking to identify potential leads for AD. The HNNDTA model was constructed using 13 drug encoding networks and 9 target encoding networks with 2506 FDA-approved drugs as the candidate drug library for S1R and related proteins. Seven potential drugs were identified using network pharmacology and DTA prediction results of the HNNDTA model. Molecular docking simulations were further performed using the AutoDock Vina tool to screen haloperidol and bromperidol as lead compounds for AD treatment. Absorption, distribution, metabolism, excretion, and toxicity (ADMET) evaluation results indicated that both compounds had good pharmacokinetic properties and were virtually non-toxic. The study proposes a new approach to computer-aided drug design that is faster and more economical, and can improve hit rates for new drug compounds. The results of this study provide new lead compounds for AD treatment, which may be effective due to their multi-target action. HNNDTA is freely available at https://github.com/lizhj39/HNNDTA.

10.

3D graph neural network with few-shot learning for predicting drug-drug interactions in scaffold-based cold start scenario.

Lv, Qiujie; Zhou, Jun; Yang, Ziduo; He, Haohuai; Chen, Calvin Yu-Chian.

Neural Netw ; 165: 94-105, 2023 Aug.

Article in English | MEDLINE | ID: mdl-37276813

ABSTRACT

Understanding drug-drug interactions (DDI) of new drugs is critical for minimizing unexpected adverse drug reactions. The modeling of new drugs is called a cold start scenario. In this scenario, only limited structural or physicochemical information about a new drug is available. The 3D conformation of drug molecules usually plays a more crucial role in chemical properties than the 2D structure. A 3D graph network with few-shot learning is a promising solution. However, the 3D heterogeneity of drug molecules and the discretization of atomic distributions lead to spatial confusion in few-shot learning. Here, we propose a 3D graph neural network with few-shot learning, Meta3D-DDI, to predict DDI events in the cold start scenario. The 3DGNN ensures rotation and translation invariance by calculating atomic pairwise distances, and incorporates 3D structure and distance information in the information aggregation stage. The continuous filter interaction module can continuously simulate the filter to obtain the interaction between the target atom and other atoms. Meta3D-DDI further develops a few-shot learning (FSL) strategy based on bilevel optimization to transfer meta-knowledge for DDI prediction tasks from existing drugs to new drugs. In addition, the existing cold start setting may cause the scaffold structure information in the training set to leak into the test set. We design a scaffold-based cold start scenario to ensure that the drug scaffolds in the training set and test set do not overlap. Extensive experiments demonstrate that our architecture achieves SOTA performance for DDI prediction under the scaffold-based cold start scenario on two real-world datasets. The visual experiment shows that Meta3D-DDI significantly improves the learning for DDI prediction of new drugs. We also demonstrate how Meta3D-DDI can reduce the amount of data required to make meaningful DDI predictions.

Subject(s)

Knowledge; Learning; Drug Interactions; Neural Networks, Computer; Rotation
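The abstract for this entry states that rotation and translation invariance is obtained by working with atomic pairwise distances. A minimal Python sketch of why that holds is given below: a rigid motion changes every coordinate but leaves the distance matrix unchanged. The coordinates, the rotation angle, and the helper names are illustrative assumptions, and the rotation is restricted to the z-axis for brevity; none of this is taken from the Meta3D-DDI code.

```python
import math

def pairwise_distances(coords):
    """Distance matrix between all atoms, given (x, y, z) tuples."""
    return [[math.dist(a, b) for b in coords] for a in coords]

def rotate_z_then_translate(coords, angle, shift):
    """Apply a rigid motion: rotate about the z-axis, then translate."""
    c, s = math.cos(angle), math.sin(angle)
    return [
        (c * x - s * y + shift[0], s * x + c * y + shift[1], z + shift[2])
        for x, y, z in coords
    ]

# Toy three-atom conformation (made-up coordinates in angstroms).
atoms = [(0.0, 0.0, 0.0), (1.2, 0.0, 0.0), (1.9, 1.1, 0.3)]
moved = rotate_z_then_translate(atoms, angle=0.7, shift=(5.0, -2.0, 3.0))

before = pairwise_distances(atoms)
after = pairwise_distances(moved)
max_diff = max(
    abs(a - b) for row_a, row_b in zip(before, after) for a, b in zip(row_a, row_b)
)
print(max_diff)  # differs from zero only by floating-point rounding
```

A model that consumes only these distances therefore gives the same output however the molecule is positioned or oriented in space.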

11.

NHGNN-DTA: a node-adaptive hybrid graph neural network for interpretable drug-target binding affinity prediction.

He, Haohuai; Chen, Guanxing; Chen, Calvin Yu-Chian.

Bioinformatics ; 39(6), 2023 Jun 01.

Article in English | MEDLINE | ID: mdl-37252835

ABSTRACT

MOTIVATION: Large-scale prediction of drug-target affinity (DTA) plays an important role in drug discovery. In recent years, machine learning algorithms have made great progress in DTA prediction by utilizing sequence or structural information of both drugs and proteins. However, sequence-based algorithms ignore the structural information of molecules and proteins, while graph-based algorithms are insufficient in feature extraction and information interaction. RESULTS: In this article, we propose NHGNN-DTA, a node-adaptive hybrid neural network for interpretable DTA prediction. It can adaptively acquire feature representations of drugs and proteins and allow information to interact at the graph level, effectively combining the advantages of both sequence-based and graph-based approaches. Experimental results have shown that NHGNN-DTA achieved new state-of-the-art performance. It achieved the mean squared error (MSE) of 0.196 on the Davis dataset (below 0.2 for the first time) and 0.124 on the KIBA dataset (3% improvement). Meanwhile, in the case of cold start scenario, NHGNN-DTA proved to be more robust and more effective with unseen inputs than baseline methods. Furthermore, the multi-head self-attention mechanism endows the model with interpretability, providing new exploratory insights for drug discovery. The case study on Omicron variants of SARS-CoV-2 illustrates the efficient utilization of drug repurposing in COVID-19. AVAILABILITY AND IMPLEMENTATION: The source code and data are available at https://github.com/hehh77/NHGNN-DTA.

Subject(s)

COVID-19; Humans; SARS-CoV-2; Neural Networks, Computer; Algorithms

12.

Retrosynthesis prediction using an end-to-end graph generative architecture for molecular graph editing.

Zhong, Weihe; Yang, Ziduo; Chen, Calvin Yu-Chian.

Nat Commun ; 14(1): 3009, 2023 May 25.

Article in English | MEDLINE | ID: mdl-37230985

ABSTRACT

Retrosynthesis planning, the process of identifying a set of available reactions to synthesize the target molecules, remains a major challenge in organic synthesis. Recently, computer-aided synthesis planning has gained renewed interest and various retrosynthesis prediction algorithms based on deep learning have been proposed. However, most existing methods are limited in the applicability and interpretability of their predictions, and further improvement of predictive accuracy to a more practical level is still required. In this work, inspired by the arrow-pushing formalism in chemical reaction mechanisms, we present an end-to-end architecture for retrosynthesis prediction called Graph2Edits. Specifically, Graph2Edits uses a graph neural network to predict the edits of the product graph in an auto-regressive manner, and sequentially generates transformation intermediates and final reactants according to the predicted edit sequence. This strategy combines the two-stage processes of semi-template-based methods into one-pot learning, improving the applicability in some complicated reactions, and also making its predictions more interpretable. Evaluated on the standard benchmark dataset USPTO-50k, our model achieves state-of-the-art performance for semi-template-based retrosynthesis with a promising 55.1% top-1 accuracy.

13.

Meta Learning With Graph Attention Networks for Low-Data Drug Discovery.

Lv, Qiujie; Chen, Guanxing; Yang, Ziduo; Zhong, Weihe; Chen, Calvin Yu-Chian.

IEEE Trans Neural Netw Learn Syst ; PP, 2023 Mar 06.

Article in English | MEDLINE | ID: mdl-37028032

ABSTRACT

Finding candidate molecules with favorable pharmacological activity, low toxicity, and proper pharmacokinetic properties is an important task in drug discovery. Deep neural networks have made impressive progress in accelerating and improving drug discovery. However, these techniques rely on a large amount of labeled data to form accurate predictions of molecular properties. At each stage of the drug discovery pipeline, usually only a small amount of biological data on candidate molecules and derivatives is available, so applying deep neural networks to low-data drug discovery remains a formidable challenge. Here, we propose a meta learning architecture with a graph attention network, Meta-GAT, to predict molecular properties in low-data drug discovery. The GAT captures the local effects of atomic groups at the atom level through the triple attentional mechanism and implicitly captures the interactions between different atomic groups at the molecular level. The GAT is used to perceive the molecular chemical environment and connectivity, thereby effectively reducing sample complexity. Meta-GAT further develops a meta learning strategy based on bilevel optimization, which transfers meta knowledge from other attribute prediction tasks to low-data target tasks. In summary, our work demonstrates how meta learning can reduce the amount of data required to make meaningful predictions of molecules in low-data scenarios. Meta learning is likely to become the new learning paradigm in low-data drug discovery. The source code is publicly available at: https://github.com/lol88/Meta-GAT.
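The bilevel optimization mentioned in the abstract above can be pictured as two nested loops: an inner loop that adapts to one task from a few examples, and an outer loop that improves the shared starting point across tasks. The Python sketch below is a first-order approximation on a one-parameter linear model, offered only to show the loop structure. The task family, learning rates, and every function name are hypothetical; it is not the Meta-GAT implementation, which applies the idea to a graph attention network.

```python
import random

def mse(w, data):
    """Mean squared error of the 1-D model y_hat = w * x."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def mse_grad(w, data):
    """Gradient of the mean squared error with respect to w."""
    return sum(2.0 * (w * x - y) * x for x, y in data) / len(data)

def adapt(w, support, inner_lr=0.1, steps=5):
    """Inner loop: a few gradient steps on one task's small support set."""
    for _ in range(steps):
        w -= inner_lr * mse_grad(w, support)
    return w

def sample_task(rng, n=5):
    """Hypothetical task family: y = slope * x with slope drawn near 3."""
    slope = rng.uniform(2.5, 3.5)
    data = [(x, slope * x) for x in (rng.uniform(-1.0, 1.0) for _ in range(2 * n))]
    return data[:n], data[n:]  # support set, query set

def meta_train(w=0.0, outer_lr=0.05, iterations=200, seed=0):
    """Outer loop (first-order): move the shared initialization using the
    query-set gradient evaluated at the task-adapted weight."""
    rng = random.Random(seed)
    for _ in range(iterations):
        support, query = sample_task(rng)
        w_task = adapt(w, support)
        w -= outer_lr * mse_grad(w_task, query)
    return w

w_meta = meta_train()
print(w_meta)  # ends up near the slopes of the task family
```

Starting from `w_meta` rather than from zero, a new task in the same family needs far fewer examples to fit well, which is the low-data benefit the abstract describes.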

14.

DSIL-DDI: A Domain-Invariant Substructure Interaction Learning for Generalizable Drug-Drug Interaction Prediction.

Tang, Zhenchao; Chen, Guanxing; Yang, Hualin; Zhong, Weihe; Chen, Calvin Yu-Chian.

IEEE Trans Neural Netw Learn Syst ; PP, 2023 Feb 14.

Article in English | MEDLINE | ID: mdl-37022856

ABSTRACT

Drug-drug interactions (DDIs) trigger unexpected pharmacological effects in vivo, often with unknown causal mechanisms. Deep learning methods have been developed to better understand DDI. However, learning domain-invariant representations for DDI remains a challenge. Generalizable DDI predictions are closer to reality than source domain predictions. For existing methods, it is difficult to achieve out-of-distribution (OOD) predictions. In this article, focusing on substructure interaction, we propose DSIL-DDI, a pluggable substructure interaction module that can learn domain-invariant representations of DDIs from the source domain. We evaluate DSIL-DDI in three scenarios: the transductive setting (all drugs in the test set appear in the training set), the inductive setting (the test set contains new drugs that were not present in the training set), and the OOD generalization setting (the training set and test set belong to two different datasets). The results demonstrate that DSIL-DDI improves the generalization and interpretability of DDI prediction modeling and provides valuable insights for OOD DDI predictions. DSIL-DDI can help doctors ensure the safety of drug administration and reduce the harm caused by drug abuse.
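The inductive setting described in the abstract above requires that some drugs never appear during training. As a minimal Python sketch of that idea, the function below sends every interaction pair that involves a held-out drug to the test set. The drug names and the helper `split_pairs` are hypothetical, and the paper's actual splitting procedure may differ.

```python
def split_pairs(pairs, new_drugs):
    """Inductive split: any pair that touches a held-out drug goes to test."""
    train = [p for p in pairs if p[0] not in new_drugs and p[1] not in new_drugs]
    test = [p for p in pairs if p[0] in new_drugs or p[1] in new_drugs]
    return train, test

def seen_drugs(pairs):
    """Set of all drugs that occur in a list of pairs."""
    return {drug for pair in pairs for drug in pair}

# Toy interaction pairs between four placeholder drugs.
pairs = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("A", "D")]
train, test = split_pairs(pairs, new_drugs={"D"})
print(train)
print(test)
```

In the transductive setting, by contrast, pairs would be split at random, so every drug in the test set would normally also occur in training.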

15.

TCMBank-the largest TCM database provides deep learning-based Chinese-Western medicine exclusion prediction.

Lv, Qiujie; Chen, Guanxing; He, Haohuai; Yang, Ziduo; Zhao, Lu; Zhang, Kang; Chen, Calvin Yu-Chian.

Signal Transduct Target Ther ; 8(1): 127, 2023 Mar 31.

Article in English | MEDLINE | ID: mdl-36997527

Subject(s)

Databases, Factual; Deep Learning; Drugs, Chinese Herbal; Medicine, Chinese Traditional; Humans

16.

Multitask deep learning with dynamic task balancing for quantum mechanical properties prediction.

Yang, Ziduo; Zhong, Weihe; Lv, Qiujie; Chen, Calvin Yu-Chian.

Phys Chem Chem Phys ; 24(9): 5383-5393, 2022 Mar 02.

Article in English | MEDLINE | ID: mdl-35169821

ABSTRACT

Predicting quantum mechanical properties (QMPs) is very important for the innovation of material and chemistry science. Multitask deep learning models have been widely used in QMPs prediction. However, existing multitask learning models often train multiple QMPs prediction tasks simultaneously without considering the internal relationships and differences between tasks, which may cause the model to overfit easy tasks. In this study, we first proposed a multiscale dynamic attention graph neural network (MDGNN) for molecular representation learning. The MDGNN was designed in a multitask learning fashion that can solve multiple learning tasks at the same time. We then introduced a dynamic task balancing (DTB) strategy combining task differences and difficulties to reduce overfitting across multiple tasks. Finally, we adopted gradient-weighted class activation mapping (Grad-CAM) to analyze a deep learning model for frontier molecular orbital, highest occupied molecular orbital (HOMO) and lowest unoccupied molecular orbital (LUMO) energy level predictions. We evaluated our approach using two large QMPs datasets and compared the proposed method to the state-of-the-art multitask learning models. The MDGNN outperforms other multitask learning approaches on two datasets. The DTB strategy can further improve the performance of MDGNN significantly. Moreover, we show that Grad-CAM creates explanations that are consistent with the molecular orbitals theory. These advantages demonstrate that the proposed method improves the generalization and interpretation capability of QMPs prediction modeling.

Subject(s)

Deep Learning; Machine Learning; Neural Networks, Computer

17.

FusionDTA: attention-based feature polymerizer and knowledge distillation for drug-target binding affinity prediction.

Yuan, Weining; Chen, Guanxing; Chen, Calvin Yu-Chian.

Brief Bioinform ; 23(1), 2022 Jan 17.

Article in English | MEDLINE | ID: mdl-34929738

ABSTRACT

The prediction of drug-target affinity (DTA) plays an increasingly important role in drug discovery. Many current prediction methods focus on feature encoding of drugs and proteins but ignore the importance of feature aggregation. Moreover, increasingly complex encoder networks lead to the loss of implicit information and excessive model size. To this end, we propose a deep-learning-based approach named FusionDTA. To address the loss of implicit information, a novel multi-head linear attention mechanism was used to replace the rough pooling method. This allows FusionDTA to aggregate global information based on attention weights, instead of selecting the largest value as max-pooling does. To address parameter redundancy, we applied knowledge distillation in FusionDTA by transferring learnable information from a teacher model to a student model. Results show that FusionDTA performs better than existing models for the test domain on all evaluation metrics. We obtained concordance index (CI) values of 0.913 and 0.906 on the Davis and KIBA datasets, respectively, compared with 0.893 and 0.891 for the previous state-of-the-art model. Under the cold-start constraint, our model proved to be more robust and more effective with unseen inputs than baseline methods. In addition, knowledge distillation saved half of the parameters of the model, with only a 0.006 reduction in CI. Even FusionDTA with half the parameters could easily exceed the baseline on all metrics. In general, our model has superior performance and improves drug-target interaction (DTI) prediction. The visualization of DTI can effectively help predict the binding region of proteins during structure-based drug design.

Subject(s)

Drug Development , Proteins , Drug Discovery , Humans , Knowledge , Proteins/chemistry
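
The FusionDTA abstract above reports its results as a concordance index (CI). As a reading aid, here is a minimal sketch of the usual pairwise definition of CI for affinity prediction: the fraction of comparable pairs (pairs whose true affinities differ) that the predictions rank in the same order, with tied predictions counted as half. The function name and the toy values are illustrative assumptions; this is not the authors' evaluation code, and the paper may handle ties differently.

```python
from itertools import combinations


def concordance_index(y_true, y_pred):
    """Pairwise concordance index (illustrative sketch, O(n^2)).

    A pair is comparable when its two true affinities differ. It scores 1
    if the predictions order the pair the same way, 0.5 if the two
    predictions are tied, and 0 otherwise.
    """
    score = 0.0
    comparable = 0
    for (t_i, p_i), (t_j, p_j) in combinations(zip(y_true, y_pred), 2):
        if t_i == t_j:
            continue  # equal true affinities: pair is not comparable
        comparable += 1
        # Orient the difference so it is positive when the sample with the
        # larger true affinity also has the larger prediction.
        diff = (p_i - p_j) if t_i > t_j else (p_j - p_i)
        if diff > 0:
            score += 1.0
        elif diff == 0:
            score += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return score / comparable


# Hypothetical affinities, not data from the paper.
true_affinity = [5.0, 6.2, 7.1, 8.0]
predicted = [5.1, 7.0, 6.5, 8.2]
print(round(concordance_index(true_affinity, predicted), 3))  # 0.833
```

In the toy example, five of the six comparable pairs are ranked correctly, so the CI is 5/6. A CI of 1.0 means every pair is ordered correctly, and 0.5 corresponds to random ordering.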

18.

Enhancing adversarial defense for medical image analysis systems with pruning and attention mechanism.

Chen, Lun; Zhao, Lu; Chen, Calvin Yu-Chian.

Med Phys ; 48(10): 6198-6212, 2021 Oct.

Article in English | MEDLINE | ID: mdl-34487364

ABSTRACT

PURPOSE: Deep learning has achieved impressive performance across a variety of tasks, including medical image processing. However, recent research has shown that deep neural networks (DNNs) are susceptible to small adversarial perturbations in the image, which raises safety concerns about the deployment of these systems in clinical settings. METHODS: Our analysis attributes the vulnerability of existing medical image DNN models to adversarial examples to two factors: the complex biological texture of medical images and the overparameterization of the models. To improve the defense of medical imaging systems against adversarial examples, we therefore propose a new model-based defense framework for medical image DNN models, equipped with a pruning module and an attention mechanism module. RESULTS: Experiments on three benchmark medical image datasets verify the effectiveness of our method in improving the robustness of medical image DNN models. On the chest X-ray datasets, our defense method achieves up to a 77.18% defense rate against the projected gradient descent attack and a 69.49% defense rate against the DeepFool attack. Ablation experiments on the pruning module and the attention mechanism module further verify that pruning and attention can effectively improve the robustness of medical image DNN models. CONCLUSIONS: Compared with existing model-based defense methods proposed for natural images, our defense method is more suitable for medical images. Our method can serve as a general strategy for designing more explainable and secure medical deep learning systems, and can be widely used in various medical image tasks to improve the robustness of medical models.

Subject(s)

Image Processing, Computer-Assisted , Neural Networks, Computer

19.

CARes-UNet: Content-aware residual UNet for lesion segmentation of COVID-19 from chest CT images.

Xu, Xinhua; Wen, Yuhang; Zhao, Lu; Zhang, Yi; Zhao, Youjun; Tang, Zixuan; Yang, Ziduo; Chen, Calvin Yu-Chian.

Med Phys ; 48(11): 7127-7140, 2021 Nov.

Article in English | MEDLINE | ID: mdl-34528263

ABSTRACT

PURPOSE: Coronavirus disease 2019 (COVID-19) has caused a serious global health crisis. Deep learning methods have shown great potential to assist doctors in diagnosing COVID-19 by automatically segmenting the lesions in computed tomography (CT) slices. However, several challenges still restrict the application of these methods, including high variation in lesion characteristics and low contrast between lesion areas and healthy tissues. Moreover, the lack of high-quality labeled samples and the large number of patients make it urgent to develop a high-accuracy model that performs well not only under full supervision but also with semi-supervised methods. METHODS: We propose a content-aware residual UNet (CARes-UNet), a deep residual network for lung infection segmentation, to segment the lesion areas of COVID-19 from chest CT slices. In CARes-UNet, residual connections are used in the convolutional block, which alleviates the degradation problem during training. Content-aware upsampling modules are then introduced to improve the performance of the model while reducing the computation cost. Moreover, to achieve faster convergence, an advanced optimizer named Ranger is used to update the model's parameters during training. Finally, we employ a semi-supervised segmentation framework to deal with the lack of pixel-level labeled data. RESULTS: We evaluated our approach on three public datasets with multiple metrics and compared its performance with that of several models. Our method outperforms other models on multiple indicators; for instance, in terms of Dice coefficient on the COVID-SemiSeg Dataset, CARes-UNet scored 0.731, and semi-CARes-UNet further boosted it to 0.776. Additional ablation studies validated the effectiveness of each key component of our proposed model. CONCLUSIONS: Compared with existing neural network methods applied to COVID-19 lesion segmentation tasks, our CARes-UNet yields more accurate segmentation results, and semi-CARes-UNet can further improve them using semi-supervised learning, presenting a possible way to address the lack of high-quality annotated samples. Our CARes-UNet and semi-CARes-UNet can be used in artificial intelligence-empowered computer-aided diagnosis systems to improve diagnostic accuracy in the ongoing COVID-19 pandemic.

Subject(s)

COVID-19 , Pandemics , Artificial Intelligence , Humans , Image Processing, Computer-Assisted , SARS-CoV-2 , Tomography, X-Ray Computed
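
The CARes-UNet abstract above reports segmentation quality as a Dice coefficient. For readers unfamiliar with the metric, the sketch below computes the standard Dice score, 2|A ∩ B| / (|A| + |B|), for two binary masks given as nested lists. The function name, the mask layout, and the convention of returning 1.0 for two empty masks are assumptions made for illustration; this is not the authors' evaluation code.

```python
def dice_coefficient(pred_mask, true_mask):
    """Dice overlap between two binary masks (illustrative sketch).

    Masks are 2D nested lists of 0/1 values with the same shape.
    Returns 2 * |intersection| / (|pred| + |true|).
    """
    pred = [bool(v) for row in pred_mask for v in row]
    true = [bool(v) for row in true_mask for v in row]
    if len(pred) != len(true):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(p and t for p, t in zip(pred, true))
    total = sum(pred) + sum(true)
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * intersection / total


# Hypothetical 2x3 lesion masks, not data from the paper.
predicted = [[1, 1, 0],
             [0, 1, 0]]
reference = [[1, 0, 0],
             [0, 1, 1]]
print(round(dice_coefficient(predicted, reference), 3))  # 0.667
```

Here the masks share two lesion pixels and each contains three, giving 2 * 2 / 6. Scores range from 0 (no overlap) to 1 (identical masks).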

20.

Predicting Drug-Target Interactions with Deep-Embedding Learning of Graphs and Sequences.

Chen, Wei; Chen, Guanxing; Zhao, Lu; Chen, Calvin Yu-Chian.

J Phys Chem A ; 125(25): 5633-5642, 2021 Jul 01.

Article in English | MEDLINE | ID: mdl-34142824

ABSTRACT

Computational approaches for predicting drug-target interactions (DTIs) play an important role in drug discovery since conventional screening experiments are time-consuming and expensive. In this study, we propose end-to-end representation learning that combines a graph neural network with an attention mechanism and an attentive bidirectional long short-term memory (BiLSTM) network to predict DTIs. For efficient training, we introduce a pretraining method based on bidirectional encoder representations from transformers (BERT) to extract substructure features from protein sequences and a local breadth-first search (BFS) to learn subgraph information from molecular graphs. Integrating both models, we developed a DTI prediction system. The proposed method achieved high performance, with increases of 2.4% and 9.4% in AUC and recall, respectively, on unbalanced datasets compared with other methods. Extensive experiments showed that our model can, to a reasonable degree, screen potential drugs for a specific protein. Furthermore, visualizing the attention weights provides biological insight.

Subject(s)

Computational Biology/methods , Computer Graphics , Deep Learning , Drug Discovery/methods , Pharmaceutical Preparations/metabolism , Proteins/chemistry , Proteins/metabolism , Amino Acid Sequence
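
The abstract above mentions a local breadth-first search (BFS) over molecular graphs but gives no details. The sketch below shows one plausible reading, a depth-limited BFS that collects the atoms within a fixed number of bonds of a starting atom. The adjacency-list representation, the `radius` parameter, and the function name are assumptions made for illustration and may differ from what the authors implemented.

```python
from collections import deque


def local_bfs(adjacency, start, radius):
    """Depth-limited BFS on a molecular graph (illustrative sketch).

    adjacency maps each atom index to a list of bonded atom indices.
    Returns a dict mapping every atom within `radius` bonds of `start`
    to its bond distance from `start`.
    """
    distance = {start: 0}
    queue = deque([start])
    while queue:
        atom = queue.popleft()
        if distance[atom] == radius:
            continue  # do not expand past the local neighborhood
        for neighbor in adjacency[atom]:
            if neighbor not in distance:
                distance[neighbor] = distance[atom] + 1
                queue.append(neighbor)
    return distance


# Hypothetical heavy-atom graph of ethanol: C0 - C1 - O2.
ethanol = {0: [1], 1: [0, 2], 2: [1]}
print(local_bfs(ethanol, 0, 1))  # {0: 0, 1: 1}
print(local_bfs(ethanol, 0, 2))  # {0: 0, 1: 1, 2: 2}
```

Running the search from every atom yields one local subgraph per atom, which a downstream model could then embed. How the subgraphs are encoded is outside the scope of this sketch.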
