ABSTRACT
The Kirsten rat sarcoma viral oncogene homolog (KRAS) gene is one of the most commonly mutated oncogenes, and KRAS inhibitors have shown therapeutic potential in cancer patients carrying mutations in this gene. In this study, machine learning was applied to develop a quantitative structure-activity relationship (QSAR) model for small-molecule KRAS inhibitors. A total of 1857 data points pairing IC50 values with SMILES (simplified molecular input line entry system) strings for KRAS inhibitors were collected from three databases: ChEMBL, BindingDB, and PubChem. Nine classifiers were constructed by combining three feature screening methods with three machine learning models: random forest, support vector machine (SVM), and extreme gradient boosting machine. The SVM model combined with mutual information feature selection performed best on the test set (AUCtest = 0.912, ACCtest = 0.859, F1test = 0.890) and also showed good predictive performance on the external validation set (AUCExt = 0.944, RecallExt = 0.856, FPRExt = 0.111). This study provides a new technical route for screening KRAS inhibitors in natural product databases using artificial intelligence methods.
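For readers unfamiliar with the reported metrics (AUC, ACC, F1, Recall, FPR), the following is a minimal sketch of how they can be computed for a binary active/inactive classifier. It assumes labels are coded as 1 (active) and 0 (inactive); the function names `binary_metrics` and `roc_auc` and the toy data are illustrative assumptions, not taken from the study.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, recall, false positive rate and F1 for 0/1 labels (1 = active)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    accuracy = (tp + tn) / len(pairs)
    recall = tp / (tp + fn) if (tp + fn) else 0.0      # true positive rate
    fpr = fp / (fp + tn) if (fp + tn) else 0.0         # false positive rate
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "recall": recall, "fpr": fpr, "f1": f1}


def roc_auc(y_true, scores):
    """AUC as the probability that a random active outscores a random inactive.

    Ties between an active and an inactive score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one active and one inactive sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (made-up labels and scores, for illustration only).
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.7, 0.2, 0.55]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed decision threshold of 0.5

print(binary_metrics(y_true, y_pred))
print(roc_auc(y_true, scores))
```

AUC is computed from the raw scores and does not depend on the decision threshold, whereas ACC, F1, Recall, and FPR all change with the threshold chosen to call a compound active.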
ABSTRACT
The prediction of compound-protein interaction (CPI) is a critical tool for discovering lead compounds and repurposing drugs during drug development. In recent years, deep learning has been widely used in CPI research and has accelerated the development of CPI prediction in drug discovery. This review focuses on feature-based CPI prediction models. First, we describe the datasets and the typical feature representation methods commonly used for compounds and proteins in CPI prediction. Based on the critical problems in modeling, we then discuss CPI prediction models from two perspectives: multimodal features and attention mechanisms. Next, we evaluate the performance of 12 selected models on 3 benchmark datasets for both classification and regression tasks. Finally, we summarize the existing challenges in this field and outline prospects for future directions. We believe that this investigation will provide reference and insight for further research on CPI prediction.