Your browser doesn't support javascript.
loading
An ensemble machine learning model generates a focused screening library for the identification of CDK8 inhibitors.
Lin, Tony Eight; Yen, Dyan; HuangFu, Wei-Chun; Wu, Yi-Wen; Hsu, Jui-Yi; Yen, Shih-Chung; Sung, Tzu-Ying; Hsieh, Jui-Hua; Pan, Shiow-Lin; Yang, Chia-Ron; Huang, Wei-Jan; Hsu, Kai-Cheng.
Affiliation
  • Lin TE; Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Yen D; Ph.D. Program for Cancer Molecular Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • HuangFu WC; Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Wu YW; Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Hsu JY; Ph.D. Program for Cancer Molecular Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Yen SC; TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei, Taiwan.
  • Sung TY; Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Hsieh JH; Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Pan SL; Ph.D. Program for Cancer Molecular Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.
  • Yang CR; Warshel Institute for Computational Biology, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong, People's Republic of China.
  • Huang WJ; Biomedical Translation Research Center, Academia Sinica, Taipei, Taiwan.
  • Hsu KC; Division of Translational Toxicology, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, North Carolina, USA.
Protein Sci ; 33(6): e5007, 2024 Jun.
Article in En | MEDLINE | ID: mdl-38723187
ABSTRACT
The identification of an effective inhibitor is an important starting step in drug development. Unfortunately, many issues such as the characterization of protein binding sites, the screening library, materials for assays, etc., make drug screening a difficult proposition. As the size of screening libraries increases, more resources will be inefficiently consumed. Thus, new strategies are needed to preprocess and focus a screening library towards a targeted protein. Herein, we report an ensemble machine learning (ML) model to generate a CDK8-focused screening library. The ensemble model consists of six different algorithms optimized for CDK8 inhibitor classification. The models were trained using a CDK8-specific fragment library along with molecules containing CDK8 activity. The optimized ensemble model processed a commercial library containing 1.6 million molecules. This resulted in a CDK8-focused screening library containing 1,672 molecules, a reduction of more than 99.90%. The CDK8-focused library was then subjected to molecular docking, and 25 candidate compounds were selected. Enzymatic assays confirmed six CDK8 inhibitors, with one compound producing an IC50 value of ≤100 nM. Analysis of the ensemble ML model reveals the role of the CDK8 fragment library during training. Structural analysis of molecules reveals the hit compounds to be structurally novel CDK8 inhibitors. Together, the results highlight a pipeline for curating a focused library for a specific protein target, such as CDK8.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Protein Kinase Inhibitors / Drug Evaluation, Preclinical / Cyclin-Dependent Kinase 8 / Machine Learning Limits: Humans Language: En Journal: Protein Sci Journal subject: BIOQUIMICA Year: 2024 Document type: Article Affiliation country: Taiwán

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Protein Kinase Inhibitors / Drug Evaluation, Preclinical / Cyclin-Dependent Kinase 8 / Machine Learning Limits: Humans Language: En Journal: Protein Sci Journal subject: BIOQUIMICA Year: 2024 Document type: Article Affiliation country: Taiwán