RESUMEN
Cyclic peptides are versatile therapeutic agents that boast high binding affinity, minimal toxicity, and the potential to engage challenging protein targets. However, the pharmaceutical utility of cyclic peptides is limited by their low membrane permeability-an essential indicator of oral bioavailability and intracellular targeting. Current machine learning-based models of cyclic peptide permeability show variable performance owing to the limitations of experimental data. Furthermore, these methods use features derived from the whole molecule that have traditionally been used to predict small molecules and ignore the unique structural properties of cyclic peptides. This study presents CycPeptMP: an accurate and efficient method to predict cyclic peptide membrane permeability. We designed features for cyclic peptides at the atom-, monomer-, and peptide-levels and seamlessly integrated these into a fusion model using deep learning technology. Additionally, we applied various data augmentation techniques to enhance model training efficiency using the latest data. The fusion model exhibited excellent prediction performance for the logarithm of permeability, with a mean absolute error of $0.355$ and correlation coefficient of $0.883$. Ablation studies demonstrated that all feature levels contributed and were relatively essential to predicting membrane permeability, confirming the effectiveness of augmentation to improve prediction accuracy. A comparison with a molecular dynamics-based method showed that CycPeptMP accurately predicted peptide permeability, which is otherwise difficult to predict using simulations.
Asunto(s)
Permeabilidad de la Membrana Celular , Péptidos Cíclicos , Péptidos Cíclicos/química , Péptidos Cíclicos/metabolismo , Aprendizaje Automático , Aprendizaje Profundo , Biología Computacional/métodosRESUMEN
Protein-ligand docking plays a significant role in structure-based drug discovery. This methodology aims to estimate the binding mode and binding free energy between the drug-targeted protein and candidate chemical compounds, utilizing protein tertiary structure information. Reformulation of this docking as a quadratic unconstrained binary optimization (QUBO) problem to obtain solutions via quantum annealing has been attempted. However, previous studies did not consider the internal degrees of freedom of the compound that is mandatory and essential. In this study, we formulated fragment-based protein-ligand flexible docking, considering the internal degrees of freedom of the compound by focusing on fragments (rigid chemical substructures of compounds) as a QUBO problem. We introduced four factors essential for fragment-based docking in the Hamiltonian: (1) interaction energy between the target protein and each fragment, (2) clashes between fragments, (3) covalent bonds between fragments, and (4) the constraint that each fragment of the compound is selected for a single placement. We also implemented a proof-of-concept system and conducted redocking for the protein-compound complex structure of Aldose reductase (a drug target protein) using SQBM+, which is a simulated quantum annealer. The predicted binding pose reconstructed from the best solution was near-native (RMSD = 1.26 Å), which can be further improved (RMSD = 0.27 Å) using conventional energy minimization. The results indicate the validity of our QUBO problem formulation.
RESUMEN
MOTIVATION: In recent years, cyclic peptide drugs have been receiving increasing attention because they can target proteins that are difficult to be tackled by conventional small-molecule drugs or antibody drugs. Plasma protein binding rate (%PPB) is a significant pharmacokinetic property of a compound in drug discovery and design. However, due to structural differences, previous computational prediction methods developed for small-molecule compounds cannot be successfully applied to cyclic peptides, and methods for predicting the PPB rate of cyclic peptides with high accuracy are not yet available. RESULTS: Cyclic peptides are larger than small molecules, and their local structures have a considerable impact on PPB; thus, molecular descriptors expressing residue-level local features of cyclic peptides, instead of those expressing the entire molecule, as well as the circularity of the cyclic peptides should be considered. Therefore, we developed a prediction method named CycPeptPPB using deep learning that considers both factors. First, the macrocycle ring of cyclic peptides was decomposed residue by residue. The residue-based descriptors were arranged according to the sequence information of the cyclic peptide. Furthermore, the circular data augmentation method was used, and the circular convolution method CyclicConv was devised to express the cyclic structure. CycPeptPPB exhibited excellent performance, with mean absolute error (MAE) of 4.79% and correlation coefficient (R) of 0.92 for the public drug dataset, compared to the prediction performance of the existing PPB rate prediction software (MAE=15.08%, R=0.63). AVAILABILITY AND IMPLEMENTATION: The data underlying this article are available in the online supplementary material. The source code of CycPeptPPB is available at https://github.com/akiyamalab/cycpeptppb. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Aprendizaje Profundo , Péptidos Cíclicos , Proteínas Sanguíneas , Unión Proteica , Programas InformáticosRESUMEN
Peptides have attracted much attention recently owing to their well-balanced properties as drugs against protein-protein interaction (PPI) surfaces. Molecular simulation-based predictions of binding sites and amino acid residues with high affinity to PPI surfaces are expected to accelerate the design of peptide drugs. Mixed-solvent molecular dynamics (MSMD), which adds probe molecules or fragments of functional groups as solutes to the hydration model, detects the binding hotspots and cryptic sites induced by small molecules. The detection results vary depending on the type of probe molecule; thus, they provide important information for drug design. For rational peptide drug design using MSMD, we proposed MSMD with amino acid residue probes, named amino acid probe-based MSMD (AAp-MSMD), to detect hotspots and identify favorable amino acid types on protein surfaces to which peptide drugs bind. We assessed our method in terms of hotspot detection at the amino acid probe level and binding free energy prediction with amino acid probes at the PPI site for the complex structure that formed the PPI. In hotspot detection, the max-spatial probability distribution map (max-PMAP) obtained from AAp-MSMD detected the PPI site, to which each type of amino acid can bind favorably. In the binding free energy prediction using amino acid probes, ΔGFE obtained from AAp-MSMD roughly estimated the experimental binding affinities from the structure-activity relationship. AAp-MSMD, with amino acid probes, provides estimated binding sites and favorable amino acid types at the PPI site of a target protein.
Asunto(s)
Aminoácidos , Simulación de Dinámica Molecular , Solventes/química , Aminoácidos/metabolismo , Proteínas/química , Sitios de Unión , Péptidos/química , Unión ProteicaRESUMEN
Recently, cyclic peptides have been considered breakthrough drugs because they can interact with "undruggable" targets such as intracellular protein-protein interactions. Membrane permeability is an essential indicator of oral bioavailability and intracellular targeting, and the development of membrane-permeable peptides is a bottleneck in cyclic peptide drug discovery. Although many experimental data on membrane permeability of cyclic peptides have been reported, a comprehensive database is not yet available. A comprehensive membrane permeability database is essential for developing computational methods for cyclic peptide drug design. In this study, we constructed CycPeptMPDB, the first web-accessible database of cyclic peptide membrane permeability. We collected information on a total of 7334 cyclic peptides, including the structure and experimentally measured membrane permeability, from 45 published papers and 2 patents from pharmaceutical companies. To unambiguously represent cyclic peptides larger than small molecules, we used the hierarchical editing language for macromolecules notation to generate a uniform sequence representation of peptides. In addition to data storage, CycPeptMPDB provides several supporting functions such as online data visualization, data analysis, and downloading. CycPeptMPDB is expected to be a valuable platform to support membrane permeability research on cyclic peptides. CycPeptMPDB can be freely accessed at http://cycpeptmpdb.com.
Asunto(s)
Péptidos Cíclicos , Péptidos , Péptidos Cíclicos/química , Péptidos/química , Diseño de Fármacos , Descubrimiento de Drogas/métodos , Permeabilidad , Permeabilidad de la Membrana CelularRESUMEN
Cyclic peptides have attracted attention as a promising pharmaceutical modality due to their potential to selectively inhibit previously undruggable targets, such as intracellular protein-protein interactions. Poor membrane permeability is the biggest bottleneck hindering successful drug discovery based on cyclic peptides. Therefore, the development of computational methods that can predict membrane permeability and support elucidation of the membrane permeation mechanism of drug candidate peptides is much sought after. In this study, we developed a protocol to simulate the behavior in membrane permeation steps and estimate the membrane permeability of large cyclic peptides with more than or equal to 10 residues. This protocol requires the use of a more realistic membrane model than a single-lipid phospholipid bilayer. To select a membrane model, we first analyzed the effect of cholesterol concentration in the model membrane on the potential of mean force and hydrogen bonding networks along the direction perpendicular to the membrane surface as predicted by molecular dynamics simulations using cyclosporine A. These results suggest that a membrane model with 40 or 50 mol % cholesterol was suitable for predicting the permeation process. Subsequently, two types of membrane models containing 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine and 40 and 50 mol % cholesterol were used. To validate the efficiency of our protocol, the membrane permeability of 18 ten-residue peptides was predicted. Correlation coefficients of R > 0.8 between the experimental and calculated permeability values were obtained with both model membranes. The results of this study demonstrate that the lipid membrane is not just a medium but also among the main factors determining the membrane permeability of molecules. The computational protocol proposed in this study and the findings obtained on the effect of membrane model composition will contribute to building a schematic view of the membrane permeation process. Furthermore, the results of this study will eventually aid the elucidation of design rules for peptide drugs with high membrane permeability.
Asunto(s)
Simulación de Dinámica Molecular , Péptidos Cíclicos , Colesterol/química , Ciclosporina , Membrana Dobles de Lípidos/química , Péptidos/química , Péptidos Cíclicos/farmacología , Permeabilidad , Preparaciones Farmacéuticas , Fosfatidilcolinas/química , FosfolípidosRESUMEN
To ensure efficiency in discovery and development, the application of computational technology is essential. Although virtual screening techniques are widely applied in the early stages of drug discovery research, the computational methods used in lead optimization to improve activity and reduce the toxicity of compounds are still evolving. In this study, we propose a method to construct the residue interaction profile of the chemical structure used in the lead optimization by performing "inverse" mixed-solvent molecular dynamics (MSMD) simulation. Contrary to constructing a protein-based, atom interaction profile, we constructed a probe-based, protein residue interaction profile using MSMD trajectories. It provides us the profile of the preferred protein environments of probes without co-crystallized structures. We assessed the method using three probes: benzamidine, catechol, and benzene. As a result, the residue interaction profile of each probe obtained by MSMD was a reasonable physicochemical description of the general non-covalent interaction. Moreover, comparison with the X-ray structure containing each probe as a ligand shows that the map of the interaction profile matches the arrangement of amino acid residues in the X-ray structure.
Asunto(s)
Simulación de Dinámica Molecular , Sondas Moleculares , Ligandos , Proteínas/química , Solventes/químicaRESUMEN
In the polyomino puzzle, the aim is to fill a finite space using several polyomino pieces with no overlaps or blanks. Because it is an NP-complete combinatorial optimization problem, various probabilistic and approximated approaches have been applied to find solutions. Several previous studies embedded the polyomino puzzle in a QUBO problem, where the original objective function and constraints are transformed into the Hamiltonian function of the simulated Ising model. A solution to the puzzle is obtained by searching for a ground state of Hamiltonian by simulating the dynamics of the multiple-spin system. However, previous methods could solve only tiny polyomino puzzles considering a few combinations because their Hamiltonian designs were not efficient. We propose an improved Hamiltonian design that introduces new constraints and guiding terms to weakly encourage favorable spins and pairs in the early stages of computation. The proposed model solves the pentomino puzzle represented by approximately 2000 spins with >90% probability. Additionally, we extended the method to a generalized problem where each polyomino piece could be used zero or more times and solved it with approximately 100% probability. The proposed method also appeared to be effective for the 3D polycube puzzle, which is similar to applications in fragment-based drug discovery.
RESUMEN
Cosolvent molecular dynamics (CMD) simulations involve an MD simulation of a protein in the presence of explicit water molecules mixed with cosolvent molecules to perform hotspot detection, binding site identification, and binding energy estimation, while other existing methods (e.g., MixMD, SILCS, and MDmix) utilize small molecules that represent functional groups of compounds. However, the cosolvent selections employed in these methods differ and there are only a few cosolvents that are commonly used in these methods. In this study, we proposed a systematic method for constructing a set of cosolvents for drug discovery, termed the EXtended PRObes set construction by REpresentative Retrieval (EXPRORER). First, we extracted typical substructures from FDA-approved drugs, generated 138 cosolvent structures, and for each cosolvent molecule, we conducted CMD simulations to generate a spatial probability distribution map of cosolvent atoms (PMAP). Analyses of PMAP similarity revealed that a cosolvent pair with a PMAP similarity greater than 0.70-0.75 shared similar structural features. We present a method for the construction of a cosolvent subset that satisfies a similarity threshold for all cosolvents, and we tested the constructed sets for four proteins. To our knowledge, this is the first study to include a systematic proposal for cosolvent set construction, and thus, the EXPRORER cosolvents will provide deeper insights into ligand binding sites of various proteins.
Asunto(s)
Simulación de Dinámica Molecular , Proteínas , Sitios de Unión , Ligandos , SolventesRESUMEN
Membrane permeability is a significant obstacle facing the development of cyclic peptide drugs. However, membrane permeation mechanisms are poorly understood. To investigate common features of permeable (and nonpermeable) designs, it is necessary to reproduce the membrane permeation process of cyclic peptides through the lipid bilayer. We simulated the membrane permeation process of 100 six-residue cyclic peptides across the lipid bilayer based on steered molecular dynamics (MD) and replica-exchange umbrella sampling simulations and predicted membrane permeability using the inhomogeneous solubility-diffusion model and a modified version of it. Furthermore, we confirmed the effectiveness of this protocol by predicting the membrane permeability of 56 eight-residue cyclic peptides with diverse chemical structures, including some confidential designs from a pharmaceutical company. As a result, a reasonable correlation between experimentally assessed and calculated membrane permeability of cyclic peptides was observed for the peptide libraries, except for strongly hydrophobic peptides. Our analysis of the MD trajectory demonstrated that most peptides were stabilized in the boundary region between bulk water and membrane and that for most peptides, the process of crossing the center of the membrane is the main obstacle to membrane permeation. The height of this barrier is well correlated with the electrostatic interaction between the peptide and the surrounding media. The structural and energetic features of the representative peptide at each vertical position within the membrane were also analyzed, revealing that peptides permeate the membrane by changing their orientation and conformation according to the surrounding environment.
Asunto(s)
Membrana Dobles de Lípidos , Simulación de Dinámica Molecular , Conformación Molecular , Péptidos Cíclicos , PermeabilidadRESUMEN
Druglikeness is a useful concept for screening drug candidate compounds. We developed QEX, which is a new druglikeness index specific to individual targets. QEX is an improvement of the quantitative estimate of druglikeness (QED) method, which is a popular quantitative evaluation method of druglikeness proposed by Bickerton et al. QEX models the physicochemical properties of compounds that act on each target protein based on the concept of QED modeling physicochemical properties from information on US Food and Drug Administration-approved drugs. The result of the evaluation of PubChem assay data revealed that QEX showed better performance than the original QED did (the area under the curve value of the receiver operating characteristic curve improved by 0.069-0.236). We also present the c-Src inhibitor filtering results of the QEX constructed using Src family kinase inhibitors as a case study. QEX distinguished the inhibitors and non-inhibitors better than QED did. QEX works efficiently even when datasets of inactive compounds are unavailable. If both active and inactive compounds are present, QEX can be used as an initial filter to enhance the screening ability of conventional ligand-based virtual screenings.
Asunto(s)
Descubrimiento de Drogas , Inhibidores de Proteínas Quinasas , Familia-src Quinasas/antagonistas & inhibidores , Modelos MolecularesRESUMEN
BACKGROUND: Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. RESULTS: In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. CONCLUSION: MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on docking calculations with biochemical pathways and enables users to easily and quickly assess PPI feasibilities by archiving PPI predictions. MEGADOCK-Web also promotes the discovery of new PPIs and protein functions and is freely available for use at http://www.bi.cs.titech.ac.jp/megadock-web/ .
Asunto(s)
Bases de Datos de Proteínas , Internet , Mapeo de Interacción de Proteínas/métodos , Proteínas/química , Proteínas/metabolismo , Programas InformáticosRESUMEN
BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of biostability, and developing a computational method to predict PPB of drug candidate compounds contributes to the acceleration of drug discovery research. PPB prediction of small molecule drug compounds using machine learning has been conducted thus far; however, no study has investigated cyclic peptides because experimental information of cyclic peptides is scarce. RESULTS: First, we adopted sparse modeling and small molecule information to construct a PPB prediction model for cyclic peptides. As cyclic peptide data are limited, applying multidimensional nonlinear models involves concerns regarding overfitting. However, models constructed by sparse modeling can avoid overfitting, offering high generalization performance and interpretability. More than 1000 PPB data of small molecules are available, and we used them to construct a prediction models with two enumeration methods: enumerating lasso solutions (ELS) and forward beam search (FBS). The accuracies of the prediction models constructed by ELS and FBS were equal to or better than those of conventional non-linear models (MAE = 0.167-0.174) on cross-validation of a small molecule compound dataset. Moreover, we showed that the prediction accuracies for cyclic peptides were close to those for small molecule compounds (MAE = 0.194-0.288). Such high accuracy could not be obtained by a simple method of learning from cyclic peptide data directly by lasso regression (MAE = 0.286-0.671) or ridge regression (MAE = 0.244-0.354). CONCLUSION: In this study, we proposed a machine learning techniques that uses low-dimensional sparse modeling to predict the PPB value of cyclic peptides computationally. The low-dimensional sparse model not only exhibits excellent generalization performance but also improves interpretation of the prediction model. This can provide common an noteworthy knowledge for future cyclic peptide drug discovery studies.
Asunto(s)
Proteínas Sanguíneas/metabolismo , Simulación por Computador , Aprendizaje Automático , Modelos Teóricos , Péptidos Cíclicos/metabolismo , Preparaciones Farmacéuticas/metabolismo , Bibliotecas de Moléculas Pequeñas/metabolismo , Humanos , Unión ProteicaRESUMEN
MOTIVATION: Recently, the number of available protein tertiary structures and compounds has increased. However, structure-based virtual screening is computationally expensive owing to docking simulations. Thus, methods that filter out obviously unnecessary compounds prior to computationally expensive docking simulations have been proposed. However, the calculation speed of these methods is not fast enough to evaluate ≥ 10 million compounds. RESULTS: In this article, we propose a novel, docking-based pre-screening protocol named Spresso (Speedy PRE-Screening method with Segmented cOmpounds). Partial structures (fragments) are common among many compounds; therefore, the number of fragment variations needed for evaluation is smaller than that of compounds. Our method increases calculation speeds by â¼200-fold compared to conventional methods. AVAILABILITY AND IMPLEMENTATION: Spresso is written in C ++ and Python, and is available as an open-source code (http://www.bi.cs.titech.ac.jp/spresso/) under the GPLv3 license. CONTACT: akiyama@c.titech.ac.jp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Simulación del Acoplamiento Molecular/métodos , Proteínas/química , Programas Informáticos , Estructura Terciaria de Proteína , Proteínas/ultraestructuraRESUMEN
Virtual screening is a commonly used process to search for feasible drug candidates from a huge number of compounds during the early stages of drug design. As the compound database continues to expand to billions of entries or more, there remains an urgent need to accelerate the process of docking calculations. Reuse of calculation results is a possible way to accelerate the process. In this study, we first propose yet another virtual screening-oriented docking strategy by combining three factors, namely, compound decomposition, simplified fragment grid storing k-best scores, and flexibility consideration with pregenerated conformers. Candidate compounds contain many common fragments (chemical substructures). Thus, the calculation results of these common fragments can be reused among them. As a proof-of-concept of the aforementioned strategies, we also conducted the development of REstretto, a tool that implements the three factors to enable the reuse of calculation results. We demonstrated that the speed and accuracy of REstretto were comparable to those of AutoDock Vina, a well-known free docking tool. The implementation of REstretto has much room for further performance improvement, and therefore, the results show the feasibility of the strategy. The code is available under an MIT license at https://github.com/akiyamalab/restretto.
RESUMEN
Metagenomic analysis, a technique used to comprehensively analyze microorganisms present in the environment, requires performing high-precision homology searches on large amounts of sequencing data, the size of which has increased dramatically with the development of next-generation sequencing. NCBI BLAST is the most widely used software for performing homology searches, but its speed is insufficient for the throughput of current DNA sequencers. In this paper, we propose a new, high-performance homology search algorithm that employs a two-step seed search strategy using multiple reduced amino acid alphabets to identify highly similar subsequences. Additionally, we evaluated the validity of the proposed method against several existing tools. Our method was faster than any other existing program for ≤120,000 queries, while DIAMOND, an existing tool, was the fastest method for >120,000 queries.
Asunto(s)
Biología Computacional/métodos , Metagenómica/métodos , Homología de Secuencia de Aminoácido , Algoritmos , InternetRESUMEN
The need to accelerate large-scale protein-ligand docking in virtual screening against a huge compound database led researchers to propose a strategy that entails memorizing the evaluation result of the partial structure of a compound and reusing it to evaluate other compounds. However, the previous method required frequent disk accesses, resulting in insufficient acceleration. Thus, more efficient memory usage can be expected to lead to further acceleration, and optimal memory usage could be achieved by solving the minimum cost flow problem. In this research, we propose a fast algorithm for the minimum cost flow problem utilizing the characteristics of the graph generated for this problem as constraints. The proposed algorithm, which optimized memory usage, was approximately seven times faster compared to existing minimum cost flow algorithms.
Asunto(s)
Algoritmos , Simulación del Acoplamiento Molecular , Proteínas/química , Programas Informáticos , Ligandos , Estructura MolecularRESUMEN
We propose a new iterative screening contest method to identify target protein inhibitors. After conducting a compound screening contest in 2014, we report results acquired from a contest held in 2015 in this study. Our aims were to identify target enzyme inhibitors and to benchmark a variety of computer-aided drug discovery methods under identical experimental conditions. In both contests, we employed the tyrosine-protein kinase Yes as an example target protein. Participating groups virtually screened possible inhibitors from a library containing 2.4 million compounds. Compounds were ranked based on functional scores obtained using their respective methods, and the top 181 compounds from each group were selected. Our results from the 2015 contest show an improved hit rate when compared to results from the 2014 contest. In addition, we have successfully identified a statistically-warranted method for identifying target inhibitors. Quantitative analysis of the most successful method gave additional insights into important characteristics of the method used.
Asunto(s)
Descubrimiento de Drogas/métodos , Inhibidores Enzimáticos/farmacología , Ensayos Analíticos de Alto Rendimiento/métodos , Inhibidores de Proteínas Quinasas/farmacología , Proteínas Proto-Oncogénicas c-yes/antagonistas & inhibidores , Inhibidores Enzimáticos/química , Inhibidores Enzimáticos/metabolismo , Humanos , Aprendizaje Automático , Estructura Molecular , Unión Proteica , Inhibidores de Proteínas Quinasas/química , Inhibidores de Proteínas Quinasas/metabolismo , Proteínas Proto-Oncogénicas c-yes/metabolismo , Reproducibilidad de los Resultados , Relación Estructura-ActividadRESUMEN
A search of broader range of chemical space is important for drug discovery. Different methods of computer-aided drug discovery (CADD) are known to propose compounds in different chemical spaces as hit molecules for the same target protein. This study aimed at using multiple CADD methods through open innovation to achieve a level of hit molecule diversity that is not achievable with any particular single method. We held a compound proposal contest, in which multiple research groups participated and predicted inhibitors of tyrosine-protein kinase Yes. This showed whether collective knowledge based on individual approaches helped to obtain hit compounds from a broad range of chemical space and whether the contest-based approach was effective.