RESUMO
Stimulator of interferon genes (STING) is a dimeric transmembrane adapter protein that plays a key role in the human innate immune response to infection and has been therapeutically exploited for its antitumor activity. The activation of STING requires its high-order oligomerization, which could be induced by binding of the endogenous ligand, cGAMP, to the cytosolic ligand-binding domain. Here we report the discovery through functional screens of a class of compounds, named NVS-STGs, that activate human STING. Our cryo-EM structures show that NVS-STG2 induces the high-order oligomerization of human STING by binding to a pocket between the transmembrane domains of the neighboring STING dimers, effectively acting as a molecular glue. Our functional assays showed that NVS-STG2 could elicit potent STING-mediated immune responses in cells and antitumor activities in animal models.
Assuntos
Proteínas Adaptadoras de Transdução de Sinal , Proteínas de Membrana , Animais , Humanos , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Bioensaio , Citosol , Imunidade Inata , Ligantes , Proteínas de Membrana/metabolismoRESUMO
Predicting compound activity in assays is a long-standing challenge in drug discovery. Computational models based on compound-induced gene expression signatures from a single profiling assay have shown promise toward predicting compound activity in other, seemingly unrelated, assays. Applications of such models include predicting mechanisms-of-action (MoA) for phenotypic hits, identifying off-target activities, and identifying polypharmacologies. Here, we introduce transcriptomics-to-activity transformer (TAT) models that leverage gene expression profiles observed over compound treatment at multiple concentrations to predict the compound activity in other biochemical or cellular assays. We built TAT models based on gene expression data from a RASL-seq assay to predict the activity of 2692 compounds in 262 dose-response assays. We obtained useful models for 51% of the assays, as determined through a realistic held-out set. Prospectively, we experimentally validated the activity predictions of a TAT model in a malaria inhibition assay. With a 63% hit rate, TAT successfully identified several submicromolar malaria inhibitors. Our results thus demonstrate the potential of transcriptomic responses over compound concentration and the TAT modeling framework as a cost-efficient way to identify the bioactivities of promising compounds across many assays.
Assuntos
Aprendizado Profundo , Malária , Humanos , Transcriptoma , Descoberta de Drogas/métodos , Perfilação da Expressão GênicaRESUMO
The identification of activating mutations in NOTCH1 in 50% of T cell acute lymphoblastic leukemia has generated interest in elucidating how these mutations contribute to oncogenic transformation and in targeting the pathway. A phenotypic screen identified compounds that interfere with trafficking of Notch and induce apoptosis via an endoplasmic reticulum (ER) stress mechanism. Target identification approaches revealed a role for SLC39A7 (ZIP7), a zinc transport family member, in governing Notch trafficking and signaling. Generation and sequencing of a compound-resistant cell line identified a V430E mutation in ZIP7 that confers transferable resistance to the compound NVS-ZP7-4. NVS-ZP7-4 altered zinc in the ER, and an analog of the compound photoaffinity labeled ZIP7 in cells, suggesting a direct interaction between the compound and ZIP7. NVS-ZP7-4 is the first reported chemical tool to probe the impact of modulating ER zinc levels and investigate ZIP7 as a novel druggable node in the Notch pathway.
Assuntos
Proteínas de Transporte de Cátions/genética , Estresse do Retículo Endoplasmático/fisiologia , Receptor Notch1/genética , Animais , Apoptose , Proteínas de Transporte/metabolismo , Proteínas de Transporte de Cátions/metabolismo , Proteínas de Transporte de Cátions/fisiologia , Linhagem Celular , Transformação Celular Neoplásica , Retículo Endoplasmático/fisiologia , Humanos , Mutação , Transporte Proteico , Receptor Notch1/fisiologia , Transdução de Sinais , Zinco/metabolismoRESUMO
In the 1950s, the drug thalidomide, administered as a sedative to pregnant women, led to the birth of thousands of children with multiple defects. Despite the teratogenicity of thalidomide and its derivatives lenalidomide and pomalidomide, these immunomodulatory drugs (IMiDs) recently emerged as effective treatments for multiple myeloma and 5q-deletion-associated dysplasia. IMiDs target the E3 ubiquitin ligase CUL4-RBX1-DDB1-CRBN (known as CRL4(CRBN)) and promote the ubiquitination of the IKAROS family transcription factors IKZF1 and IKZF3 by CRL4(CRBN). Here we present crystal structures of the DDB1-CRBN complex bound to thalidomide, lenalidomide and pomalidomide. The structure establishes that CRBN is a substrate receptor within CRL4(CRBN) and enantioselectively binds IMiDs. Using an unbiased screen, we identified the homeobox transcription factor MEIS2 as an endogenous substrate of CRL4(CRBN). Our studies suggest that IMiDs block endogenous substrates (MEIS2) from binding to CRL4(CRBN) while the ligase complex is recruiting IKZF1 or IKZF3 for degradation. This dual activity implies that small molecules can modulate an E3 ubiquitin ligase and thereby upregulate or downregulate the ubiquitination of proteins.
Assuntos
Peptídeo Hidrolases/química , Talidomida/química , Ubiquitina-Proteína Ligases/química , Proteínas Adaptadoras de Transdução de Sinal , Cristalografia por Raios X , Proteínas de Ligação a DNA/agonistas , Proteínas de Ligação a DNA/antagonistas & inibidores , Proteínas de Ligação a DNA/química , Proteínas de Ligação a DNA/metabolismo , Proteínas de Homeodomínio/metabolismo , Humanos , Lenalidomida , Modelos Moleculares , Complexos Multiproteicos/agonistas , Complexos Multiproteicos/antagonistas & inibidores , Complexos Multiproteicos/química , Complexos Multiproteicos/metabolismo , Peptídeo Hidrolases/metabolismo , Ligação Proteica , Relação Estrutura-Atividade , Especificidade por Substrato , Talidomida/análogos & derivados , Talidomida/metabolismo , Fatores de Transcrição/metabolismo , Ubiquitina-Proteína Ligases/antagonistas & inibidores , Ubiquitina-Proteína Ligases/metabolismoRESUMO
Perturbation of gene expression by means of synthetic small interfering RNAs (siRNAs) is a powerful way to uncover gene function. However, siRNA technology suffers from sequence-specific off-target effects and from limitations in knock-down efficiency. In this study, we assess a further problem: unintended effects of siRNA transfections on cellular fitness/proliferation. We show that the nucleotide compositions of siRNAs at specific positions have reproducible growth-restricting effects on mammalian cells in culture. This is likely distinct from hybridization-dependent off-target effects, since each nucleotide residue is seen to be acting independently and additively. The effect is robust and reproducible across different siRNA libraries and also across various cell lines, including human and mouse cells. Analyzing the growth inhibition patterns in correlation to the nucleotide sequence of the siRNAs allowed us to build a predictor that can estimate growth-restricting effects for any arbitrary siRNA sequence. Competition experiments with co-transfected siRNAs further suggest that the growth-restricting effects might be linked to an oversaturation of the cellular miRNA machinery, thus disrupting endogenous miRNA functions at large. We caution that competition between siRNA molecules could complicate the interpretation of double-knockdown or epistasis experiments, and potential interactions with endogenous miRNAs can be a factor when assaying cell growth or viability phenotypes.
Assuntos
Proliferação de Células/genética , MicroRNAs/genética , Hibridização de Ácido Nucleico , Interferência de RNA , RNA Interferente Pequeno/genética , Células A549 , Animais , Linhagem Celular , Sobrevivência Celular/genética , Células Cultivadas , Embrião de Mamíferos/citologia , Fibroblastos/citologia , Fibroblastos/metabolismo , Expressão Gênica , Células HeLa , Humanos , Camundongos , TransfecçãoRESUMO
Discovering the unintended 'off-targets' that predict adverse drug reactions is daunting by empirical methods alone. Drugs can act on several protein targets, some of which can be unrelated by conventional molecular metrics, and hundreds of proteins have been implicated in side effects. Here we use a computational strategy to predict the activity of 656 marketed drugs on 73 unintended 'side-effect' targets. Approximately half of the predictions were confirmed, either from proprietary databases unknown to the method or by new experimental assays. Affinities for these new off-targets ranged from 1 nM to 30 µM. To explore relevance, we developed an association metric to prioritize those new off-targets that explained side effects better than any known target of a given drug, creating a drug-target-adverse drug reaction network. Among these new associations was the prediction that the abdominal pain side effect of the synthetic oestrogen chlorotrianisene was mediated through its newly discovered inhibition of the enzyme cyclooxygenase-1. The clinical relevance of this inhibition was borne out in whole human blood platelet aggregation assays. This approach may have wide application to de-risking toxicological liabilities in drug discovery.
Assuntos
Avaliação Pré-Clínica de Medicamentos/métodos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Testes de Toxicidade/métodos , Plaquetas/efeitos dos fármacos , Clorotrianiseno/efeitos adversos , Clorotrianiseno/química , Clorotrianiseno/farmacologia , Ciclo-Oxigenase 1/metabolismo , Inibidores de Ciclo-Oxigenase/efeitos adversos , Inibidores de Ciclo-Oxigenase/farmacologia , Bases de Dados Factuais , Estrogênios não Esteroides/efeitos adversos , Estrogênios não Esteroides/farmacologia , Previsões , Humanos , Modelos Biológicos , Terapia de Alvo Molecular/efeitos adversos , Agregação Plaquetária/efeitos dos fármacos , Reprodutibilidade dos Testes , Especificidade por SubstratoRESUMO
Despite the usefulness of high-throughput screening (HTS) in drug discovery, for some systems, low assay throughput or high screening cost can prohibit the screening of large numbers of compounds. In such cases, iterative cycles of screening involving active learning (AL) are employed, creating the need for smaller "informer sets" that can be routinely screened to build predictive models for selecting compounds from the screening collection for follow-up screens. Here, we present a data-driven derivation of an informer compound set with improved predictivity of active compounds in HTS, and we validate its benefit over randomly selected training sets on 46 PubChem assays comprising at least 300,000 compounds and covering a wide range of assay biology. The informer compound set showed improvement in BEDROC(α = 100), PRAUC, and ROCAUC values averaged over all assays of 0.024, 0.014, and 0.016, respectively, compared to randomly selected training sets, all with paired t-test p-values <10(-15). A per-assay assessment showed that the BEDROC(α = 100), which is of particular relevance for early retrieval of actives, improved for 38 out of 46 assays, increasing the success rate of smaller follow-up screens. Overall, we showed that an informer set derived from historical HTS activity data can be employed for routine small-scale exploratory screening in an assay-agnostic fashion. This approach led to a consistent improvement in hit rates in follow-up screens without compromising scaffold retrieval. The informer set is adjustable in size depending on the number of compounds one intends to screen, as performance gains are realized for sets with more than 3,000 compounds, and this set is therefore applicable to a variety of situations. Finally, our results indicate that random sampling may not adequately cover descriptor space, drawing attention to the importance of the composition of the training set for predicting actives.
Assuntos
Avaliação Pré-Clínica de Medicamentos/métodos , Ensaios de Triagem em Larga Escala/métodos , Informática/métodos , Aprendizado de MáquinaRESUMO
Modern high-throughput screening (HTS) is a well-established approach for hit finding in drug discovery that is routinely employed in the pharmaceutical industry to screen more than a million compounds within a few weeks. However, as the industry shifts to more disease-relevant but more complex phenotypic screens, the focus has moved to piloting smaller but smarter chemically/biologically diverse subsets followed by an expansion around hit compounds. One standard method for doing this is to train a machine-learning (ML) model with the chemical fingerprints of the tested subset of molecules and then select the next compounds based on the predictions of this model. An alternative approach would be to take advantage of the wealth of bioactivity information contained in older (full-deck) screens using so-called HTS fingerprints, where each element of the fingerprint corresponds to the outcome of a particular assay, as input to machine-learning algorithms. We constructed HTS fingerprints using two collections of data: 93 in-house assays and 95 publicly available assays from PubChem. For each source, an additional set of 51 and 46 assays, respectively, was collected for testing. Three different ML methods, random forest (RF), logistic regression (LR), and naïve Bayes (NB), were investigated for both the HTS fingerprint and a chemical fingerprint, Morgan2. RF was found to be best suited for learning from HTS fingerprints yielding area under the receiver operating characteristic curve (AUC) values >0.8 for 78% of the internal assays and enrichment factors at 5% (EF(5%)) >10 for 55% of the assays. The RF(HTS-fp) generally outperformed the LR trained with Morgan2, which was the best ML method for the chemical fingerprint, for the majority of assays. In addition, HTS fingerprints were found to retrieve more diverse chemotypes. Combining the two models through heterogeneous classifier fusion led to a similar or better performance than the best individual model for all assays. Further validation using a pair of in-house assays and data from a confirmatory screen--including a prospective set of around 2000 compounds selected based on our approach--confirmed the good performance. Thus, the combination of machine-learning with HTS fingerprints and chemical fingerprints utilizes information from both domains and presents a very promising approach for hit expansion, leading to more hits. The source code used with the public data is provided.
Assuntos
Ensaios de Triagem em Larga Escala/métodos , Informática/métodos , Algoritmos , Inteligência Artificial , Teorema de Bayes , Modelos LogísticosRESUMO
We present a user-friendly molecular generative pipeline called Pocket Crafter, specifically designed to facilitate hit finding activity in the drug discovery process. This workflow utilized a three-dimensional (3D) generative modeling method Pocket2Mol, for the de novo design of molecules in spatial perspective for the targeted protein structures, followed by filters for chemical-physical properties and drug-likeness, structure-activity relationship analysis, and clustering to generate top virtual hit scaffolds. In our WDR5 case study, we acquired a focused set of 2029 compounds after a targeted searching within Novartis archived library based on the virtual scaffolds. Subsequently, we experimentally profiled these compounds, resulting in a novel chemical scaffold series that demonstrated activity in biochemical and biophysical assays. Pocket Crafter successfully prototyped an effective end-to-end 3D generative chemistry-based workflow for the exploration of new chemical scaffolds, which represents a promising approach in early drug discovery for hit identification.
RESUMO
Phenotypic assays have become an established approach to drug discovery. Greater disease relevance is often achieved through cellular models with increased complexity and more detailed readouts, such as gene expression or advanced imaging. However, the intricate nature and cost of these assays impose limitations on their screening capacity, often restricting screens to well-characterized small compound sets such as chemogenomics libraries. Here, we outline a cheminformatics approach to identify a small set of compounds with likely novel mechanisms of action (MoAs), expanding the MoA search space for throughput limited phenotypic assays. Our approach is based on mining existing large-scale, phenotypic high-throughput screening (HTS) data. It enables the identification of chemotypes that exhibit selectivity across multiple cell-based assays, which are characterized by persistent and broad structure activity relationships (SAR). We validate the effectiveness of our approach in broad cellular profiling assays (Cell Painting, DRUG-seq, and Promotor Signature Profiling) and chemical proteomics experiments. These experiments revealed that the compounds behave similarly to known chemogenetic libraries, but with a notable bias toward novel protein targets. To foster collaboration and advance research in this area, we have curated a public set of such compounds based on the PubChem BioAssay dataset and made it available for use by the scientific community.
Assuntos
Descoberta de Drogas , Ensaios de Triagem em Larga Escala , Bibliotecas de Moléculas Pequenas , Descoberta de Drogas/métodos , Ensaios de Triagem em Larga Escala/métodos , Quimioinformática/métodos , Bibliotecas de Moléculas Pequenas/química , Relação Estrutura-AtividadeRESUMO
Unbiased transcriptomic RNA-seq data has provided deep insights into biological processes. However, its impact in drug discovery has been narrow given high costs and low throughput. Proof-of-concept studies with Digital RNA with pertUrbation of Genes (DRUG)-seq demonstrated the potential to address this gap. We extended the DRUG-seq platform by subjecting it to rigorous testing and by adding an open-source analysis pipeline. The results demonstrate high reproducibility and ability to resolve the mechanism(s) of action for a diverse set of compounds. Furthermore, we demonstrate how this data can be incorporated into a drug discovery project aiming to develop therapeutics for schizophrenia using human stem cell-derived neurons. We identified both an on-target activation signature, induced by a set of chemically distinct positive allosteric modulators of the N-methyl-d-aspartate (NMDA) receptor, and independent off-target effects. Overall, the protocol and open-source analysis pipeline are a step toward industrializing RNA-seq for high-complexity transcriptomics studies performed at a saturating scale.
Assuntos
Descoberta de Drogas , Transcriptoma , Descoberta de Drogas/métodos , Humanos , RNA , Reprodutibilidade dos Testes , Análise de Sequência de RNA/métodosRESUMO
From a medicinal chemistry point of view, one of the primary goals of high throughput screening (HTS) hit list assessment is the identification of chemotypes with an informative structure-activity relationship (SAR). Such chemotypes may enable optimization of the primary potency, as well as selectivity and phamacokinetic properties. A common way to prioritize them is molecular clustering of the hits. Typical clustering techniques, however, rely on a general notion of chemical similarity or standard rules of scaffold decomposition and are thus insensitive to molecular features that are enriched in biologically active compounds. This hinders SAR analysis, because compounds sharing the same pharmacophore might not end up in the same cluster and thus are not directly compared to each other by the medicinal chemist. Similarly, common chemotypes that are not related to activity may contaminate clusters, distracting from important chemical motifs. We combined molecular similarity and Bayesian models and introduce (I) a robust, activity-aware clustering approach and (II) a feature mapping method for the elucidation of distinct SAR determinants in polypharmacologic compounds. We evaluated the method on 462 dose-response assays from the Pubchem Bioassay repository. Activity-aware clustering grouped compounds sharing molecular cores that were specific for the target or pathway at hand, rather than grouping inactive scaffolds commonly found in compound series. Many of these core structures we also found in literature that discussed SARs of the respective targets. A numerical comparison of cores allowed for identification of the structural prerequisites for polypharmacology, i.e., distinct bioactive regions within a single compound, and pointed toward selectivity-conferring medchem strategies. The method presented here is generally applicable to any type of activity data and may help bridge the gap between hit list assessment and designing a medchem strategy.
Assuntos
Desenho de Fármacos , Ensaios de Triagem em Larga Escala , Preparações Farmacêuticas/química , Relação Estrutura-Atividade , Teorema de Bayes , Análise por Conglomerados , Ensaios de Triagem em Larga Escala/métodos , FarmacologiaRESUMO
While molecules that promote the growth of animal cells have been identified, it remains unclear how such signals are orchestrated to determine a characteristic target size for different cell types. It is increasingly clear that cell size is determined by size checkpoints-mechanisms that restrict the cell cycle progression of cells that are smaller than their target size. Previously, we described a p38 MAPK-dependent cell size checkpoint mechanism whereby p38 is selectively activated and prevents cell cycle progression in cells that are smaller than a given target size. In this study, we show that the specific target size required for inactivation of p38 and transition through the cell cycle is determined by CDK4 activity. Our data suggest a model whereby p38 and CDK4 cooperate analogously to the function of a thermostat: while p38 senses irregularities in size, CDK4 corresponds to the thermostat dial that sets the target size.
Assuntos
Ciclo Celular/genética , Tamanho Celular , Quinase 4 Dependente de Ciclina/genética , Proteínas Quinases p38 Ativadas por Mitógeno/genética , Apoptose/genética , Pontos de Checagem do Ciclo Celular/genética , Homeostase/genética , Humanos , Sistema de Sinalização das MAP Quinases/genéticaRESUMO
Three limonoid natural products with selective anti-proliferative activity against BRAF(V600E) and NRAS(Q61K)-mutation-dependent melanoma cell lines were identified. Differential transcriptome analysis revealed dependency of compound activity on expression of the mitochondrial cytochrome P450 oxidase CYP27A1, a transcriptional target of melanogenesis-associated transcription factor (MITF). We determined that CYP27A1 activity is necessary for the generation of a reactive metabolite that proceeds to inhibit cellular proliferation. A genome-wide small interfering RNA screen in combination with chemical proteomics experiments revealed gene-drug functional epistasis, suggesting that these compounds target mitochondrial biogenesis and inhibit tumor bioenergetics through a covalent mechanism. Our work suggests a strategy for melanoma-specific targeting by exploiting the expression of MITF target gene CYP27A1 and inhibiting mitochondrial oxidative phosphorylation in BRAF mutant melanomas.
Assuntos
Colestanotriol 26-Mono-Oxigenase/metabolismo , Limoninas/farmacologia , Mitocôndrias/efeitos dos fármacos , Antineoplásicos/química , Antineoplásicos/metabolismo , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêutico , Produtos Biológicos/química , Produtos Biológicos/metabolismo , Produtos Biológicos/farmacologia , Produtos Biológicos/uso terapêutico , Linhagem Celular Tumoral , Proliferação de Células/efeitos dos fármacos , Colestanotriol 26-Mono-Oxigenase/antagonistas & inibidores , Colestanotriol 26-Mono-Oxigenase/genética , Humanos , Limoninas/química , Limoninas/metabolismo , Limoninas/uso terapêutico , Melanoma/tratamento farmacológico , Melanoma/patologia , Fator de Transcrição Associado à Microftalmia/genética , Fator de Transcrição Associado à Microftalmia/metabolismo , Mitocôndrias/metabolismo , Fosforilação Oxidativa/efeitos dos fármacos , Regiões Promotoras Genéticas , Ligação Proteica , Proteínas Proto-Oncogênicas B-raf/antagonistas & inibidores , Proteínas Proto-Oncogênicas B-raf/genética , Proteínas Proto-Oncogênicas B-raf/metabolismo , Interferência de RNA , RNA Interferente Pequeno/metabolismoRESUMO
High-content screening is transforming drug discovery by enabling simultaneous measurement of multiple features of cellular phenotype that are relevant to therapeutic and toxic activities of compounds. High-content screening studies typically generate immense datasets of image-based phenotypic information, and how best to mine relevant phenotypic data is an unsolved challenge. Here, we introduce factor analysis as a data-driven tool for defining cell phenotypes and profiling compound activities. This method allows a large data reduction while retaining relevant information, and the data-derived factors used to quantify phenotype have discernable biological meaning. We used factor analysis of cells stained with fluorescent markers of cell cycle state to profile a compound library and cluster the hits into seven phenotypic categories. We then compared phenotypic profiles, chemical similarity and predicted protein binding activities of active compounds. By integrating these different descriptors of measured and potential biological activity, we can effectively draw mechanism-of-action inferences.
Assuntos
Antineoplásicos , Biologia Computacional/métodos , Desenho de Fármacos , Bibliotecas de Moléculas Pequenas , Antineoplásicos/química , Antineoplásicos/farmacologia , Ciclo Celular/efeitos dos fármacos , Núcleo Celular/efeitos dos fármacos , Núcleo Celular/ultraestrutura , Proliferação de Células/efeitos dos fármacos , Análise por Conglomerados , Biologia Computacional/estatística & dados numéricos , Replicação do DNA/efeitos dos fármacos , Relação Dose-Resposta a Droga , Células HeLa , Humanos , Ligantes , Modelos Estatísticos , Estrutura Molecular , Valor Preditivo dos Testes , Ligação Proteica , Bibliotecas de Moléculas Pequenas/química , Bibliotecas de Moléculas Pequenas/farmacologia , Relação Estrutura-AtividadeRESUMO
Chemogenetic libraries, collections of well-defined chemical probes, provide tremendous value to biomedical research but require substantial effort to ensure diversity as well as quality of the contents. We have assembled a chemogenetic library by data mining and crowdsourcing institutional expertise. We are sharing our approach, lessons learned, and disclosing our current collection of 4,185 compounds with their primary annotated gene targets (https://github.com/Novartis/MoaBox). This physical collection is regularly updated and used broadly both within Novartis and in collaboration with external partners.
Assuntos
Sondas Moleculares/química , Bibliotecas de Moléculas Pequenas/química , Bioensaio , Bases de Dados de Compostos Químicos , Descoberta de Drogas , Humanos , Aprendizado de Máquina , Sondas Moleculares/metabolismo , Bibliotecas de Moléculas Pequenas/metabolismoRESUMO
Typically, screening collections of pharmaceutical companies contain more than a million compounds today. However, for certain high-throughput screening (HTS) campaigns, constraints posed by the assay throughput and/or the reagent costs make it impractical to screen the entire deck. Therefore, it is desirable to effectively screen subsets of the collection based on a hypothesis or a diversity selection. How to select compound subsets is a subject of ongoing debate. The authors present an approach based on extended connectivity fingerprints to carry out diversity selection on a per plate basis (instead of a per compound basis). HTS data from 35 Novartis screens spanning 5 target classes were investigated to assess the performance of this approach. The analysis shows that selecting a fingerprint-diverse subset of 250K compounds, representing 20% of the screening deck, would have achieved significantly higher hit rates for 86% of the screens. This measure also outperforms the Murcko scaffold-based plate selection described previously, where only 49% of the screens showed similar improvements. Strikingly, the 2-fold improvement in average hit rates observed for 3 of 5 target classes in the data set indicates a target bias of the plate (and thus compound) selection method. Even though the diverse subset selection lacks any target hypothesis, its application shows significantly better results for some targets-namely, G-protein-coupled receptors, proteases, and protein-protein interactions-but not for kinase and pathway screens. The synthetic origin of the compounds in the diverse subset appears to influence the screening hit rates. Natural products were the most diverse compound class, with significantly higher hit rates compared to the compounds from the traditional synthetic and combinatorial libraries. These results offer empirical guidelines for plate-based diversity selection to enhance hit rates, based on target class and the library type being screened.
Assuntos
Técnicas de Química Combinatória/instrumentação , Avaliação Pré-Clínica de Medicamentos/métodos , Preparações Farmacêuticas/análise , Preparações Farmacêuticas/químicaRESUMO
High-throughput screening (HTS) is a well-established hit-finding approach used in the pharmaceutical industry. In this article, recent experience at Novartis with respect to factors influencing the success of HTS campaigns is discussed. An inherent measure of HTS quality could be defined by the assay Z and Z' factors, the number of hits and their biological potencies; however, such measures of quality do not always correlate with the advancement of hits to the later stages of drug discovery. Also, for many target classes, such as kinases, it is easy to identify hits, but, as a result of selectivity, intellectual property and other issues, the projects do not result in lead declarations. In this article, HTS success is defined as the fraction of HTS campaigns that advance into the later stages of drug discovery, and the major influencing factors are examined. Interestingly, screening compounds in individual wells or in mixtures did not have a major impact on the HTS success and, equally interesting, there was no difference in the progression rates of biochemical and cell-based assays. Particular target types, assay technologies, structure-activity relationships and powder availability had a much greater impact on success as defined above. In addition, significant mutual dependencies can be observed - while one assay format works well with one target type, this situation might be completely reversed for a combination of the same readout technology with a different target type. The results and opinions presented here should be regarded as groundwork, and a plethora of factors that influence the fate of a project, such as biophysical measurements, chemical attractiveness of the hits, strategic reasons and safety pharmacology, are not covered here. Nonetheless, it is hoped that this information will be used industry-wide to improve success rates in terms of hits progressing into exploratory chemistry and beyond. The support that can be obtained from new in silico approaches to phase transitions are also described, along with the gaps they are designed to fill.