RESUMO
Small, colloidally aggregating molecules (SCAMs) are the most common source of false positives in high-throughput screening (HTS) campaigns. Although SCAMs can be experimentally detected and suppressed by the addition of detergent in the assay buffer, detergent sensitivity is not routinely monitored in HTS. Computational methods are thus needed to flag potential SCAMs during HTS triage. In this study, we have developed and rigorously validated quantitative structure-interference relationship (QSIR) models of detergent-sensitive aggregation in several HTS campaigns under various assay conditions and screening concentrations. In particular, we have modeled detergent-sensitive aggregation in an AmpC ß-lactamase assay, the preferred HTS counter-screen for aggregation, as well as in another assay that measures cruzain inhibition. Our models increase the accuracy of aggregation prediction by â¼53% in the ß-lactamase assay and by â¼46% in the cruzain assay compared to previously published methods. We also discuss the importance of both assay conditions and screening concentrations in the development of QSIR models for various interference mechanisms besides aggregation. The models developed in this study are publicly available for fast prediction within the SCAM detective web application (https://scamdetective.mml.unc.edu/).
Assuntos
Ensaios de Triagem em Larga EscalaRESUMO
Betulinic acid (BA), a pentacyclic triterpenoid, exhibits broad spectrum antiproliferative activity, but generally with only modest potency. To improve BA's pharmacological properties, fluorine was introduced as a single atom at C-2, creating two diastereomers, or in a trifluoromethyl group at C-3. We evaluated the impact of these groups on antiproliferative activity against five human tumor cell lines. A racemic 2-F-BA (compound 6) showed significantly improved antiproliferative activity, while each diastereomer exhibited similar effects. We also demonstrated that 2-F-BA is a topoisomerase (Topo) I and IIα dual inhibitor in cell-based and cell-free assays. A hypothetical mode of binding to the Topo I-DNA suggested a difference between the hydrogen bonding of BA and 2-F-BA to DNA, which may account for the difference in bioactivity against Topo I.
Assuntos
Triterpenos/química , Triterpenos/síntese química , Proliferação de Células , Humanos , Estrutura Molecular , Triterpenos Pentacíclicos , Ácido BetulínicoRESUMO
Elucidation of the mechanistic relationships between drugs, their targets, and diseases is at the core of modern drug discovery research. Thousands of studies relevant to the drug-target-disease (DTD) triangle have been published and annotated in the Medline/PubMed database. Mining this database affords rapid identification of all published studies that confirm connections between vertices of this triangle or enable new inferences of such connections. To this end, we describe the development of Chemotext, a publicly available Web server that mines the entire compendium of published literature in PubMed annotated by Medline Subject Heading (MeSH) terms. The goal of Chemotext is to identify all known DTD relationships and infer missing links between vertices of the DTD triangle. As a proof-of-concept, we show that Chemotext could be instrumental in generating new drug repurposing hypotheses or annotating clinical outcomes pathways for known drugs. The Chemotext Web server is freely available at http://chemotext.mml.unc.edu .
Assuntos
Mineração de Dados/métodos , Bases de Dados de Compostos Químicos , Sistemas de Liberação de Medicamentos , Tratamento Farmacológico , Internet , Medical Subject Headings , PubMed , Descoberta de Drogas , Humanos , Linguagens de Programação , Interface Usuário-ComputadorRESUMO
Multiple approaches to quantitative structure-activity relationship (QSAR) modeling using various statistical or machine learning techniques and different types of chemical descriptors have been developed over the years. Oftentimes models are used in consensus to make more accurate predictions at the expense of model interpretation. We propose a simple, fast, and reliable method termed Multi-Descriptor Read Across (MuDRA) for developing both accurate and interpretable models. The method is conceptually related to the well-known kNN approach but uses different types of chemical descriptors simultaneously for similarity assessment. To benchmark the new method, we have built MuDRA models for six different end points (Ames mutagenicity, aquatic toxicity, hepatotoxicity, hERG liability, skin sensitization, and endocrine disruption) and compared the results with those generated with conventional consensus QSAR modeling. We find that models built with MuDRA show consistently high external accuracy similar to that of conventional QSAR models. However, MuDRA models excel in terms of transparency, interpretability, and computational efficiency. We posit that due to its methodological simplicity and reliable predictive accuracy, MuDRA provides a powerful alternative to a much more complex consensus QSAR modeling. MuDRA is implemented and freely available at the Chembench web portal ( https://chembench.mml.unc.edu/mudra ).
Assuntos
Relação Quantitativa Estrutura-Atividade , Algoritmos , Bases de Dados Factuais , Humanos , Internet , Modelos Biológicos , Mutagênicos/toxicidade , Software , Testes de ToxicidadeRESUMO
The use of substructural alerts to identify Pan-Assay INterference compoundS (PAINS) has become a common component of the triage process in biological screening campaigns. These alerts, however, were originally derived from a proprietary library tested in just six assays measuring protein-protein interaction (PPI) inhibition using the AlphaScreen detection technology only; moreover, 68% (328 out of the 480 alerts) were derived from four or fewer compounds. In an effort to assess the reliability of these alerts as indicators of pan-assay interference, we performed a large-scale analysis of the impact of PAINS alerts on compound promiscuity in bioassays using publicly available data in PubChem. We found that the majority (97%) of all compounds containing PAINS alerts were actually infrequent hitters in AlphaScreen assays measuring PPI inhibition. We also found that the presence of PAINS alerts, contrary to expectations, did not reflect any heightened assay activity trends across all assays in PubChem including AlphaScreen, luciferase, beta-lactamase, or fluorescence-based assays. In addition, 109 PAINS alerts were present in 3570 extensively assayed, but consistently inactive compounds called Dark Chemical Matter. Finally, we observed that 87 small molecule FDA-approved drugs contained PAINS alerts and profiled their bioassay activity. Based on this detailed analysis of PAINS alerts in nonproprietary compound libraries, we caution against the blind use of PAINS filters to detect and triage compounds with possible PAINS liabilities and recommend that such conclusions should be drawn only by conducting orthogonal experiments.
Assuntos
Informática/métodos , Software , Gráficos por Computador , Descoberta de Drogas , Modelos Moleculares , Conformação Molecular , RotaçãoRESUMO
The enormous increase in the amount of publicly available chemical genomics data and the growing emphasis on data sharing and open science mandates that cheminformaticians also make their models publicly available for broad use by the scientific community. Chembench is one of the first publicly accessible, integrated cheminformatics Web portals. It has been extensively used by researchers from different fields for curation, visualization, analysis, and modeling of chemogenomics data. Since its launch in 2008, Chembench has been accessed more than 1 million times by more than 5000 users from a total of 98 countries. We report on the recent updates and improvements that increase the simplicity of use, computational efficiency, accuracy, and accessibility of a broad range of tools and services for computer-assisted drug design and computational toxicology available on Chembench. Chembench remains freely accessible at https://chembench.mml.unc.edu.
Assuntos
Informática/métodos , Internet , Interface Usuário-Computador , Linguagens de Programação , Relação Quantitativa Estrutura-AtividadeRESUMO
BACKGROUND: Approximately 10-15 % of gastrointestinal stromal tumors (GISTs) lack gain of function mutations in the KIT and platelet-derived growth factor receptor alpha (PDGFRA) genes. An alternate mechanism of oncogenesis through loss of function of the succinate-dehydrogenase (SDH) enzyme complex has been identified for a subset of these "wild type" GISTs. METHODS: Paired tumor and normal DNA from an SDH-intact wild-type GIST case was subjected to whole exome sequencing to identify the pathogenic mechanism(s) in this tumor. Selected findings were further investigated in panels of GIST tumors through Sanger DNA sequencing, quantitative real-time PCR, and immunohistochemical approaches. RESULTS: A hemizygous frameshift mutation (p.His2261Leufs*4), in the neurofibromin 1 (NF1) gene was identified in the patient's GIST; however, no germline NF1 mutation was found. A somatic frameshift mutation (p.Lys54Argfs*31) in the MYC associated factor X (MAX) gene was also identified. Immunohistochemical analysis for MAX on a large panel of GISTs identified loss of MAX expression in the MAX-mutated GIST and in a subset of mainly KIT-mutated tumors. CONCLUSION: This study suggests that inactivating NF1 mutations outside the context of neurofibromatosis may be the oncogenic mechanism for a subset of sporadic GIST. In addition, loss of function mutation of the MAX gene was identified for the first time in GIST, and a broader role for MAX in GIST progression was suggested.
Assuntos
Fatores de Transcrição de Zíper de Leucina Básica/genética , Tumores do Estroma Gastrointestinal/genética , Neurofibromina 1/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Exoma/genética , Feminino , Mutação da Fase de Leitura , Tumores do Estroma Gastrointestinal/patologia , Regulação Neoplásica da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Pessoa de Meia-Idade , Proteínas Proto-Oncogênicas c-kit/genética , Receptor alfa de Fator de Crescimento Derivado de Plaquetas/genética , Succinato Desidrogenase/genéticaRESUMO
Hits from high-throughput screening (HTS) of chemical libraries are often false positives due to their interference with assay detection technology. In response, we generated the largest publicly available library of chemical liabilities and developed "Liability Predictor," a free web tool to predict HTS artifacts. More specifically, we generated, curated, and integrated HTS data sets for thiol reactivity, redox activity, and luciferase (firefly and nano) activity and developed and validated quantitative structure-interference relationship (QSIR) models to predict these nuisance behaviors. The resulting models showed 58-78% external balanced accuracy for 256 external compounds per assay. QSIR models developed and validated herein identify nuisance compounds among experimental hits more reliably than do popular PAINS filters. Both the models and the curated data sets were implemented in "Liability Predictor," publicly available at https://liability.mml.unc.edu/. "Liability Predictor" may be used as part of chemical library design or for triaging HTS hits.
Assuntos
Artefatos , Ensaios de Triagem em Larga Escala , Ensaios de Triagem em Larga Escala/métodos , Bibliotecas de Moléculas Pequenas/químicaRESUMO
The conventional drug discovery pipeline has proven to be unsustainable for rare diseases. Herein, we discuss recent advances in biomedical knowledge mining applied to discovering therapeutics for rare diseases. We summarize current chemogenomics data of relevance to rare diseases and provide a perspective on the effectiveness of machine learning (ML) and biomedical knowledge graph mining in rare disease drug discovery. We illustrate the power of these methodologies using a chordoma case study. We expect that a broader application of knowledge graph mining and artificial intelligence (AI) approaches will expedite the discovery of viable drug candidates against both rare and common diseases.
Assuntos
Inteligência Artificial , Doenças Raras , Descoberta de Drogas/métodos , Humanos , Bases de Conhecimento , Aprendizado de Máquina , Doenças Raras/tratamento farmacológicoRESUMO
Here, we propose a broad concept of 'Clinical Outcome Pathways' (COPs), which are defined as a series of key molecular and cellular events that underlie therapeutic effects of drug molecules. We formalize COPs as a chain of the following events: molecular initiating event (MIE) â intermediate event(s) â clinical outcome. We illustrate the concept with COP examples both for primary and alternative (i.e., drug repurposing) therapeutic applications. We also describe the elucidation of COPs for several drugs of interest using the publicly accessible Reasoning Over Biomedical Objects linked in Knowledge-Oriented Pathways (ROBOKOP) biomedical knowledge graph-mining tool. We propose that broader use of COP uncovered with the help of biomedical knowledge graph mining will likely accelerate drug discovery and repurposing efforts.
Assuntos
Reposicionamento de Medicamentos , Bases de Conhecimento , Descoberta de Drogas , ConhecimentoRESUMO
BACKGROUND: Humans are exposed to tens of thousands of chemical substances that need to be assessed for their potential toxicity. Acute systemic toxicity testing serves as the basis for regulatory hazard classification, labeling, and risk management. However, it is cost- and time-prohibitive to evaluate all new and existing chemicals using traditional rodent acute toxicity tests. In silico models built using existing data facilitate rapid acute toxicity predictions without using animals. OBJECTIVES: The U.S. Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM) Acute Toxicity Workgroup organized an international collaboration to develop in silico models for predicting acute oral toxicity based on five different end points: Lethal Dose 50 (LD50 value, U.S. Environmental Protection Agency hazard (four) categories, Globally Harmonized System for Classification and Labeling hazard (five) categories, very toxic chemicals [LD50 (LD50≤50mg/kg)], and nontoxic chemicals (LD50>2,000mg/kg). METHODS: An acute oral toxicity data inventory for 11,992 chemicals was compiled, split into training and evaluation sets, and made available to 35 participating international research groups that submitted a total of 139 predictive models. Predictions that fell within the applicability domains of the submitted models were evaluated using external validation sets. These were then combined into consensus models to leverage strengths of individual approaches. RESULTS: The resulting consensus predictions, which leverage the collective strengths of each individual model, form the Collaborative Acute Toxicity Modeling Suite (CATMoS). CATMoS demonstrated high performance in terms of accuracy and robustness when compared with in vivo results. DISCUSSION: CATMoS is being evaluated by regulatory agencies for its utility and applicability as a potential replacement for in vivo rat acute oral toxicity studies. CATMoS predictions for more than 800,000 chemicals have been made available via the National Toxicology Program's Integrated Chemical Environment tools and data sets (ice.ntp.niehs.nih.gov). The models are also implemented in a free, standalone, open-source tool, OPERA, which allows predictions of new and untested chemicals to be made. https://doi.org/10.1289/EHP8495.
Assuntos
Órgãos Governamentais , Animais , Simulação por Computador , Ratos , Testes de Toxicidade Aguda , Estados Unidos , United States Environmental Protection AgencyRESUMO
Chordoma is a devastating rare cancer that affects one in a million people. With a mean-survival of just 6 years and no approved medicines, the primary treatments are surgery and radiation. In order to speed new medicines to chordoma patients, a drug repurposing strategy represents an attractive approach. Drugs that have already advanced through human clinical safety trials have the potential to be approved more quickly than de novo discovered medicines on new targets. We have taken two strategies to enable this: (1) generated and validated machine learning models of chordoma inhibition and screened compounds of interest in vitro. (2) Tested combinations of approved kinase inhibitors already being individually evaluated for chordoma. Several published studies of compounds screened against chordoma cell lines were used to generate Bayesian Machine learning models which were then used to score compounds selected from the NIH NCATS industry-provided assets. Out of these compounds, the mTOR inhibitor AZD2014, was the most potent against chordoma cell lines (IC50 0.35 µM U-CH1 and 0.61 µM U-CH2). Several studies have shown the importance of the mTOR signaling pathway in chordoma and suggest it as a promising avenue for targeted therapy. Additionally, two currently FDA approved drugs, afatinib and palbociclib (EGFR and CDK4/6 inhibitors, respectively) demonstrated synergy in vitro (CI50 = 0.43) while AZD2014 and afatanib also showed synergy (CI50 = 0.41) against a chordoma cell in vitro. These findings may be of interest clinically, and this in vitro- and in silico approach could also be applied to other rare cancers.
Assuntos
Protocolos de Quimioterapia Combinada Antineoplásica/farmacologia , Cordoma/tratamento farmacológico , Reposicionamento de Medicamentos , Aprendizado de Máquina , Protocolos de Quimioterapia Combinada Antineoplásica/química , Benzamidas/agonistas , Benzamidas/farmacologia , Linhagem Celular Tumoral , Cordoma/metabolismo , Cordoma/patologia , Humanos , Morfolinas/agonistas , Morfolinas/farmacologia , Proteínas de Neoplasias/antagonistas & inibidores , Proteínas de Neoplasias/metabolismo , Piperazinas/agonistas , Piperazinas/farmacologia , Piridinas/agonistas , Piridinas/farmacologia , Pirimidinas/agonistas , Pirimidinas/farmacologiaRESUMO
Doublecortin-like kinase 1 (DCLK1) is a serine/threonine kinase that is overexpressed in gastrointestinal cancers, including esophageal, gastric, colorectal, and pancreatic cancers. DCLK1 is also used as a marker of tuft cells, which regulate type II immunity in the gut. However, the substrates and functions of DCLK1 are understudied. We recently described the first selective DCLK1/2 inhibitor, DCLK1-IN-1, developed to aid the functional characterization of this important kinase. Here we describe the synthesis and structure-activity relationships of 5,11-dihydro-6H-benzo[e]pyrimido[5,4-b][1,4]diazepin-6-one DCLK1 inhibitors, resulting in the identification of DCLK1-IN-1.
Assuntos
Benzodiazepinas/química , Inibidores de Proteínas Quinases/química , Proteínas Serina-Treonina Quinases/antagonistas & inibidores , Pirimidinas/química , Benzodiazepinas/síntese química , Ensaios Enzimáticos , Estrutura Molecular , Inibidores de Proteínas Quinases/síntese química , Pirimidinas/síntese química , Relação Estrutura-AtividadeRESUMO
We describe SGC-GAK-1 (11), a potent, selective, and cell-active inhibitor of cyclin G-associated kinase (GAK), together with a structurally related negative control SGC-GAK-1N (14). 11 was highly selective in an in vitro kinome-wide screen, but cellular engagement assays defined RIPK2 as a collateral target. We identified 18 as a potent RIPK2 inhibitor lacking GAK activity. Together, this chemical probe set can be used to interrogate GAK cellular biology.
Assuntos
Ciclina G/metabolismo , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Sondas Moleculares/metabolismo , Proteínas Serina-Treonina Quinases/metabolismo , Células HEK293 , Humanos , MasculinoRESUMO
Although the cAMP-dependent protein kinase (PKA) is ubiquitously expressed, it is sequestered at specific subcellular locations throughout the cell, thereby resulting in compartmentalized cellular signaling that triggers site-specific behavioral phenotypes. We developed a three-step engineering strategy to construct an optogenetic PKA (optoPKA) and demonstrated that, upon illumination, optoPKA migrates to specified intracellular sites. Furthermore, we designed intracellular spatially segregated reporters of PKA activity and confirmed that optoPKA phosphorylates these reporters in a light-dependent fashion. Finally, proteomics experiments reveal that light activation of optoPKA results in the phosphorylation of known endogenous PKA substrates as well as potential novel substrates.
Assuntos
Proteínas Quinases Dependentes de AMP Cíclico/genética , Optogenética , Proteínas Quinases Dependentes de AMP Cíclico/metabolismo , Humanos , Membranas Mitocondriais/metabolismo , FosforilaçãoRESUMO
Drug repurposing approaches have the potential advantage of facilitating rapid and cost-effective development of new therapies. Particularly, the repurposing of drugs with known safety profiles in children could bypass or streamline toxicity studies. We employed a phenotypic screening paradigm on a panel of well-characterized cell lines derived from pediatric solid tumors against a collection of â¼3,800 compounds spanning approved drugs and investigational agents. Specifically, we employed titration-based screening where compounds were tested at multiple concentrations for their effect on cell viability. Molecular and cellular target enrichment analysis indicated that numerous agents across different therapeutic categories and modes of action had an antiproliferative effect, notably antiparasitic/protozoal drugs with non-classic antineoplastic activity. Focusing on active compounds with dosing and safety information in children according to the Children's Pharmacy Collaborative database, we identified compounds with therapeutic potential through further validation using 3D tumor spheroid models. Moreover, we show that antiparasitic agents induce cell death via apoptosis induction. This study demonstrates that our screening platform enables the identification of chemical agents with cytotoxic activity in pediatric cancer cell lines of which many have known safety/toxicity profiles in children. These agents constitute attractive candidates for efficacy studies in pre-clinical models of pediatric solid tumors.
RESUMO
The Ebola virus (EBOV) causes severe human infection that lacks effective treatment. A recent screen identified a series of compounds that block EBOV-like particle entry into human cells. Using data from this screen, quantitative structure-activity relationship models were built and employed for virtual screening of a â¼17 million compound library. Experimental testing of 102 hits yielded 14 compounds with IC50 values under 10 µM, including several sub-micromolar inhibitors, and more than 10-fold selectivity against host cytotoxicity. These confirmed hits include FDA-approved drugs and clinical candidates with non-antiviral indications, as well as compounds with novel scaffolds and no previously known bioactivity. Five selected hits inhibited BSL-4 live-EBOV infection in a dose-dependent manner, including vindesine (0.34 µM). Additional studies of these novel anti-EBOV compounds revealed their mechanisms of action, including the inhibition of NPC1 protein, cathepsin B/L, and lysosomal function. Compounds identified in this study are among the most potent and well-characterized anti-EBOV inhibitors reported to date.
Assuntos
Antivirais/farmacologia , Ebolavirus/efeitos dos fármacos , Bibliotecas de Moléculas Pequenas/farmacologia , Animais , Antivirais/química , Chlorocebus aethiops , Descoberta de Drogas , Avaliação Pré-Clínica de Medicamentos , Células HeLa , Humanos , Estrutura Molecular , Relação Quantitativa Estrutura-Atividade , Bibliotecas de Moléculas Pequenas/química , Células Vero , Internalização do Vírus/efeitos dos fármacosRESUMO
Enoxaparin is a low-molecular weight heparin used to treat thrombotic disorders. Following the fatal contamination of the heparin supply chain in 2007-2008, the U.S. Pharmacopeia (USP) and U.S. Food and Drug Administration (FDA) have worked extensively to modernize the unfractionated heparin and enoxaparin monographs. As a result, the determination of molecular weight (MW) has been added to the monograph as a measure to strengthen the quality testing and to increase the protection of the global supply of this life-saving drug. The current USP calibrant materials used for enoxaparin MW determination are composed of a mixture of oligosaccharides; however, they are difficult to reproduce as the calibrants have ill-defined structures due to the heterogeneity of the heparin parent material. To address this issue, we describe a promising approach consisting of a predictive computational model built from a library of chemoenzymatically synthesized heparin oligosaccharides for enoxaparin MW determination. Here, we demonstrate that this test can be performed with greater efficiency by coupling synthetic oligosaccharides with the power of computational modeling. Our approach is expected to improve the MW measurement for enoxaparin.
RESUMO
Protein kinases are highly tractable targets for drug discovery. However, the biological function and therapeutic potential of the majority of the 500+ human protein kinases remains unknown. We have developed physical and virtual collections of small molecule inhibitors, which we call chemogenomic sets, that are designed to inhibit the catalytic function of almost half the human protein kinases. In this manuscript we share our progress towards generation of a comprehensive kinase chemogenomic set (KCGS), release kinome profiling data of a large inhibitor set (Published Kinase Inhibitor Set 2 (PKIS2)), and outline a process through which the community can openly collaborate to create a KCGS that probes the full complement of human protein kinases.
Assuntos
Bases de Dados de Produtos Farmacêuticos , Inibidores de Proteínas Quinases/farmacologia , Bibliotecas de Moléculas Pequenas/farmacologia , Descoberta de Drogas/métodos , Genômica/métodos , Humanos , Inibidores de Proteínas Quinases/química , Bibliotecas de Moléculas Pequenas/química , Relação Estrutura-AtividadeRESUMO
Skin sensitization is a major environmental and occupational health hazard. Although many chemicals have been evaluated in humans, there have been no efforts to model these data to date. We have compiled, curated, analyzed, and compared the available human and LLNA data. Using these data, we have developed reliable computational models and applied them for virtual screening of chemical libraries to identify putative skin sensitizers. The overall concordance between murine LLNA and human skin sensitization responses for a set of 135 unique chemicals was low (R = 28-43%), although several chemical classes had high concordance. We have succeeded to develop predictive QSAR models of all available human data with the external correct classification rate of 71%. A consensus model integrating concordant QSAR predictions and LLNA results afforded a higher CCR of 82% but at the expense of the reduced external dataset coverage (52%). We used the developed QSAR models for virtual screening of CosIng database and identified 1061 putative skin sensitizers; for seventeen of these compounds, we found published evidence of their skin sensitization effects. Models reported herein provide more accurate alternative to LLNA testing for human skin sensitization assessment across diverse chemical data. In addition, they can also be used to guide the structural optimization of toxic compounds to reduce their skin sensitization potential.