Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 22
Filter
1.
Nat Chem Biol ; 11(12): 958-66, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26479441

ABSTRACT

High-throughput screening (HTS) is an integral part of early drug discovery. Herein, we focused on those small molecules in a screening collection that have never shown biological activity despite having been exhaustively tested in HTS assays. These compounds are referred to as 'dark chemical matter' (DCM). We quantified DCM, validated it in quality control experiments, described its physicochemical properties and mapped it into chemical space. Through analysis of prospective reporter-gene assay, gene expression and yeast chemogenomics experiments, we evaluated the potential of DCM to show biological activity in future screens. We demonstrated that, despite the apparent lack of activity, occasionally these compounds can result in potent hits with unique activity and clean safety profiles, which makes them valuable starting points for lead optimization efforts. Among the identified DCM hits was a new antifungal chemotype with strong activity against the pathogen Cryptococcus neoformans but little activity at targets relevant to human safety.


Subject(s)
Antifungal Agents/pharmacology , Cryptococcus neoformans/drug effects , Drug Discovery , High-Throughput Screening Assays , Antifungal Agents/chemistry , Microbial Sensitivity Tests , Molecular Structure , Structure-Activity Relationship
2.
Nat Chem Biol ; 10(1): 76-84, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24292071

ABSTRACT

Sec14-like phosphatidylinositol transfer proteins (PITPs) integrate diverse territories of intracellular lipid metabolism with stimulated phosphatidylinositol-4-phosphate production and are discriminating portals for interrogating phosphoinositide signaling. Yet, neither Sec14-like PITPs nor PITPs in general have been exploited as targets for chemical inhibition for such purposes. Herein, we validate what is to our knowledge the first small-molecule inhibitors (SMIs) of the yeast PITP Sec14. These SMIs are nitrophenyl(4-(2-methoxyphenyl)piperazin-1-yl)methanones (NPPMs) and are effective inhibitors in vitro and in vivo. We further establish that Sec14 is the sole essential NPPM target in yeast and that NPPMs exhibit exquisite targeting specificities for Sec14 (relative to related Sec14-like PITPs), propose a mechanism for how NPPMs exert their inhibitory effects and demonstrate that NPPMs exhibit exquisite pathway selectivity in inhibiting phosphoinositide signaling in cells. These data deliver proof of concept that PITP-directed SMIs offer new and generally applicable avenues for intervening with phosphoinositide signaling pathways with selectivities superior to those afforded by contemporary lipid kinase-directed strategies.


Subject(s)
Phosphatidylinositols/metabolism , Phospholipid Transfer Proteins/metabolism , Signal Transduction , Protein Binding , Structure-Activity Relationship
3.
PLoS Comput Biol ; 9(10): e1003253, 2013.
Article in English | MEDLINE | ID: mdl-24098102

ABSTRACT

Mycobacterium tuberculosis, the causative agent of tuberculosis (TB), infects an estimated two billion people worldwide and is the leading cause of mortality due to infectious disease. The development of new anti-TB therapeutics is required, because of the emergence of multi-drug resistance strains as well as co-infection with other pathogens, especially HIV. Recently, the pharmaceutical company GlaxoSmithKline published the results of a high-throughput screen (HTS) of their two million compound library for anti-mycobacterial phenotypes. The screen revealed 776 compounds with significant activity against the M. tuberculosis H37Rv strain, including a subset of 177 prioritized compounds with high potency and low in vitro cytotoxicity. The next major challenge is the identification of the target proteins. Here, we use a computational approach that integrates historical bioassay data, chemical properties and structural comparisons of selected compounds to propose their potential targets in M. tuberculosis. We predicted 139 target--compound links, providing a necessary basis for further studies to characterize the mode of action of these compounds. The results from our analysis, including the predicted structural models, are available to the wider scientific community in the open source mode, to encourage further development of novel TB therapeutics.


Subject(s)
Antitubercular Agents/chemistry , Bacterial Proteins/chemistry , Computational Biology/methods , Drug Discovery/methods , Mycobacterium tuberculosis/chemistry , Amino Acid Sequence , Antitubercular Agents/metabolism , Bacterial Proteins/metabolism , Databases, Chemical , Molecular Docking Simulation , Molecular Sequence Data , Protein Conformation , Sequence Alignment
4.
Nat Chem Biol ; 7(12): 891-3, 2011 Nov 06.
Article in English | MEDLINE | ID: mdl-22057127

ABSTRACT

The DAF-9 cytochrome P450 is a key regulator of dauer formation, developmental timing and longevity in the nematode Caenorhabditis elegans. Here we describe the first identified chemical inhibitor of DAF-9 and the first reported small-molecule tool that robustly induces dauer formation in typical culture conditions. This molecule (called dafadine) also inhibits the mammalian ortholog of DAF-9(CYP27A1), suggesting that dafadine can be used to interrogate developmental control and longevity in other animals.


Subject(s)
Caenorhabditis elegans Proteins/antagonists & inhibitors , Caenorhabditis elegans/drug effects , Caenorhabditis elegans/growth & development , Cytochrome P-450 Enzyme Inhibitors , Enzyme Inhibitors/pharmacology , Isoxazoles/pharmacology , Longevity/drug effects , Piperidines/pharmacology , Pyridines/pharmacology , Animals , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/metabolism , Cytochrome P-450 Enzyme System/metabolism , Enzyme Activation/drug effects , Enzyme Inhibitors/chemistry , Isoxazoles/chemistry , Larva/drug effects , Molecular Structure , Piperidines/chemistry , Pyridines/chemistry , Stereoisomerism , Structure-Activity Relationship
5.
Nat Chem Biol ; 6(7): 549-57, 2010 Jul.
Article in English | MEDLINE | ID: mdl-20512140

ABSTRACT

The resistance of Caenorhabditis elegans to pharmacological perturbation limits its use as a screening tool for novel small bioactive molecules. One strategy to improve the hit rate of small-molecule screens is to preselect molecules that have an increased likelihood of reaching their target in the worm. To learn which structures evade the worm's defenses, we performed the first survey of the accumulation and metabolism of over 1,000 commercially available drug-like small molecules in the worm. We discovered that fewer than 10% of these molecules accumulate to concentrations greater than 50% of that present in the worm's environment. Using our dataset, we developed a structure-based accumulation model that identifies compounds with an increased likelihood of bioavailability and bioactivity, and we describe structural features that facilitate small-molecule accumulation in the worm. Preselecting molecules that are more likely to reach a target by first applying our model to the tens of millions of commercially available compounds will undoubtedly increase the success of future small-molecule screens with C. elegans.


Subject(s)
Caenorhabditis elegans/metabolism , Drug Evaluation, Preclinical/methods , Pharmaceutical Preparations/metabolism , Animals , Chromatography, High Pressure Liquid/methods , Models, Biological , Molecular Structure , Pharmaceutical Preparations/chemistry , Structure-Activity Relationship
6.
Nucleic Acids Res ; 38(13): e142, 2010 Jul.
Article in English | MEDLINE | ID: mdl-20460461

ABSTRACT

Next-generation sequencing has proven an extremely effective technology for molecular counting applications where the number of sequence reads provides a digital readout for RNA-seq, ChIP-seq, Tn-seq and other applications. The extremely large number of sequence reads that can be obtained per run permits the analysis of increasingly complex samples. For lower complexity samples, however, a point of diminishing returns is reached when the number of counts per sequence results in oversampling with no increase in data quality. A solution to making next-generation sequencing as efficient and affordable as possible involves assaying multiple samples in a single run. Here, we report the successful 96-plexing of complex pools of DNA barcoded yeast mutants and show that such 'Bar-seq' assessment of these samples is comparable with data provided by barcode microarrays, the current benchmark for this application. The cost reduction and increased throughput permitted by highly multiplexed sequencing will greatly expand the scope of chemogenomics assays and, equally importantly, the approach is suitable for other sequence counting applications that could benefit from massive parallelization.


Subject(s)
Sequence Analysis, DNA/methods , Mutation , Polymerase Chain Reaction , Saccharomyces cerevisiae/genetics
7.
Nat Chem Biol ; 4(8): 498-506, 2008 Aug.
Article in English | MEDLINE | ID: mdl-18622389

ABSTRACT

Bioactive compounds are widely used to modulate protein function and can serve as important leads for drug development. Identifying the in vivo targets of these compounds remains a challenge. Using yeast, we integrated three genome-wide gene-dosage assays to measure the effect of small molecules in vivo. A single TAG microarray was used to resolve the fitness of strains derived from pools of (i) homozygous deletion mutants, (ii) heterozygous deletion mutants and (iii) genomic library transformants. We demonstrated, with eight diverse reference compounds, that integration of these three chemogenomic profiles improves the sensitivity and specificity of small-molecule target identification. We further dissected the mechanism of action of two protein phosphatase inhibitors and in the process developed a framework for the rational design of multidrug combinations to sensitize cells with specific genotypes more effectively. Finally, we applied this platform to 188 novel synthetic chemical compounds and identified both potential targets and structure-activity relationships.


Subject(s)
Drug Design , Genome, Fungal , Genomics/methods , Organic Chemicals/pharmacology , Genes, Fungal/drug effects , Organic Chemicals/chemical synthesis , Structure-Activity Relationship , Yeasts
8.
Nucleic Acids Res ; 35(Web Server issue): W645-8, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17526519

ABSTRACT

The M-Coffee server is a web server that makes it possible to compute multiple sequence alignments (MSAs) by running several MSA methods and combining their output into one single model. This allows the user to simultaneously run all his methods of choice without having to arbitrarily choose one of them. The MSA is delivered along with a local estimation of its consistency with the individual MSAs it was derived from. The computation of the consensus multiple alignment is carried out using a special mode of the T-Coffee package [Notredame, Higgins and Heringa (T-Coffee: a novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 2000; 302: 205-217); Wallace, O'Sullivan, Higgins and Notredame (M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Res. 2006; 34: 1692-1699)] Given a set of sequences (DNA or proteins) in FASTA format, M-Coffee delivers a multiple alignment in the most common formats. M-Coffee is a freeware open source package distributed under a GPL license and it is available either as a standalone package or as a web service from www.tcoffee.org.


Subject(s)
Algorithms , Computational Biology/methods , Sequence Alignment/methods , Amino Acid Sequence , Computer Simulation , Information Storage and Retrieval , Internet , Molecular Sequence Data , Reproducibility of Results , Sequence Homology, Amino Acid , Software , User-Computer Interface
9.
Curr Opin Struct Biol ; 15(3): 261-6, 2005 Jun.
Article in English | MEDLINE | ID: mdl-15963889

ABSTRACT

Multiple sequence alignments are very widely used in all areas of DNA and protein sequence analysis. The main methods that are still in use are based on 'progressive alignment' and date from the mid to late 1980s. Recently, some dramatic improvements have been made to the methodology with respect either to speed and capacity to deal with large numbers of sequences or to accuracy. There have also been some practical advances concerning how to combine three-dimensional structural information with primary sequences to give more accurate alignments, when structures are available.


Subject(s)
Algorithms , DNA/chemistry , Models, Molecular , Proteins/chemistry , Sequence Alignment/methods , Sequence Analysis/methods , Amino Acid Sequence , Base Sequence , Computer Simulation , DNA/analysis , DNA/classification , Models, Chemical , Molecular Conformation , Molecular Sequence Data , Proteins/analysis , Proteins/classification , Sequence Alignment/trends , Sequence Analysis/trends , Sequence Homology , Software , Structure-Activity Relationship
10.
Nucleic Acids Res ; 34(6): 1692-9, 2006.
Article in English | MEDLINE | ID: mdl-16556910

ABSTRACT

We introduce M-Coffee, a meta-method for assembling multiple sequence alignments (MSA) by combining the output of several individual methods into one single MSA. M-Coffee is an extension of T-Coffee and uses consistency to estimate a consensus alignment. We show that the procedure is robust to variations in the choice of constituent methods and reasonably tolerant to duplicate MSAs. We also show that performances can be improved by carefully selecting the constituent methods. M-Coffee outperforms all the individual methods on three major reference datasets: HOMSTRAD, Prefab and Balibase. We also show that on a case-by-case basis, M-Coffee is twice as likely to deliver the best alignment than any individual method. Given a collection of pre-computed MSAs, M-Coffee has similar CPU requirements to the original T-Coffee. M-Coffee is a freeware open-source package available from http://www.tcoffee.org/.


Subject(s)
Algorithms , Sequence Alignment/methods , Reproducibility of Results , Software
11.
Drug Discov Today ; 23(1): 151-160, 2018 01.
Article in English | MEDLINE | ID: mdl-28917822

ABSTRACT

Increasing amounts of biological data are accumulating in the pharmaceutical industry and academic institutions. However, data does not equal actionable information, and guidelines for appropriate data capture, harmonization, integration, mining, and visualization need to be established to fully harness its potential. Here, we describe ongoing efforts at Merck & Co. to structure data in the area of chemogenomics. We are integrating complementary data from both internal and external data sources into one chemogenomics database (Chemical Genetic Interaction Enterprise; CHEMGENIE). Here, we demonstrate how this well-curated database facilitates compound set design, tool compound selection, target deconvolution in phenotypic screening, and predictive model building.


Subject(s)
Databases, Factual , Drug Discovery , Genomics , Models, Theoretical , Phenotype
12.
BMC Bioinformatics ; 8: 135, 2007 Apr 23.
Article in English | MEDLINE | ID: mdl-17451607

ABSTRACT

BACKGROUND: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments. RESULTS: We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids. CONCLUSION: This overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples we used in this paper, we have only used 2 or 3 classes for demonstration purposes but any number can be used and visualised.


Subject(s)
Amino Acid Sequence/genetics , Multivariate Analysis , Sequence Alignment/methods , Base Sequence/genetics , Sensitivity and Specificity , Sequence Alignment/statistics & numerical data , Sequence Analysis, Protein/methods , Sequence Analysis, Protein/statistics & numerical data , Software/statistics & numerical data
13.
ACS Cent Sci ; 2(10): 687-701, 2016 Oct 26.
Article in English | MEDLINE | ID: mdl-27800551

ABSTRACT

The development of new antimalarial compounds remains a pivotal part of the strategy for malaria elimination. Recent large-scale phenotypic screens have provided a wealth of potential starting points for hit-to-lead campaigns. One such public set is explored, employing an open source research mechanism in which all data and ideas were shared in real time, anyone was able to participate, and patents were not sought. One chemical subseries was found to exhibit oral activity but contained a labile ester that could not be replaced without loss of activity, and the original hit exhibited remarkable sensitivity to minor structural change. A second subseries displayed high potency, including activity within gametocyte and liver stage assays, but at the cost of low solubility. As an open source research project, unexplored avenues are clearly identified and may be explored further by the community; new findings may be cumulatively added to the present work.

14.
Science ; 344(6180): 208-11, 2014 Apr 11.
Article in English | MEDLINE | ID: mdl-24723613

ABSTRACT

Genome-wide characterization of the in vivo cellular response to perturbation is fundamental to understanding how cells survive stress. Identifying the proteins and pathways perturbed by small molecules affects biology and medicine by revealing the mechanisms of drug action. We used a yeast chemogenomics platform that quantifies the requirement for each gene for resistance to a compound in vivo to profile 3250 small molecules in a systematic and unbiased manner. We identified 317 compounds that specifically perturb the function of 121 genes and characterized the mechanism of specific compounds. Global analysis revealed that the cellular response to small molecules is limited and described by a network of 45 major chemogenomic signatures. Our results provide a resource for the discovery of functional interactions among genes, chemicals, and biological processes.


Subject(s)
Cells/drug effects , Drug Evaluation, Preclinical/methods , Drug Resistance/genetics , Gene Regulatory Networks , Genome-Wide Association Study/methods , Small Molecule Libraries/pharmacology , Cell Line, Tumor , Haploinsufficiency , Humans , Pharmacogenetics , Saccharomyces cerevisiae/drug effects , Saccharomyces cerevisiae/genetics
15.
Genome Biol ; 13(11): R105, 2012 Nov 18.
Article in English | MEDLINE | ID: mdl-23158586

ABSTRACT

Chemical biology, the interfacial discipline of using small molecules as probes to investigate biology, is a powerful approach of developing specific, rapidly acting tools that can be applied across organisms. The single-celled alga Chlamydomonas reinhardtii is an excellent model system because of its photosynthetic ability, cilia-related motility and simple genetics. We report the results of an automated fitness screen of 5,445 small molecules and subsequent assays on motility/phototaxis and photosynthesis. Cheminformatic analysis revealed active core structures and was used to construct a naïve Bayes model that successfully predicts algal bioactive compounds.


Subject(s)
Algal Proteins/metabolism , Chlamydomonas reinhardtii/drug effects , High-Throughput Screening Assays/methods , Small Molecule Libraries/pharmacology , Antipsychotic Agents/pharmacology , Bayes Theorem , Chlamydomonas reinhardtii/physiology , Genetic Fitness , Models, Biological , Phenotype
16.
Methods Mol Biol ; 781: 363-76, 2011.
Article in English | MEDLINE | ID: mdl-21877291

ABSTRACT

Cytoscape is an open-source software package that is widely used to integrate and visualize diverse data sets in biology. This chapter explains how to use Cytoscape to integrate open-source chemical information with a biological network. By visualizing information about known compound-target interactions in the context of a biological network of interest, one can rapidly identify novel avenues to perturb the system with compounds and, for example, potentially identify therapeutically relevant targets. Herein, two different protocols are explained in detail, with no prior knowledge of Cytoscape assumed, which demonstrate how to incorporate data from the ChEMBL database with either a gene-gene or a protein-protein interaction network. ChEMBL is a very large, open-source repository of compound-target information available from the European Molecular Biology Laboratory.


Subject(s)
Computational Biology/methods , Gene Regulatory Networks/drug effects , Protein Interaction Maps/drug effects , Software , Databases, Genetic , Databases, Protein , Molecular Targeted Therapy
17.
PLoS One ; 6(6): e20789, 2011.
Article in English | MEDLINE | ID: mdl-21698101

ABSTRACT

Highly selective, cell-permeable and fast-acting inhibitors of individual kinases are sought-after as tools for studying the cellular function of kinases in real time. A combination of small molecule synthesis and protein mutagenesis, identified a highly potent inhibitor (1-Isopropyl-3-(phenylethynyl)-1H-pyrazolo[3,4-d]pyrimidin-4-amine) of a rationally engineered Hog1 serine/threonine kinase (Hog1(T100G)). This inhibitor has been successfully used to study various aspects of Hog1 signaling, including a transient cell cycle arrest and gene expression changes mediated by Hog1 in response to stress. This study also underscores that the general applicability of this approach depends, in part, on the selectivity of the designed the inhibitor with respect to activity versus the engineered and wild type kinases. To explore this specificity in detail, we used a validated chemogenetic assay to assess the effect of this inhibitor on all gene products in yeast in parallel. The results from this screen emphasize the need for caution and for case-by-case assessment when using the Analog-Sensitive Kinase Allele technology to assess the physiological roles of kinases.


Subject(s)
Drug Design , Protein Kinase Inhibitors/chemistry , Protein Kinase Inhibitors/pharmacology , Cell Cycle , Gene Expression , Inhibitory Concentration 50 , Magnetic Resonance Spectroscopy , Protein Kinase Inhibitors/chemical synthesis
18.
Chem Biol ; 18(10): 1273-83, 2011 Oct 28.
Article in English | MEDLINE | ID: mdl-22035796

ABSTRACT

Preselection of compounds that are more likely to induce a phenotype can increase the efficiency and reduce the costs for model organism screening. To identify such molecules, we screened ~81,000 compounds in Saccharomyces cerevisiae and identified ~7500 that inhibit cell growth. Screening these growth-inhibitory molecules across a diverse panel of model organisms resulted in an increased phenotypic hit-rate. These data were used to build a model to predict compounds that inhibit yeast growth. Empirical and in silico application of the model enriched the discovery of bioactive compounds in diverse model organisms. To demonstrate the potential of these molecules as lead chemical probes, we used chemogenomic profiling in yeast and identified specific inhibitors of lanosterol synthase and of stearoyl-CoA 9-desaturase. As community resources, the ~7500 growth-inhibitory molecules have been made commercially available and the computational model and filter used are provided.


Subject(s)
Enzyme Inhibitors/chemistry , Saccharomyces cerevisiae/growth & development , Saccharomyces cerevisiae/metabolism , Small Molecule Libraries , Bacillus subtilis/drug effects , Bacillus subtilis/growth & development , Bayes Theorem , Benzofurans/chemistry , Benzofurans/metabolism , Benzofurans/pharmacology , Candida albicans/drug effects , Candida albicans/growth & development , Computer Simulation , Enzyme Inhibitors/pharmacology , Escherichia coli/drug effects , Escherichia coli/growth & development , Fatty Acid Desaturases/antagonists & inhibitors , Fatty Acid Desaturases/metabolism , HeLa Cells , Humans , Intramolecular Transferases/antagonists & inhibitors , Intramolecular Transferases/metabolism , Models, Biological , Phenotype , Piperazines/chemistry , Piperazines/metabolism , Piperazines/pharmacology , Saccharomyces cerevisiae/chemistry , Stearoyl-CoA Desaturase
19.
Science ; 327(5964): 425-31, 2010 Jan 22.
Article in English | MEDLINE | ID: mdl-20093466

ABSTRACT

A genome-scale genetic interaction map was constructed by examining 5.4 million gene-gene pairs for synthetic genetic interactions, generating quantitative genetic interaction profiles for approximately 75% of all genes in the budding yeast, Saccharomyces cerevisiae. A network based on genetic interaction profiles reveals a functional map of the cell in which genes of similar biological processes cluster together in coherent subsets, and highly correlated profiles delineate specific pathways to define gene function. The global network identifies functional cross-connections between all bioprocesses, mapping a cellular wiring diagram of pleiotropy. Genetic interaction degree correlated with a number of different gene attributes, which may be informative about genetic network hubs in other organisms. We also demonstrate that extensive and unbiased mapping of the genetic landscape provides a key for interpretation of chemical-genetic interactions and drug target identification.


Subject(s)
Gene Regulatory Networks , Genome, Fungal , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Computational Biology , Gene Duplication , Gene Expression Regulation, Fungal , Genes, Fungal , Genetic Fitness , Metabolic Networks and Pathways , Mutation , Protein Interaction Mapping , Saccharomyces cerevisiae/physiology , Saccharomyces cerevisiae Proteins/genetics
20.
Comput Biol Chem ; 32(4): 282-6, 2008 Aug.
Article in English | MEDLINE | ID: mdl-18450519

ABSTRACT

Most sequence clustering methods require a full distance matrix to be computed between all pairs of sequences. This requires computer memory and time proportional to N(2) for N sequences. For small N or say up to 10000 or so, this can be accomplished in reasonable times for sequences of moderate length. For very large N, however, this becomes increasingly prohibitive. In this paper, we have tested variations on a class of published embedding methods that have been designed for clustering large numbers of complex objects where the individual distance calculations are expensive. These methods involve embedding the sequences in a space where the similarities within a set of sequences can be closely approximated without having to compute all pair-wise distances. We show how this approach greatly reduces computation time and memory requirements for clustering large numbers of sequences and demonstrate the quality of the clusterings by benchmarking them as guide trees for multiple alignments. Source code is available on request from the authors.


Subject(s)
Algorithms , Databases, Protein , Proteins/chemistry , Sequence Alignment/methods , Amino Acid Sequence , Cluster Analysis , Molecular Sequence Data , Proteins/genetics , Sequence Homology, Amino Acid , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL