Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
J Chem Inf Model ; 64(6): 1966-1974, 2024 03 25.
Article in English | MEDLINE | ID: mdl-38437714

ABSTRACT

Chemical diversity is challenging to describe objectively. Despite this, various notions of chemical diversity are used throughout the medicinal chemistry optimization process in drug discovery. In this work, we show the usefulness of considering exploited vectors during different phases of the drug design process to provide a quantitative and objective description of chemical diversity. We have developed a concise and fast approach to enumerate and analyze the exploited vector patterns (EVPs) of molecular compound series, which can then be used in archetypal compound selection tasks, from hit matter identification to hit expansion and lead optimization. We first show that EVPs can be used to assess the progressibility of compounds in a fragment library design exercise. By considering EVPs, we then show how a set of compounds can be prioritized for hit expansion using EVP-based, customizable diversity sampling approaches, reducing the time taken and mitigating human biases. We also show that EVPs are a useful tool to analyze SAR data, offering the chance to uncover correlations between different vectors without predetermining the molecular scaffold structures. The codes used to perform these tasks are presented as easy-to-use Jupyter notebooks, which can be readily adapted for further related tasks.


Subject(s)
Cheminformatics , Drug Discovery , Humans , Drug Design , Molecular Structure , Chemistry, Pharmaceutical
2.
J Chem Inf Model ; 54(7): 1864-79, 2014 Jul 28.
Article in English | MEDLINE | ID: mdl-24873983

ABSTRACT

Knowledge-based systems for toxicity prediction are typically based on rules, known as structural alerts, that describe relationships between structural features and different toxic effects. The identification of structural features associated with toxicological activity can be a time-consuming process and often requires significant input from domain experts. Here, we describe an emerging pattern mining method for the automated identification of activating structural features in toxicity data sets that is designed to help expedite the process of alert development. We apply the contrast pattern tree mining algorithm to generate a set of emerging patterns of structural fragment descriptors. Using the emerging patterns it is possible to form hierarchical clusters of compounds that are defined by the presence of common structural features and represent distinct chemical classes. The method has been tested on a large public in vitro mutagenicity data set and a public hERG channel inhibition data set and is shown to be effective at identifying common toxic features and recognizable classes of toxicants. We also describe how knowledge developers can use emerging patterns to improve the specificity and sensitivity of an existing expert system.


Subject(s)
Data Mining/methods , Toxicology , Algorithms , Endpoint Determination , Ether-A-Go-Go Potassium Channels/antagonists & inhibitors , Mutagenicity Tests , Potassium Channel Blockers/toxicity
3.
J Chem Inf Model ; 52(11): 3074-87, 2012 Nov 26.
Article in English | MEDLINE | ID: mdl-23092382

ABSTRACT

The design of new alerts, that is, collections of structural features observed to result in toxicological activity, can be a slow process and may require significant input from toxicology and chemistry experts. A method has therefore been developed to help automate alert identification by mining descriptions of activating structural features directly from toxicity data sets. The method is based on jumping emerging pattern mining which is applied to a set of toxic and nontoxic compounds that are represented using atom pair descriptors. Using the resulting jumping emerging patterns, it is possible to cluster toxic compounds into groups defined by the presence of shared structural features and to arrange the clusters into hierarchies. The methodology has been tested on a number of data sets for Ames mutagenicity, oestrogenicity, and hERG channel inhibition end points. These tests have shown the method to be effective at clustering the data sets around minimal jumping-emerging structural patterns and finding descriptions of potentially activating structural features. Furthermore, the mined structural features have been shown to be related to some of the known alerts for all three tested end points.


Subject(s)
Data Mining/methods , Estrogens/chemistry , Mutagens/chemistry , Pattern Recognition, Automated/methods , Cluster Analysis , Estrogens/toxicity , Ether-A-Go-Go Potassium Channels/antagonists & inhibitors , Humans , Mutagens/toxicity
SELECTION OF CITATIONS
SEARCH DETAIL