Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
J Chem Inf Model ; 64(12): 4687-4699, 2024 Jun 24.
Artículo en Inglés | MEDLINE | ID: mdl-38822782

RESUMEN

The design of compounds during hit-to-lead often seeks to explore a vector from a core scaffold to form additional interactions with the target protein. A rational approach to this is to probe the region of a protein accessed by a vector with a systematic placement of pharmacophore features in 3D, particularly when bound structures are not available. Herein, we present bbSelect, an open-source tool built to map the placements of pharmacophore features in 3D Euclidean space from a library of R-groups, employing partitioning to drive a diverse and systematic selection to a user-defined size. An evaluation of bbSelect against established methods exemplified the superiority of bbSelect in its ability to perform diverse selections, achieving high levels of pharmacophore feature placement coverage with selection sizes of a fraction of the total set and without the introduction of excess complexity. bbSelect also reports visualizations and rationale to enable users to understand and interrogate results. This provides a tool for the drug discovery community to guide their hit-to-lead activities.


Asunto(s)
Descubrimiento de Drogas , Programas Informáticos , Descubrimiento de Drogas/métodos , Modelos Moleculares , Diseño de Fármacos , Proteínas/química , Farmacóforo
2.
Pharm Stat ; 20(4): 898-915, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-33768736

RESUMEN

One of the main problems that the drug discovery research field confronts is to identify small molecules, modulators of protein function, which are likely to be therapeutically useful. Common practices rely on the screening of vast libraries of small molecules (often 1-2 million molecules) in order to identify a molecule, known as a lead molecule, which specifically inhibits or activates the protein function. To search for the lead molecule, we investigate the molecular structure, which generally consists of an extremely large number of fragments. Presence or absence of particular fragments, or groups of fragments, can strongly affect molecular properties. We study the relationship between molecular properties and its fragment composition by building a regression model, in which predictors, represented by binary variables indicating the presence or absence of fragments, are grouped in subsets and a bi-level penalization term is introduced for the high dimensionality of the problem. We evaluate the performance of this model in two simulation studies, comparing different penalization terms and different clustering techniques to derive the best predictor subsets structure. Both studies are characterized by small sets of data relative to the number of predictors under consideration. From the results of these simulation studies, we show that our approach can generate models able to identify key features and provide accurate predictions. The good performance of these models is then exhibited with real data about the MMP-12 enzyme.


Asunto(s)
Descubrimiento de Drogas , Análisis por Conglomerados , Simulación por Computador , Humanos
3.
J Cheminform ; 13(1): 13, 2021 Feb 22.
Artículo en Inglés | MEDLINE | ID: mdl-33618772

RESUMEN

Malaria is a disease affecting hundreds of millions of people across the world, mainly in developing countries and especially in sub-Saharan Africa. It is the cause of hundreds of thousands of deaths each year and there is an ever-present need to identify and develop effective new therapies to tackle the disease and overcome increasing drug resistance. Here, we extend a previous study in which a number of partners collaborated to develop a consensus in silico model that can be used to identify novel molecules that may have antimalarial properties. The performance of machine learning methods generally improves with the number of data points available for training. One practical challenge in building large training sets is that the data are often proprietary and cannot be straightforwardly integrated. Here, this was addressed by sharing QSAR models, each built on a private data set. We describe the development of an open-source software platform for creating such models, a comprehensive evaluation of methods to create a single consensus model and a web platform called MAIP available at https://www.ebi.ac.uk/chembl/maip/ . MAIP is freely available for the wider community to make large-scale predictions of potential malaria inhibiting compounds. This project also highlights some of the practical challenges in reproducing published computational methods and the opportunities that open-source software can offer to the community.

4.
J Med Chem ; 63(20): 11964-11971, 2020 10 22.
Artículo en Inglés | MEDLINE | ID: mdl-32955254

RESUMEN

Machine learning approaches promise to accelerate and improve success rates in medicinal chemistry programs by more effectively leveraging available data to guide a molecular design. A key step of an automated computational design algorithm is molecule generation, where the machine is required to design high-quality, drug-like molecules within the appropriate chemical space. Many algorithms have been proposed for molecular generation; however, a challenge is how to assess the validity of the resulting molecules. Here, we report three Turing-inspired tests designed to evaluate the performance of molecular generators. Profound differences were observed between the performance of molecule generators in these tests, highlighting the importance of selection of the appropriate design algorithms for specific circumstances. One molecule generator, based on match molecular pairs, performed excellently against all tests and thus provides a valuable component for machine-driven medicinal chemistry design workflows.


Asunto(s)
Algoritmos , Aprendizaje Automático , Química Farmacéutica , Diseño de Fármacos , Humanos , Estructura Molecular
5.
J Chem Inf Model ; 60(12): 5699-5713, 2020 12 28.
Artículo en Inglés | MEDLINE | ID: mdl-32659085

RESUMEN

Deep learning approaches have become popular in recent years in the field of de novo molecular design. While a variety of different methods are available, it is still a challenge to assess and compare their performance. A particularly promising approach for automated drug design is to use recurrent neural networks (RNNs) as SMILES generators and train them with the learning procedure called "transfer learning". This involves first training the initial model on a large generic data set of molecules to learn the general syntax of SMILES, followed by fine-tuning on a smaller set of molecules, coming from, e.g., a lead optimization program. To create a well-performing transfer learning application which can be automated, it is important to understand how the size of the second data set affects the training process. In addition, extensive postfiltering using similarity metrics of the molecules generated after transfer learning should be avoided, as it can introduce new biases toward the selection of drug candidates. Here, we present results from the application of a gated recurrent unit cell (GRU)-RNN to transfer learning on data sets of varying sizes and complexity. Analysis of the results has allowed us to provide some general guidelines for transfer learning. In particular, we show that data set sizes containing at least 190 molecules are needed for effective GRU-RNN-based molecular generation using transfer learning. The methods presented here should be applicable generally to the benchmarking of other deep learning methodologies for molecule generation.


Asunto(s)
Diseño de Fármacos , Redes Neurales de la Computación , Aprendizaje Automático
6.
J Comput Aided Mol Des ; 34(7): 767, 2020 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-31691917

RESUMEN

The original version of this article unfortunately contained some mistakes in the references.

7.
J Comput Aided Mol Des ; 34(7): 747-765, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-31637565

RESUMEN

This paper introduces BRADSHAW (Biological Response Analysis and Design System using an Heterogenous, Automated Workflow), a system for automated molecular design which integrates methods for chemical structure generation, experimental design, active learning and cheminformatics tools. The simple user interface is designed to facilitate access to large scale automated design whilst minimising software development required to introduce new algorithms, a critical requirement in what is a very fast moving field. The system embodies a philosophy of automation, best practice, experimental design and the use of both traditional cheminformatics and modern machine learning algorithms.


Asunto(s)
Diseño Asistido por Computadora , Diseño de Fármacos , Antagonistas del Receptor de Adenosina A2/química , Algoritmos , Quimioinformática/métodos , Quimioinformática/estadística & datos numéricos , Quimioinformática/tendencias , Diseño Asistido por Computadora/estadística & datos numéricos , Diseño Asistido por Computadora/tendencias , Aprendizaje Profundo , Descubrimiento de Drogas/métodos , Descubrimiento de Drogas/estadística & datos numéricos , Descubrimiento de Drogas/tendencias , Humanos , Aprendizaje Automático , Inhibidores de la Metaloproteinasa de la Matriz/química , Relación Estructura-Actividad Cuantitativa , Bibliotecas de Moléculas Pequeñas , Programas Informáticos , Interfaz Usuario-Computador , Flujo de Trabajo
8.
SLAS Discov ; 23(6): 532-545, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-29699447

RESUMEN

High-throughput screening (HTS) hits include compounds with undesirable properties. Many filters have been described to identify such hits. Notably, pan-assay interference compounds (PAINS) has been adopted by the community as the standard term to refer to such filters, and very useful guidelines have been adopted by the American Chemical Society (ACS) and subsequently triggered a healthy scientific debate about the pitfalls of draconian use of filters. Using an inhibitory frequency index, we have analyzed in detail the promiscuity profile of the whole GlaxoSmithKline (GSK) HTS collection comprising more than 2 million unique compounds that have been tested in hundreds of screening assays. We provide a comprehensive analysis of many previously published filters and newly described classes of nuisance structures that may serve as a useful source of empirical information to guide the design or growth of HTS collections and hit triaging strategies.


Asunto(s)
Descubrimiento de Drogas/métodos , Ensayos Analíticos de Alto Rendimiento/métodos , Bibliotecas de Moléculas Pequeñas/química , Bioensayo/métodos
9.
J Chem Inf Model ; 57(3): 445-453, 2017 03 27.
Artículo en Inglés | MEDLINE | ID: mdl-28257198

RESUMEN

The development of new antimalarial therapies is essential, and lowering the barrier of entry for the screening and discovery of new lead compound classes can spur drug development at organizations that may not have large compound screening libraries or resources to conduct high-throughput screens. Machine learning models have been long established to be more robust and have a larger domain of applicability with larger training sets. Screens over multiple data sets to find compounds with potential malaria blood stage inhibitory activity have been used to generate multiple Bayesian models. Here we describe a method by which Bayesian quantitative structure-activity relationship models, which contain information on thousands to millions of proprietary compounds, can be shared between collaborators at both for-profit and not-for-profit institutions. This model-sharing paradigm allows for the development of consensus models that have increased predictive power over any single model and yet does not reveal the identity of any compounds in the training sets.


Asunto(s)
Antimaláricos/farmacología , Aprendizaje Automático , Malaria/tratamiento farmacológico , Modelos Teóricos , Relación Estructura-Actividad Cuantitativa , Antimaláricos/uso terapéutico , Teorema de Bayes , Descubrimiento de Drogas , Malaria/sangre , Curva ROC , Temperatura
10.
Acta Crystallogr D Struct Biol ; 73(Pt 3): 279-285, 2017 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-28291763

RESUMEN

In this work, two freely available web-based interactive computational tools that facilitate the analysis and interpretation of protein-ligand interaction data are described. Firstly, WONKA, which assists in uncovering interesting and unusual features (for example residue motions) within ensembles of protein-ligand structures and enables the facile sharing of observations between scientists. Secondly, OOMMPPAA, which incorporates protein-ligand activity data with protein-ligand structural data using three-dimensional matched molecular pairs. OOMMPPAA highlights nuanced structure-activity relationships (SAR) and summarizes available protein-ligand activity data in the protein context. In this paper, the background that led to the development of both tools is described. Their implementation is outlined and their utility using in-house Structural Genomics Consortium (SGC) data sets and openly available data from the PDB and ChEMBL is described. Both tools are freely available to use and download at http://wonka.sgc.ox.ac.uk/WONKA/ and http://oommppaa.sgc.ox.ac.uk/OOMMPPAA/.


Asunto(s)
Diseño Asistido por Computadora , Diseño de Fármacos , Proteínas/metabolismo , Programas Informáticos , Sitios de Unión , Bases de Datos de Proteínas , Humanos , Ligandos , Simulación del Acoplamiento Molecular , Unión Proteica , Proteínas/química , Relación Estructura-Actividad
11.
J Comput Aided Mol Des ; 31(3): 249-253, 2017 03.
Artículo en Inglés | MEDLINE | ID: mdl-28070730

RESUMEN

The acronym "CADD" is often used interchangeably to refer to "Computer Aided Drug Discovery" and "Computer Aided Drug Design". While the former definition implies the use of a computer to impact one or more aspects of discovering a drug, in this paper we contend that computational chemists are most effective when they enable teams to apply true design principles as they strive to create medicines to treat human disease. We argue that teams must bring to bear multiple sub-disciplines of computational chemistry in an integrated manner in order to utilize these principles to address the multi-objective nature of the drug discovery problem. Impact, resourcing principles, and future directions for the field are also discussed, including areas of future opportunity as well as a cautionary note about hype and hubris.


Asunto(s)
Biología Computacional/métodos , Diseño Asistido por Computadora , Diseño de Fármacos , Modelos Moleculares , Estructura Molecular , Programas Informáticos , Relación Estructura-Actividad
12.
Drug Discov Today ; 21(10): 1719-1727, 2016 10.
Artículo en Inglés | MEDLINE | ID: mdl-27423371

RESUMEN

In an attempt to seek increased understanding of compound attributes that influence successful drug pipeline progression, GlaxoSmithKline's portfolio of oral candidates was compared with reference sets of marketed oral drugs. The approach differs from other attrition studies by explicitly focusing on choosing 'the right compound' by applying relevant, experimentally derived properties. The analysis led to four proposed compound quality categories, created by combining specific criteria for three measures: dose, solubility and the property forecast index, a composite measure of lipophilicity using chromatographically determined LogD and aromaticity. The 'three properties' provide benchmarked guidelines for project teams to use when seeking and selecting clinical candidates, because they reflect the property distribution of marketed oral drugs.


Asunto(s)
Descubrimiento de Drogas , Administración Oral , Animales , Humanos , Interacciones Hidrofóbicas e Hidrofílicas , Preparaciones Farmacéuticas/administración & dosificación , Preparaciones Farmacéuticas/química , Preparaciones Farmacéuticas/metabolismo , Solubilidad
13.
J Chem Inf Model ; 54(10): 2636-46, 2014 Oct 27.
Artículo en Inglés | MEDLINE | ID: mdl-25244105

RESUMEN

There is an ever increasing resource in terms of both structural information and activity data for many protein targets. In this paper we describe OOMMPPAA, a novel computational tool designed to inform compound design by combining such data. OOMMPPAA uses 3D matched molecular pairs to generate 3D ligand conformations. It then identifies pharmacophoric transformations between pairs of compounds and associates them with their relevant activity changes. OOMMPPAA presents this data in an interactive application providing the user with a visual summary of important interaction regions in the context of the binding site. We present validation of the tool using openly available data for CDK2 and a GlaxoSmithKline data set for a SAM-dependent methyl-transferase. We demonstrate OOMMPPAA's application in optimizing both potency and cell permeability and use OOMMPPAA to highlight nuanced and cross-series SAR. OOMMPPAA is freely available to download at http://oommppaa.sgc.ox.ac.uk/OOMMPPAA/ .


Asunto(s)
Quinasa 2 Dependiente de la Ciclina/antagonistas & inhibidores , Inhibidores Enzimáticos/química , Metiltransferasas/antagonistas & inhibidores , Bibliotecas de Moléculas Pequeñas/química , Programas Informáticos , Sitios de Unión , Quinasa 2 Dependiente de la Ciclina/química , Diseño de Fármacos , Inhibidores Enzimáticos/síntesis química , Humanos , Ligandos , Metiltransferasas/química , Simulación del Acoplamiento Molecular , Unión Proteica , Relación Estructura-Actividad Cuantitativa , S-Adenosilmetionina/química , Bibliotecas de Moléculas Pequeñas/síntesis química
16.
J Comput Aided Mol Des ; 27(4): 321-36, 2013 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-23615761

RESUMEN

We describe the QSAR Workbench, a system for the building and analysis of QSAR models. The system is built around the Pipeline Pilot workflow tool and provides access to a variety of model building algorithms for both continuous and categorical data. Traditionally models are built on a one by one basis and fully exploring the model space of algorithms and descriptor subsets is a time consuming basis. The QSAR Workbench provides a framework to allow for multiple models to be built over a number of modeling algorithms, descriptor combinations and data splits (training and test sets). Methods to analyze and compare models are provided, enabling the user to select the most appropriate model. The Workbench provides a consistent set of routines for data preparation and chemistry normalization that are also applied for predictions. The Workbench provides a large degree of automation with the ability to publish preconfigured model building workflows for a variety of problem domains, whilst providing experienced users full access to the underlying parameterization if required. Methods are provided to allow for publication of selected models as web services, thus providing integration with the chemistry desktop. We describe the design and implementation of the QSAR Workbench and demonstrate its utility through application to two public domain datasets.


Asunto(s)
Diseño de Fármacos , Modelos Biológicos , Relación Estructura-Actividad Cuantitativa , Algoritmos , Bases de Datos Farmacéuticas , Humanos , Flujo de Trabajo
18.
Drug Discov Today ; 16(17-18): 822-30, 2011 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-21704184

RESUMEN

Here, we review the performance of chromatographic hydrophobicity measurements in a data set of 100,000 GlaxoSmithKline compounds, demonstrating the advantages of the method over octanol-water partitioning and highlighting new insights for drug discovery. The value of chromatographic measurements, versus other hydrophobicity estimates, was supported by improved relationships with solubility, permeation, cytochrome P450s, intrinsic clearance, hERG binding and promiscuity. We also observed marked differentiation of the relative influence of intrinsic and effective hydrophobicity. The summing of hydrophobicity values plus aromatic ring count [logD(pH7.4) (or logP)+#Ar], indicated a wide relevance for simplistic 'property forecast indices' in developability assays, clearly enhanced by chromatographic values; therefore establishing new foundations for enriching property-based drug design.


Asunto(s)
Cromatografía/métodos , Técnicas de Laboratorio Clínico/métodos , Diseño de Fármacos , Hidrocarburos Aromáticos/química , Preparaciones Farmacéuticas/química , Humanos , Interacciones Hidrofóbicas e Hidrofílicas , Solubilidad
19.
Drug Discov Today ; 16(15-16): 646-53, 2011 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-21601652

RESUMEN

Drug toxicity is a major cause of late-stage product attrition. During lead identification and optimization phases little information is typically available about which molecules might have safety concerns. A system was built linking chemistry, preclinical and human safety information, enabling scientists to lever safety knowledge across multiple disciplines. The system consists of a data warehouse with chemical structures and chemical and biological properties for ∼80000 compounds and tools to access and analyze clinical data, toxicology, in vitro pharmacology and drug metabolism data. Tapping into this safety knowledge enables rapid clinically focused risk assessments of drug candidates. Use of this strategy adds value to the drug discovery process at GSK via efficient triage of compounds based on their potential for toxicity.


Asunto(s)
Diseño de Fármacos , Descubrimiento de Drogas/métodos , Industria Farmacéutica/métodos , Animales , Bases de Datos Factuales , Evaluación Preclínica de Medicamentos/métodos , Efectos Colaterales y Reacciones Adversas Relacionados con Medicamentos , Humanos , Preparaciones Farmacéuticas/química , Preparaciones Farmacéuticas/metabolismo , Medición de Riesgo/métodos , Toxicología/métodos
20.
Nat Rev Drug Discov ; 10(3): 188-95, 2011 03.
Artículo en Inglés | MEDLINE | ID: mdl-21358738

RESUMEN

High-throughput screening (HTS) has been postulated in several quarters to be a contributory factor to the decline in productivity in the pharmaceutical industry. Moreover, it has been blamed for stifling the creativity that drug discovery demands. In this article, we aim to dispel these myths and present the case for the use of HTS as part of a proven scientific tool kit, the wider use of which is essential for the discovery of new chemotypes.


Asunto(s)
Investigación Biomédica , Evaluación Preclínica de Medicamentos , Animales , Diseño de Fármacos , Evaluación Preclínica de Medicamentos/normas , Evaluación Preclínica de Medicamentos/estadística & datos numéricos , Humanos , Bibliotecas de Moléculas Pequeñas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA