Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 158
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
J Comput Chem ; 2024 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-38725239

RESUMEN

In binding free energy calculations, simulations must sample all relevant conformations of the system in order to obtain unbiased results. For instance, different ligands can bind to different metastable states of a protein, and if these protein conformational changes are not sampled in relative binding free energy calculations, the contribution of these states to binding is not accounted for and thus calculated binding free energies are inaccurate. In this work, we investigate the impact of different beta-sectretase 1 (BACE1) protein conformations obtained from x-ray crystallography on the binding of BACE1 inhibitors. We highlight how these conformational changes are not adequately sampled in typical molecular dynamics simulations. Furthermore, we show that insufficient sampling of relevant conformations induces substantial error in relative binding free energy calculations, as judged by a variation in calculated relative binding free energies up to 2 kcal/mol depending on the starting protein conformation. These results emphasize the importance of protein conformational sampling and pose this BACE1 system as a challenge case for further method development in the area of enhanced protein conformational sampling, either in combination with binding calculations or as an endpoint correction.

2.
J Chem Inf Model ; 64(12): 4661-4672, 2024 Jun 24.
Artículo en Inglés | MEDLINE | ID: mdl-38860710

RESUMEN

DNA-encoded library technology grants access to nearly infinite opportunities to explore the chemical structure space for drug discovery. Successful navigation depends on the design and synthesis of libraries with appropriate physicochemical properties (PCPs) and structural diversity while aligning with practical considerations. To this end, we analyze combinatorial library design constraints including the number of chemistry cycles, bond construction strategies, and building block (BB) class selection in pursuit of ideal library designs. We compare two-cycle library designs (amino acid + carboxylic acid, primary amine + carboxylic acid) in the context of PCPs and chemical space coverage, given different BB selection strategies and constraints. We find that broad availability of amines and acids is essential for enabling the widest exploration of chemical space. Surprisingly, cost is not a driving factor, and virtually, the same chemical space can be explored with "budget" BBs.


Asunto(s)
ADN , Bibliotecas de Moléculas Pequeñas , ADN/química , Bibliotecas de Moléculas Pequeñas/química , Descubrimiento de Drogas/métodos , Técnicas Químicas Combinatorias , Diseño de Fármacos , Aminas/química , Ácidos Carboxílicos/química , Biblioteca de Genes
3.
J Chem Inf Model ; 2024 Jun 19.
Artículo en Inglés | MEDLINE | ID: mdl-38895959

RESUMEN

In drug discovery, the in silico prediction of binding affinity is one of the major means to prioritize compounds for synthesis. Alchemical relative binding free energy (RBFE) calculations based on molecular dynamics (MD) simulations are nowadays a popular approach for the accurate affinity ranking of compounds. MD simulations rely on empirical force field parameters, which strongly influence the accuracy of the predicted affinities. Here, we evaluate the ability of six different small-molecule force fields to predict experimental protein-ligand binding affinities in RBFE calculations on a set of 598 ligands and 22 protein targets. The public force fields OpenFF Parsley and Sage, GAFF, and CGenFF show comparable accuracy, while OPLS3e is significantly more accurate. However, a consensus approach using Sage, GAFF, and CGenFF leads to accuracy comparable to OPLS3e. While Parsley and Sage are performing comparably based on aggregated statistics across the whole dataset, there are differences in terms of outliers. Analysis of the force field reveals that improved parameters lead to significant improvement in the accuracy of affinity predictions on subsets of the dataset involving those parameters. Lower accuracy can not only be attributed to the force field parameters but is also dependent on input preparation and sampling convergence of the calculations. Especially large perturbations and nonconverged simulations lead to less accurate predictions. The input structures, Gromacs force field files, as well as the analysis Python notebooks are available on GitHub.

4.
Phys Chem Chem Phys ; 26(12): 9207-9225, 2024 Mar 20.
Artículo en Inglés | MEDLINE | ID: mdl-38444308

RESUMEN

We report the results of the SAMPL9 host-guest blind challenge for predicting binding free energies. The challenge focused on macrocycles from pillar[n]-arene and cyclodextrin host families, including WP6, and bCD and HbCD. A variety of methods were used by participants to submit binding free energy predictions. A machine learning approach based on molecular descriptors achieved the highest accuracy (RMSE of 2.04 kcal mol-1) among the ranked methods in the WP6 dataset. Interestingly, predictions for WP6 obtained via docking tended to outperform all methods (RMSE of 1.70 kcal mol-1), most of which are MD based and computationally more expensive. In general, methods applying force fields achieved better correlation with experiments for WP6 opposed to the machine learning and docking models. In the cyclodextrin-phenothiazine challenge, the ATM approach emerged as the top performing method with RMSE less than 1.86 kcal mol-1. Correlation metrics of ranked methods in this dataset were relatively poor compared to WP6. We also highlight several lessons learned to guide future work and help improve studies on the systems discussed. For example, WP6 may be present in other microstates other than its -12 state in the presence of certain guests. Machine learning approaches can be used to fine tune or help train force fields for certain chemistry (i.e. WP6-G4). Certain phenothiazines occupy distinct primary and secondary orientations, some of which were considered individually for accurate binding free energies. The accuracy of predictions from certain methods while starting from a single binding pose/orientation demonstrates the sensitivity of calculated binding free energies to the orientation, and in some cases the likely dominant orientation for the system. Computational and experimental results suggest that guest phenothiazine core traverses both the secondary and primary faces of the cyclodextrin hosts, a bulky cationic side chain will primarily occupy the primary face, and the phenothiazine core substituent resides at the larger secondary face.

5.
Nat Methods ; 17(3): 311-318, 2020 03.
Artículo en Inglés | MEDLINE | ID: mdl-32015544

RESUMEN

Tissues and organs are composed of diverse cell types, which poses a major challenge for cell-type-specific profiling of gene expression. Current metabolic labeling methods rely on exogenous pyrimidine analogs that are only incorporated into RNA in cells expressing an exogenous enzyme. This approach assumes that off-target cells cannot incorporate these analogs. We disprove this assumption and identify and characterize the enzymatic pathways responsible for high background incorporation. We demonstrate that mammalian cells can incorporate uracil analogs and characterize the enzymatic pathways responsible for high background incorporation. To overcome these limitations, we developed a new small molecule-enzyme pair consisting of uridine/cytidine kinase 2 and 2'-azidouridine. We demonstrate that 2'-azidouridine is only incorporated in cells expressing uridine/cytidine kinase 2 and characterize selectivity mechanisms using molecular dynamics and X-ray crystallography. Furthermore, this pair can be used to purify and track RNA from specific cellular populations, making it ideal for high-resolution cell-specific RNA labeling. Overall, these results reveal new aspects of mammalian salvage pathways and serve as a new benchmark for designing, characterizing and evaluating methodologies for cell-specific labeling of biomolecules.


Asunto(s)
ARN/química , Uracilo/química , Animales , Azidas/química , Biotinilación , Dominio Catalítico , Técnicas de Cocultivo , Desoxiuridina/análogos & derivados , Desoxiuridina/química , Células HEK293 , Células HeLa , Humanos , Cinética , Ratones , Simulación de Dinámica Molecular , Mutagénesis Sitio-Dirigida , Células 3T3 NIH , Nucleósido-Fosfato Quinasa/metabolismo , Dominios Proteicos , ARN Interferente Pequeño/genética , Uridina/química , Uridina Quinasa/metabolismo
6.
J Chem Inf Model ; 63(6): 1776-1793, 2023 03 27.
Artículo en Inglés | MEDLINE | ID: mdl-36878475

RESUMEN

Drug discovery is accelerated with computational methods such as alchemical simulations to estimate ligand affinities. In particular, relative binding free energy (RBFE) simulations are beneficial for lead optimization. To use RBFE simulations to compare prospective ligands in silico, researchers first plan the simulation experiment, using graphs where nodes represent ligands and graph edges represent alchemical transformations between ligands. Recent work demonstrated that optimizing the statistical architecture of these perturbation graphs improves the accuracy of the predicted changes in the free energy of ligand binding. Therefore, to improve the success rate of computational drug discovery, we present the open-source software package High Information Mapper (HiMap)─a new take on its predecessor, Lead Optimization Mapper (LOMAP). HiMap removes heuristics decisions from design selection and instead finds statistically optimal graphs over ligands clustered with machine learning. Beyond optimal design generation, we present theoretical insights for designing alchemical perturbation maps. Some of these results include that for n number of nodes, the precision of perturbation maps is stable at n·ln(n) edges. This result indicates that even an "optimal" graph can result in unexpectedly high errors if a plan includes too few alchemical transformations for the given number of ligands and edges. And, as a study compares more ligands, the performance of even optimal graphs will deteriorate with linear scaling of the edge count. In this sense, ensuring an A- or D-optimal topology is not enough to produce robust errors. We additionally find that optimal designs will converge more rapidly than radial and LOMAP designs. Moreover, we derive bounds for how clustering reduces cost for designs with a constant expected relative error per cluster, invariant of the size of the design. These results inform how to best design perturbation maps for computational drug discovery and have broader implications for experimental design.


Asunto(s)
Simulación de Dinámica Molecular , Termodinámica , Ligandos , Estudios Prospectivos , Entropía , Unión Proteica
7.
J Chem Inf Model ; 63(16): 5120-5132, 2023 08 28.
Artículo en Inglés | MEDLINE | ID: mdl-37578123

RESUMEN

DNA-encoded libraries (DELs) provide the means to make and screen millions of diverse compounds against a target of interest in a single experiment. However, despite producing large volumes of binding data at a relatively low cost, the DEL selection process is susceptible to noise, necessitating computational follow-up to increase signal-to-noise ratios. In this work, we present a set of informatics tools to employ data from prior DEL screen(s) to gain information about which building blocks are most likely to be productive when designing new DELs for the same target. We demonstrate that similar building blocks have similar probabilities of forming compounds that bind. We then build a model from the inference that the combined behavior of individual building blocks is predictive of whether an overall compound binds. We illustrate our approach on a set of three-cycle OpenDEL libraries screened against soluble epoxide hydrolase (sEH) and report performance of more than an order of magnitude greater than random guessing on a holdout set, demonstrating that our model can serve as a baseline for comparison against other machine learning models on DEL data. Lastly, we provide a discussion on how we believe this informatics workflow could be applied to benefit researchers in their specific DEL campaigns.


Asunto(s)
Descubrimiento de Drogas , Bibliotecas de Moléculas Pequeñas , Bibliotecas de Moléculas Pequeñas/química , ADN/química , Aprendizaje Automático
8.
J Chem Inf Model ; 62(5): 1172-1177, 2022 03 14.
Artículo en Inglés | MEDLINE | ID: mdl-35191702

RESUMEN

Nowadays, drug design projects benefit from highly accurate protein-ligand binding free energy predictions based on molecular dynamics simulations. While such calculations have been computationally expensive in the past, we now demonstrate that workflows built on open source software packages can efficiently leverage pre-exascale computing resources to screen hundreds of compounds in a matter of days. We report our results of free energy calculations on a large set of pharmaceutically relevant targets assembled to reflect industrial drug discovery projects.


Asunto(s)
Diseño de Fármacos , Simulación de Dinámica Molecular , Ligandos , Unión Proteica , Programas Informáticos , Termodinámica
9.
J Chem Inf Model ; 62(22): 5622-5633, 2022 11 28.
Artículo en Inglés | MEDLINE | ID: mdl-36351167

RESUMEN

The development of accurate transferable force fields is key to realizing the full potential of atomistic modeling in the study of biological processes such as protein-ligand binding for drug discovery. State-of-the-art transferable force fields, such as those produced by the Open Force Field Initiative, use modern software engineering and automation techniques to yield accuracy improvements. However, force field torsion parameters, which must account for many stereoelectronic and steric effects, are considered to be less transferable than other force field parameters and are therefore often targets for bespoke parametrization. Here, we present the Open Force Field QCSubmit and BespokeFit software packages that, when combined, facilitate the fitting of torsion parameters to quantum mechanical reference data at scale. We demonstrate the use of QCSubmit for simplifying the process of creating and archiving large numbers of quantum chemical calculations, by generating a dataset of 671 torsion scans for druglike fragments. We use BespokeFit to derive individual torsion parameters for each of these molecules, thereby reducing the root-mean-square error in the potential energy surface from 1.1 kcal/mol, using the original transferable force field, to 0.4 kcal/mol using the bespoke version. Furthermore, we employ the bespoke force fields to compute the relative binding free energies of a congeneric series of inhibitors of the TYK2 protein, and demonstrate further improvements in accuracy, compared to the base force field (MUE reduced from 0.560.390.77 to 0.420.280.59 kcal/mol and R2 correlation improved from 0.720.350.87 to 0.930.840.97).


Asunto(s)
Proteínas , Programas Informáticos , Ligandos , Proteínas/química , Entropía , Unión Proteica
10.
J Chem Inf Model ; 62(23): 6094-6104, 2022 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-36433835

RESUMEN

Force fields form the basis for classical molecular simulations, and their accuracy is crucial for the quality of, for instance, protein-ligand binding simulations in drug discovery. The huge diversity of small-molecule chemistry makes it a challenge to build and parameterize a suitable force field. The Open Force Field Initiative is a combined industry and academic consortium developing a state-of-the-art small-molecule force field. In this report, industry members of the consortium worked together to objectively evaluate the performance of the force fields (referred to here as OpenFF) produced by the initiative on a combined public and proprietary dataset of 19,653 relevant molecules selected from their internal research and compound collections. This evaluation was important because it was completely blind; at most partners, none of the molecules or data were used in force field development or testing prior to this work. We compare the Open Force Field "Sage" version 2.0.0 and "Parsley" version 1.3.0 with GAFF-2.11-AM1BCC, OPLS4, and SMIRNOFF99Frosst. We analyzed force-field-optimized geometries and conformer energies compared to reference quantum mechanical data. We show that OPLS4 performs best, and the latest Open Force Field release shows a clear improvement compared to its predecessors. The performance of established force fields such as GAFF-2.11 was generally worse. While OpenFF researchers were involved in building the benchmarking infrastructure used in this work, benchmarking was done entirely in-house within industrial organizations and the resulting assessment is reported here. This work assesses the force field performance using separate benchmarking steps, external datasets, and involving external research groups. This effort may also be unique in terms of the number of different industrial partners involved, with 10 different companies participating in the benchmark efforts.


Asunto(s)
Proteínas , Termodinámica , Ligandos , Proteínas/química , Fenómenos Físicos
11.
J Comput Aided Mol Des ; 36(10): 707-734, 2022 10.
Artículo en Inglés | MEDLINE | ID: mdl-36229622

RESUMEN

The SAMPL series of challenges aim to focus the community on specific modeling challenges, while testing and hopefully driving progress of computational methods to help guide pharmaceutical drug discovery. In this study, we report on the results of the SAMPL8 host-guest blind challenge for predicting absolute binding affinities. SAMPL8 focused on two host-guest datasets, one involving the cucurbituril CB8 (with a series of common drugs of abuse) and another involving two different Gibb deep-cavity cavitands. The latter dataset involved a previously featured deep cavity cavitand (TEMOA) as well as a new variant (TEETOA), both binding to a series of relatively rigid fragment-like guests. Challenge participants employed a reasonably wide variety of methods, though many of these were based on molecular simulations, and predictive accuracy was mixed. As in some previous SAMPL iterations (SAMPL6 and SAMPL7), we found that one approach to achieve greater accuracy was to apply empirical corrections to the binding free energy predictions, taking advantage of prior data on binding to these hosts. Another approach which performed well was a hybrid MD-based approach with reweighting to a force matched QM potential. In the cavitand challenge, an alchemical method using the AMOEBA-polarizable force field achieved the best success with RMSE less than 1 kcal/mol, while another alchemical approach (ATM/GAFF2-AM1BCC/TIP3P/HREM) had RMSE less than 1.75 kcal/mol. The work discussed here also highlights several important lessons; for example, retrospective studies of reference calculations demonstrate the sensitivity of predicted binding free energies to ethyl group sampling and/or guest starting pose, providing guidance to help improve future studies on these systems.


Asunto(s)
Simulación de Dinámica Molecular , Proteínas , Humanos , Ligandos , Termodinámica , Unión Proteica , Proteínas/química , Estudios Retrospectivos , Preparaciones Farmacéuticas
12.
J Comput Aided Mol Des ; 36(10): 767-779, 2022 10.
Artículo en Inglés | MEDLINE | ID: mdl-36198874

RESUMEN

Water plays an important role in mediating protein-ligand interactions. Water rearrangement upon a ligand binding or modification can be very slow and beyond typical timescales used in molecular dynamics (MD) simulations. Thus, inadequate sampling of slow water motions in MD simulations often impairs the accuracy of the accuracy of ligand binding free energy calculations. Previous studies suggest grand canonical Monte Carlo (GCMC) outperforms normal MD simulations for water sampling, thus GCMC has been applied to help improve the accuracy of ligand binding free energy calculations. However, in prior work we observed protein and/or ligand motions impaired how well GCMC performs at water rehydration, suggesting more work is needed to improve this method to handle water sampling. In this work, we applied GCMC in 21 protein-ligand systems to assess the performance of GCMC for rehydrating buried water sites. While our results show that GCMC can rapidly rehydrate all selected water sites for most systems, it fails in five systems. In most failed systems, we observe protein/ligand motions, which occur in the absence of water, combine to close water sites and block instantaneous GCMC water insertion moves. For these five failed systems, we both extended our GCMC simulations and tested a new technique named grand canonical nonequilibrium candidate Monte Carlo (GCNCMC). GCNCMC combines GCMC with the nonequilibrium candidate Monte Carlo (NCMC) sampling technique to improve the probability of a successful water insertion/deletion. Our results show that GCNCMC and extended GCMC can rehydrate all target water sites for three of the five problematic systems and GCNCMC is more efficient than GCMC in two out of the three systems. In one system, only GCNCMC can rehydrate all target water sites, while GCMC fails. Both GCNCMC and GCMC fail in one system. This work suggests this new GCNCMC method is promising for water rehydration especially when protein/ligand motions may block water insertion/removal.


Asunto(s)
Simulación de Dinámica Molecular , Agua , Agua/química , Ligandos , Método de Montecarlo , Proteínas , Fluidoterapia
13.
J Comput Aided Mol Des ; 36(4): 291-311, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35426591

RESUMEN

A novel crystallographic fragment screening data set was generated and used in the SAMPL7 challenge for protein-ligands. The SAMPL challenges prospectively assess the predictive power of methods involved in computer-aided drug design. Application of various methods to fragment molecules are now widely used in the search for new drugs. However, there is little in the way of systematic validation specifically for fragment-based approaches. We have performed a large crystallographic high-throughput fragment screen against the therapeutically relevant second bromodomain of the Pleckstrin-homology domain interacting protein (PHIP2) that revealed 52 different fragments bound across 4 distinct sites, 47 of which were bound to the pharmacologically relevant acetylated lysine (Kac) binding site. These data were used to assess computational screening, binding pose prediction and follow-up enumeration. All submissions performed randomly for screening. Pose prediction success rates (defined as less than 2 Å root mean squared deviation against heavy atom crystal positions) ranged between 0 and 25% and only a very few follow-up compounds were deemed viable candidates from a medicinal-chemistry perspective based on a common molecular descriptors analysis. The tight deadlines imposed during the challenge led to a small number of submissions suggesting that the accuracy of rapidly responsive workflows remains limited. In addition, the application of these methods to reproduce crystallographic fragment data still appears to be very challenging. The results show that there is room for improvement in the development of computational tools particularly when applied to fragment-based drug design.


Asunto(s)
Diseño de Fármacos , Proteínas , Sitios de Unión , Ligandos , Unión Proteica , Proteínas/química
14.
J Intensive Care Med ; 37(5): 641-646, 2022 May.
Artículo en Inglés | MEDLINE | ID: mdl-33955290

RESUMEN

BACKGROUND: To compare the safety and efficacy of percutaneous ultrasound guided gastrostomy (PUG) tube placement with traditional fluoroscopic guided percutaneous gastrostomy tube placement (PRG). METHODS: A prospective, observational, non-randomized cohort trial was performed comparing 25 consecutive patients who underwent PUG placement between April 2020 and August 2020 with 25 consecutive patients who underwent PRG placement between February 2020 and March 2020. Procedure time, sedation, analgesia requirements, and complications were compared between the two groups in non-inferiority analysis. RESULTS: Technical success rates were 96% in both groups (24/25) of procedures. Ninety-two percent of patients in the PUG cohort were admitted to the ICU at the time of G-tube request. Aside from significantly more COVID-19 patients in the PUG group (P < .001), there was no other statistically significant difference in patient demographics. Intra-procedure pain medication requirements were the same for both groups, 50 micrograms of IV fentanyl (P = 1.0). Intra-procedure sedation with IV midazolam was insignificantly higher in the PUG group 1.12 mg vs 0.8 mg (P = .355). Procedure time trended toward statistical significance (P = .076), with PRG being shorter than PUG (30.5 ± 14.1 minutes vs 39.7 ± 17.9 minutes). There were 2 non-device related major complications in the PUG group and 1 major and 1 minor complication in the PRG group. CONCLUSION: PUG is similar in terms of complications to PRG gastrostomy tube placement and a safe method for gastrostomy tube placement in the critically ill with the added benefits of bedside placement, elimination of radiation exposure, and expanded and improved access to care.


Asunto(s)
COVID-19 , Gastrostomía , Gastrostomía/métodos , Humanos , Estudios Prospectivos , Estudios Retrospectivos , Ultrasonografía Intervencional
15.
J Vasc Interv Radiol ; 32(8): 1240.e1-1240.e8, 2021 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-34332723

RESUMEN

Recently developed endovascular techniques to create percutaneous arteriovenous fistulas are an alternative to surgical arteriovenous fistula creation, although there is currently a lack of high-level evidence regarding their creation, maturation, utilization, and long-term function. Recognizing this, the Society of Interventional Radiology Foundation sponsored a Research Consensus Panel and Summit for the prioritization of a research agenda to identify and address the gaps in current knowledge.


Asunto(s)
Fístula Arteriovenosa , Derivación Arteriovenosa Quirúrgica , Fallo Renal Crónico , Fístula Arteriovenosa/diagnóstico por imagen , Fístula Arteriovenosa/terapia , Consenso , Humanos , Investigación Interdisciplinaria , Fallo Renal Crónico/diagnóstico , Fallo Renal Crónico/terapia , Diálisis Renal , Resultado del Tratamiento , Grado de Desobstrucción Vascular
16.
J Chem Inf Model ; 61(3): 1048-1052, 2021 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-33686853

RESUMEN

Relative free energy calculations are fast becoming a critical part of early stage pharmaceutical design, making it important to know how to obtain the best performance with these calculations in applications that could span hundreds of calculations and molecules. In this work, we compared two different treatments of long-range electrostatics, Particle Mesh Ewald (PME) and Reaction Field (RF), in relative binding free energy calculations using a nonequilibrium switching protocol. We found simulations using RF achieve comparable results to those using PME but gain more efficiency when using CPU and similar performance using GPU. The results from this work encourage more use of RF in molecular simulations.


Asunto(s)
Benchmarking , Electricidad Estática
17.
J Comput Aided Mol Des ; 35(1): 1-35, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33392951

RESUMEN

The SAMPL challenges focus on testing and driving progress of computational methods to help guide pharmaceutical drug discovery. However, assessment of methods for predicting binding affinities is often hampered by computational challenges such as conformational sampling, protonation state uncertainties, variation in test sets selected, and even lack of high quality experimental data. SAMPL blind challenges have thus frequently included a component focusing on host-guest binding, which removes some of these challenges while still focusing on molecular recognition. Here, we report on the results of the SAMPL7 blind prediction challenge for host-guest affinity prediction. In this study, we focused on three different host-guest categories-a familiar deep cavity cavitand series which has been featured in several prior challenges (where we examine binding of a series of guests to two hosts), a new series of cyclodextrin derivatives which are monofunctionalized around the rim to add amino acid-like functionality (where we examine binding of two guests to a series of hosts), and binding of a series of guests to a new acyclic TrimerTrip host which is related to previous cucurbituril hosts. Many predictions used methods based on molecular simulations, and overall success was mixed, though several methods stood out. As in SAMPL6, we find that one strategy for achieving reasonable accuracy here was to make empirical corrections to binding predictions based on previous data for host categories which have been studied well before, though this can be of limited value when new systems are included. Additionally, we found that alchemical free energy methods using the AMOEBA polarizable force field had considerable success for the two host categories in which they participated. The new TrimerTrip system was also found to introduce some sampling problems, because multiple conformations may be relevant to binding and interconvert only slowly. Overall, results in this challenge tentatively suggest that further investigation of polarizable force fields for these challenges may be warranted.


Asunto(s)
Diseño Asistido por Computadora , Compuestos Macrocíclicos/química , Compuestos Macrocíclicos/metabolismo , Proteínas/química , Proteínas/metabolismo , Entropía , Humanos , Ligandos , Simulación de Dinámica Molecular , Estructura Molecular , Unión Proteica , Termodinámica
18.
J Comput Aided Mol Des ; 35(2): 131-166, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33394238

RESUMEN

The prediction of acid dissociation constants (pKa) is a prerequisite for predicting many other properties of a small molecule, such as its protein-ligand binding affinity, distribution coefficient (log D), membrane permeability, and solubility. The prediction of each of these properties requires knowledge of the relevant protonation states and solution free energy penalties of each state. The SAMPL6 pKa Challenge was the first time that a separate challenge was conducted for evaluating pKa predictions as part of the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) exercises. This challenge was motivated by significant inaccuracies observed in prior physical property prediction challenges, such as the SAMPL5 log D Challenge, caused by protonation state and pKa prediction issues. The goal of the pKa challenge was to assess the performance of contemporary pKa prediction methods for drug-like molecules. The challenge set was composed of 24 small molecules that resembled fragments of kinase inhibitors, a number of which were multiprotic. Eleven research groups contributed blind predictions for a total of 37 pKa distinct prediction methods. In addition to blinded submissions, four widely used pKa prediction methods were included in the analysis as reference methods. Collecting both microscopic and macroscopic pKa predictions allowed in-depth evaluation of pKa prediction performance. This article highlights deficiencies of typical pKa prediction evaluation approaches when the distinction between microscopic and macroscopic pKas is ignored; in particular, we suggest more stringent evaluation criteria for microscopic and macroscopic pKa predictions guided by the available experimental data. Top-performing submissions for macroscopic pKa predictions achieved RMSE of 0.7-1.0 pKa units and included both quantum chemical and empirical approaches, where the total number of extra or missing macroscopic pKas predicted by these submissions were fewer than 8 for 24 molecules. A large number of submissions had RMSE spanning 1-3 pKa units. Molecules with sulfur-containing heterocycles or iodo and bromo groups were less accurately predicted on average considering all methods evaluated. For a subset of molecules, we utilized experimentally-determined microstates based on NMR to evaluate the dominant tautomer predictions for each macroscopic state. Prediction of dominant tautomers was a major source of error for microscopic pKa predictions, especially errors in charged tautomers. The degree of inaccuracy in pKa predictions observed in this challenge is detrimental to the protein-ligand binding affinity predictions due to errors in dominant protonation state predictions and the calculation of free energy corrections for multiple protonation states. Underestimation of ligand pKa by 1 unit can lead to errors in binding free energy errors up to 1.2 kcal/mol. The SAMPL6 pKa Challenge demonstrated the need for improving pKa prediction methods for drug-like molecules, especially for challenging moieties and multiprotic molecules.


Asunto(s)
Ligandos , Proteínas/química , Solventes/química , Algoritmos , Simulación por Computador , Modelos Químicos , Estructura Molecular , Programas Informáticos , Solubilidad , Termodinámica
19.
J Comput Aided Mol Des ; 35(3): 271-284, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33506360

RESUMEN

Many molecular simulation methods use force fields to help model and simulate molecules and their behavior in various environments. Force fields are sets of functions and parameters used to calculate the potential energy of a chemical system as a function of the atomic coordinates. Despite the widespread use of force fields, their inadequacies are often thought to contribute to systematic errors in molecular simulations. Furthermore, different force fields tend to give varying results on the same systems with the same simulation settings. Here, we present a pipeline for comparing the geometries of small molecule conformers. We aimed to identify molecules or chemistries that are particularly informative for future force field development because they display inconsistencies between force fields. We applied our pipeline to a subset of the eMolecules database, and highlighted molecules that appear to be parameterized inconsistently across different force fields. We then identified over-represented functional groups in these molecule sets. The molecules and moieties identified by this pipeline may be particularly helpful for future force field parameterization.


Asunto(s)
Compuestos Aza/química , Compuestos Orgánicos/química , Bases de Datos de Compuestos Químicos , Modelos Moleculares , Conformación Molecular , Fenómenos Físicos , Teoría Cuántica , Programas Informáticos , Relación Estructura-Actividad , Termodinámica
20.
J Comput Aided Mol Des ; 35(11): 1141-1155, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34714468

RESUMEN

The goal of the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) challenge is to improve the accuracy of current computational models to estimate free energy of binding, deprotonation, distribution and other associated physical properties that are useful for the design of new pharmaceutical products. New experimental datasets of physicochemical properties provide opportunities for prospective evaluation of computational prediction methods. Here, aqueous pKa and a range of bi-phasic logD values for a variety of pharmaceutical compounds were determined through a streamlined automated process to be utilized in the SAMPL8 physical property challenge. The goal of this paper is to provide an in-depth review of the experimental methods utilized to create a comprehensive data set for the blind prediction challenge. The significance of this work involves the use of high throughput experimentation equipment and instrumentation to produce acid dissociation constants for twenty-three drug molecules, as well as distribution coefficients for eleven of those molecules.


Asunto(s)
Modelos Químicos , Preparaciones Farmacéuticas/química , Proteínas/química , Automatización , Descubrimiento de Drogas , Ligandos , Estructura Molecular
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA