RESUMEN
Water molecules at protein-ligand interfaces are often of significant pharmaceutical interest, owing in part to the entropy which can be released upon the displacement of an ordered water by a therapeutic compound. Protein structures may not, however, completely resolve all critical bound water molecules, or there may be no experimental data available. As such, predicting the location of water molecules in the absence of a crystal structure is important in the context of rational drug design. Grand canonical Monte Carlo (GCMC) is a computational technique that is gaining popularity for the simulation of buried water sites. In this work, we assess the ability of GCMC to accurately predict water binding locations, using a dataset that we have curated, containing 108 unique structures of complexes between proteins and Food and Drug Administration (FDA)-approved small-molecule drugs. We show that GCMC correctly predicts 81.4% of nonbulk crystallographic water sites to within 1.4 Å. However, our analysis demonstrates that the reported performance of water prediction methods is highly sensitive to the way in which the performance is measured. We also find that crystallographic water sites with more protein/ligand hydrogen bonds and stronger electron density are more reliably predicted by GCMC. An analysis of water networks revealed that more than half of the structures contain at least one ligand-contacting water network. In these cases, displacement of a water site by a ligand modification might yield unexpected results if the larger network is destabilized. Cooperative effects between waters should therefore be explicitly considered in structure-based drug design.
Asunto(s)
Proteínas , Agua , Agua/química , Ligandos , Proteínas/química , Simulación por Computador , Preparaciones Farmacéuticas , Sitios de Unión , Unión ProteicaRESUMEN
Predicting the correct pose of a ligand binding to a protein and its associated binding affinity is of great importance in computer-aided drug discovery. A number of approaches have been developed to these ends, ranging from the widely used fast molecular docking to the computationally expensive enhanced sampling molecular simulations. In this context, methods such as coarse-grained metadynamics and binding pose metadynamics (BPMD) use simulations with metadynamics biasing to probe the binding affinity without trying to fully converge the binding free energy landscape in order to decrease the computational cost. In BPMD, the metadynamics bias perturbs the ligand away from the initial pose. The resistance of the ligand to this bias is used to calculate a stability score. The method has been shown to be useful in reranking predicted binding poses from docking. Here, we present OpenBPMD, an open-source Python reimplementation and reinterpretation of BPMD. OpenBPMD is powered by the OpenMM simulation engine and uses a revised scoring function. The algorithm was validated by testing it on a wide range of targets and showing that it matches or exceeds the performance of the original BPMD. We also investigated the role of accurate water positioning on the performance of the algorithm and showed how the combination with a grand-canonical Monte Carlo algorithm improves the accuracy of the predictions.
Asunto(s)
Descubrimiento de Drogas , Proteínas , Ligandos , Simulación del Acoplamiento Molecular , Unión Proteica , Proteínas/química , Sitios de Unión , TermodinámicaRESUMEN
The fundamental importance of water molecules at drug-protein interfaces is now widely recognised and a significant feature in structure-based drug design. Experimental methods for analysing the role of water in drug binding have many challenges, including the accurate location of bound water molecules in crystal structures, and problems in resolving specific water contributions to binding thermodynamics. Computational analyses of binding site water molecules provide an alternative, and in principle complete, structural and thermodynamic picture, and their use is now commonplace in the pharmaceutical industry. In this review, we describe the computational methodologies that are available and discuss their strengths and weaknesses. Additionally, we provide a critical analysis of the experimental data used to validate the methods, regarding the type and quality of experimental structural data. We also discuss some of the fundamental difficulties of each method and suggest directions for future study.
Asunto(s)
Preparaciones Farmacéuticas/química , Proteínas/química , Agua/química , Sitios de Unión , Ligandos , Unión Proteica , TermodinámicaRESUMEN
The influenza A M2 wild-type (WT) proton channel is the target of the anti-influenza drug rimantadine. Rimantadine has two enantiomers, though most investigations into drug binding and inhibition have used a racemic mixture. Solid-state NMR experiments using the full length-M2 WT have shown significant spectral differences that were interpreted to indicate tighter binding for (R)- vs (S)-rimantadine. However, it was unclear if this correlates with a functional difference in drug binding and inhibition. Using X-ray crystallography, we have determined that both (R)- and (S)-rimantadine bind to the M2 WT pore with slight differences in the hydration of each enantiomer. However, this does not result in a difference in potency or binding kinetics, as shown by similar values for kon, koff, and Kd in electrophysiological assays and for EC50 values in cellular assays. We concluded that the slight differences in hydration for the (R)- and (S)-rimantadine enantiomers are not relevant to drug binding or channel inhibition. To further explore the effect of the hydration of the M2 pore on binding affinity, the water structure was evaluated by grand canonical ensemble molecular dynamics simulations as a function of the chemical potential of the water. Initially, the two layers of ordered water molecules between the bound drug and the channel's gating His37 residues mask the drug's chirality. As the chemical potential becomes more unfavorable, the drug translocates down to the lower water layer, and the interaction becomes more sensitive to chirality. These studies suggest the feasibility of displacing the upper water layer and specifically recognizing the lower water layers in novel drugs.
RESUMEN
Networks of water molecules can play a critical role at the protein-ligand interface and can directly influence drug-target interactions. Grand canonical methods aid in the sampling of these water molecules, where conventional molecular dynamics equilibration times are often long, by allowing waters to be inserted and deleted from the system, according to the chemical potential. Here, we present our open source Python module, grand (https://github.com/essex-lab/grand), which allows molecular dynamics simulations to be performed in conjunction with grand canonical Monte Carlo sampling, using the OpenMM simulation engine. We demonstrate the accuracy of this module by reproducing the density of bulk water observed from constant pressure simulations. Application of this code to the bovine pancreatic trypsin inhibitor protein reproduces three buried crystallographic water sites that are poorly sampled using conventional molecular dynamics.
Asunto(s)
Simulación de Dinámica Molecular , Agua , Animales , Bovinos , Ligandos , Método de Montecarlo , ProteínasRESUMEN
Water molecules play a key role in many biomolecular systems, particularly when bound at protein-ligand interfaces. However, molecular simulation studies on such systems are hampered by the relatively long time scales over which water exchange between a protein and solvent takes place. Grand canonical Monte Carlo (GCMC) is a simulation technique that avoids this issue by attempting the insertion and deletion of water molecules within a given structure. The approach is constrained by low acceptance probabilities for insertions in congested systems, however. To address this issue, here, we combine GCMC with nonequilibium candidate Monte Carlo (NCMC) to yield a method that we refer to as grand canonical nonequilibrium candidate Monte Carlo (GCNCMC), in which the water insertions and deletions are carried out in a gradual, nonequilibrium fashion. We validate this new approach by comparing GCNCMC and GCMC simulations of bulk water and three protein binding sites. We find that not only is the efficiency of the water sampling improved by GCNCMC but that it also results in increased sampling of ligand conformations in a protein binding site, revealing new water-mediated ligand-binding geometries that are not observed using alternative enhanced sampling techniques.
RESUMEN
Water molecules play important roles in all biochemical processes. Therefore, it is of key importance to obtain information of the structure, dynamics, and thermodynamics of water molecules around proteins. Numerous computational methods have been suggested with this aim. In this study, we compare the performance of conventional and grand-canonical Monte Carlo (GCMC) molecular dynamics (MD) simulations to sample the water structure, as well GCMC and grid-based inhomogeneous solvation theory (GIST) to describe the energetics of the water network. They are evaluated on two proteins: the buried ligand-binding site of a ferritin dimer and the solvent-exposed binding site of galectin-3. We show that GCMC/MD simulations significantly speed up the sampling and equilibration of water molecules in the buried binding site, thereby making the results more similar for simulations started from different states. Both GCMC/MD and conventional MD reproduce crystal-water molecules reasonably for the buried binding site. GIST analyses are normally based on restrained MD simulations. This improves the precision of the calculated energies, but the restraints also significantly affect both absolute and relative energies. Solvation free energies for individual water molecules calculated with and without restraints show a good correlation, but with large quantitative differences. Finally, we note that the solvation free energies calculated with GIST are â¼5 times larger than those estimated by GCMC owing to differences in the reference state.
RESUMEN
Water often plays a key role in protein structure, molecular recognition, and mediating protein-ligand interactions. Thus, free energy calculations must adequately sample water motions, which often proves challenging in typical MD simulation time scales. Thus, the accuracy of methods relying on MD simulations ends up limited by slow water sampling. Particularly, as a ligand is removed or modified, bulk water may not have time to fill or rearrange in the binding site. In this work, we focus on several molecular dynamics (MD) simulation-based methods attempting to help rehydrate buried water sites: BLUES, using nonequilibrium candidate Monte Carlo (NCMC); grand, using grand canonical Monte Carlo (GCMC); and normal MD. We assess the accuracy and efficiency of these methods in rehydrating target water sites. We selected a range of systems with varying numbers of waters in the binding site, as well as those where water occupancy is coupled to the identity or binding mode of the ligand. We analyzed the rehydration of buried water sites in binding pockets using both clustering of trajectories and direct analysis of electron density maps. Our results suggest both BLUES and grand enhance water sampling relative to normal MD and grand is more robust than BLUES, but also that water sampling remains a major challenge for all of the methods tested. The lessons we learned for these methods and systems are discussed.
Asunto(s)
Simulación de Dinámica Molecular , Agua , Sitios de Unión , Fluidoterapia , Ligandos , Método de Montecarlo , Unión Proteica , Termodinámica , Agua/químicaRESUMEN
Current computer architectures, coupled with state-of-the-art molecular dynamics simulation software, facilitate the in-depth study of large biomolecular systems at high levels of detail. However, biological phenomena take place at various time and length scales and as a result a multiscale approach must be adopted. One such approach is coarse-graining, where biochemical accuracy is sacrificed for computational efficiency. Here, we present a practical guide to setting up and carrying out coarse-grained molecular dynamics simulations.