RESUMO
Macroscopic pKa values were calculated for all compounds in the SAMPL6 blind prediction challenge, based on quantum chemical calculations with a continuum solvation model and a linear correction derived from a small training set. Microscopic pKa values were derived from the gas-phase free energy difference between protonated and deprotonated forms together with the Conductor-like Polarizable Continuum Solvation Model and the experimental solvation free energy of the proton. pH-dependent microstate free energies were obtained from the microscopic pKas with a maximum likelihood estimator and appropriately summed to yield macroscopic pKa values or microstate populations as function of pH. We assessed the accuracy of three approaches to calculate the microscopic pKas: direct use of the quantum mechanical free energy differences and correction of the direct values for short-comings in the QM solvation model with two different linear models that we independently derived from a small training set of 38 compounds with known pKa. The predictions that were corrected with the linear models had much better accuracy [root-mean-square error (RMSE) 2.04 and 1.95 pKa units] than the direct calculation (RMSE 3.74). Statistical measures indicate that some systematic errors remain, likely due to differences in the SAMPL6 data set and the small training set with respect to their interactions with water. Overall, the current approach provides a viable physics-based route to estimate macroscopic pKa values for novel compounds with reasonable accuracy.
Assuntos
Compostos Heterocíclicos/química , Modelos Químicos , Solventes/química , Bases de Dados de Compostos Químicos , Concentração de Íons de Hidrogênio , Estrutura Molecular , Teoria Quântica , TermodinâmicaRESUMO
All-atom molecular dynamics simulations were used to predict water-cyclohexane distribution coefficients [Formula: see text] of a range of small molecules as part of the SAMPL5 blind prediction challenge. Molecules were parameterized with the transferable all-atom OPLS-AA force field, which required the derivation of new parameters for sulfamides and heterocycles and validation of cyclohexane parameters as a solvent. The distribution coefficient was calculated from the solvation free energies of the compound in water and cyclohexane. Absolute solvation free energies were computed by an established protocol using windowed alchemical free energy perturbation with thermodynamic integration. This protocol resulted in an overall root mean square error in [Formula: see text] of almost 4 log units and an overall signed error of -3 compared to experimental data. There was no substantial overall difference in accuracy between simulating in NVT and NPT ensembles. The signed error suggests a systematic error but the experimental [Formula: see text] data on their own are insufficient to uncover the source of this error. Preliminary work suggests that the major source of error lies in the hydration free energy calculations.
Assuntos
Cicloexanos/química , Simulação de Dinâmica Molecular , Preparações Farmacêuticas/química , Água/química , Modelos Químicos , Estrutura Molecular , Solubilidade , Solventes/química , TermodinâmicaRESUMO
Kinetic and thermodynamic models of biological systems are commonly used to connect microscopic features to system function in a bottom-up multiscale approach. The parameters of such models-free energy differences for equilibrium properties and in general rates for equilibrium and out-of-equilibrium observables-have to be measured by different experiments or calculated from multiple computer simulations. All such parameters necessarily come with uncertainties so that when they are naively combined in a full model of the process of interest, they will generally violate fundamental statistical mechanical equalities, namely detailed balance and an equality of forward/backward rate products in cycles due to T. Hill. If left uncorrected, such models can produce arbitrary outputs that are physically inconsistent. Here we develop a maximum likelihood approach (named multibind ) based on the so-called potential graph to combine kinetic or thermodynamic measurements to yield state resolved models that are thermodynamically consistent while being most consistent with the provided data and their uncertainties. We demonstrate the approach with two theoretical models, a generic two-proton binding site and a simplified model of a sodium/proton antiporter. We also describe an algorithm to use the multibind approach to solve the inverse problem of determining microscopic quantities from macroscopic measurements and as an example we predict the microscopic p K a s and protonation states of a small organic molecule from 1D NMR data. The multibind approach is applicable to any thermodynamic or kinetic model that describes a system as transitions between well-defined states with associated free energy differences or rates between these states. A Python package multibind , which implements the approach described here, is made publicly available under the MIT Open Source license. WHY IT MATTERS: The increase in computational efficiency and rapid advances in methodology for quantitative free energy and rate calculations has allowed for the construction of increasingly complex thermodynamic or kinetic "bottom-up" models of chemical and biological processes. These multi-scale models serve as a framework for analyzing aspects of cellular function in terms of microscopic, molecular properties and provide an opportunity to connect molecular mechanisms to cellular function. The underlying model parameters-free energy differences or rates-are constrained by thermodynamic identities over cycles of states but these identities are not necessarily obeyed during model construction, thus potentially leading to inconsistent models. We address these inconsistencies through the use of a maximum likelihood approach for free energies and rates to adjust the model parameters in such a way that they are maximally consistent with the input parameters and exactly fulfill the thermodynamic cycle constraints. This approach enables formulation of thermodynamically consistent multi-scale models from simulated or experimental measurements.
RESUMO
Kinetic and thermodynamic models of biological systems are commonly used to connect microscopic features to system function in a bottom-up multiscale approach. The parameters of such models-free energy differences for equilibrium properties and in general rates for equilibrium and out-of-equilibrium observables-have to be measured by different experiments or calculated from multiple computer simulations. All such parameters necessarily come with uncertainties so that when they are naively combined in a full model of the process of interest, they will generally violate fundamental statistical mechanical equalities, namely detailed balance and an equality of forward/backward rate products in cycles due to Hill. If left uncorrected, such models can produce arbitrary outputs that are physically inconsistent. Here, we develop a maximum likelihood approach (named multibind) based on the so-called potential graph to combine kinetic or thermodynamic measurements to yield state-resolved models that are thermodynamically consistent while being most consistent with the provided data and their uncertainties. We demonstrate the approach with two theoretical models, a generic two-proton binding site and a simplified model of a sodium/proton antiporter. We also describe an algorithm to use the multibind approach to solve the inverse problem of determining microscopic quantities from macroscopic measurements and, as an example, we predict the microscopic pKa values and protonation states of a small organic molecule from 1D NMR data. The multibind approach is applicable to any thermodynamic or kinetic model that describes a system as transitions between well-defined states with associated free energy differences or rates between these states. A Python package multibind, which implements the approach described here, is made publicly available under the MIT Open Source license.
RESUMO
The strict exchange of protons for sodium ions across cell membranes by Na+/H+ exchangers is a fundamental mechanism for cell homeostasis. At active pH, Na+/H+ exchange can be modelled as competition between H+ and Na+ to an ion-binding site, harbouring either one or two aspartic-acid residues. Nevertheless, extensive analysis on the model Na+/H+ antiporter NhaA from Escherichia coli, has shown that residues on the cytoplasmic surface, termed the pH sensor, shifts the pH at which NhaA becomes active. It was unclear how to incorporate the pH senor model into an alternating-access mechanism based on the NhaA structure at inactive pH 4. Here, we report the crystal structure of NhaA at active pH 6.5, and to an improved resolution of 2.2 Å. We show that at pH 6.5, residues in the pH sensor rearrange to form new salt-bridge interactions involving key histidine residues that widen the inward-facing cavity. What we now refer to as a pH gate, triggers a conformational change that enables water and Na+ to access the ion-binding site, as supported by molecular dynamics (MD) simulations. Our work highlights a unique, channel-like switch prior to substrate translocation in a secondary-active transporter.