RESUMEN
In recent years, molecular dynamics simulations of proteins in explicit mixed solvents have been applied to various problems in protein biophysics and drug discovery, including protein folding, protein surface characterization, fragment screening, allostery, and druggability assessment. In this study, we perform a systematic study on how mixtures of organic solvent probes in water can reveal cryptic ligand binding pockets that are not evident in crystal structures of apo proteins. We examine a diverse set of eight PDB proteins that show pocket opening induced by ligand binding and investigate whether solvent MD simulations on the apo structures can induce the binding site observed in the holo structures. The cosolvent simulations were found to induce conformational changes on the protein surface, which were characterized and compared with the holo structures. Analyses of the biological systems, choice of probes and concentrations, druggability of the resulting induced pockets, and application to drug discovery are discussed here.
Asunto(s)
Simulación de Dinámica Molecular , Proteínas/química , Proteínas/metabolismo , Solventes/química , Sitios de Unión , Conformación ProteicaRESUMEN
BACKGROUND: Proteins play an important role in biological processes in living organisms. Many protein functions are based on interaction with other proteins. The structural information is important for adequate description of these interactions. Sets of protein structures determined in both bound and unbound states are essential for benchmarking of the docking procedures. However, the number of such proteins in PDB is relatively small. A radical expansion of such sets is possible if the unbound structures are computationally simulated. RESULTS: The DOCKGROUND public resource provides data to improve our understanding of protein-protein interactions and to assist in the development of better tools for structural modeling of protein complexes, such as docking algorithms and scoring functions. A large set of simulated unbound protein structures was generated from the bound structures. The modeling protocol was based on 1 ns Langevin dynamics simulation. The simulated structures were validated on the ensemble of experimentally determined unbound and bound structures. The set is intended for large scale benchmarking of docking algorithms and scoring functions. CONCLUSIONS: A radical expansion of the unbound protein docking benchmark set was achieved by simulating the unbound structures. The simulated unbound structures were selected according to criteria from systematic comparison of experimentally determined bound and unbound structures. The set is publicly available at http://dockground.compbio.ku.edu.
Asunto(s)
Benchmarking , Biología Computacional/métodos , Simulación por Computador , Proteínas/química , Algoritmos , Sitios de Unión , Internet , Simulación del Acoplamiento Molecular , Dominios y Motivos de Interacción de Proteínas , Estructura Terciaria de Proteína , Proteínas/metabolismo , Interfaz Usuario-ComputadorRESUMEN
Ferritin-like molecules show a remarkable combination of the evolutionary conserved activity of iron uptake and release that engage different pores in the conserved ferritin shell. It was hypothesized that pore selection and iron traffic depend on dynamic allostery with no conformational changes in the backbone. In this study, we detect the allosteric networks in Pseudomonas aeruginosa bacterioferritin (BfrB), bacterial ferritin (FtnA), and bullfrog M and L ferritins (Ftns) by a network-weaving algorithm (NWA) that passes threads of an allosteric network through highly correlated residues using hierarchical clustering. The residue-residue correlations are calculated in the packing-on elastic network model that introduces atom packing into the common packing-off model. Applying NWA revealed that each of the molecules has an extended allosteric network mostly buried inside the ferritin shell. The structure of the networks is consistent with experimental observations of iron transport: The allosteric networks in BfrB and FtnA connect the ferroxidase center with the 4-fold pores and B-pores, leaving the 3-fold pores unengaged. In contrast, the allosteric network directly links the 3-fold pores with the 4-fold pores in M and L Ftns. The majority of the network residues are either on the inner surface or buried inside the subunit fold or at the subunit interfaces. We hypothesize that the ferritin structures evolved in a way to limit the influence of functionally unrelated events in the cytoplasm on the allosteric network to maintain stability of the translocation mechanisms. We showed that the residue-residue correlations and the resultant long-range cooperativity depend on the ferritin shell packing, which, in turn, depends on protein sequence composition. Switching from the packing-on to the packing-off model reduces correlations by 35%-38% so that no allosteric network can be found. The influence of the side-chain packing on the allosteric networks explains the diversity in mechanisms of iron traffic suggested by experimental approaches.
Asunto(s)
Células Eucariotas/metabolismo , Ferritinas/metabolismo , Hierro/metabolismo , Células Procariotas/metabolismo , Algoritmos , Regulación Alostérica , Células Eucariotas/química , Ferritinas/química , Hierro/química , Modelos Moleculares , Células Procariotas/química , Pseudomonas aeruginosa/química , Pseudomonas aeruginosa/metabolismoRESUMEN
BACKGROUND: Protein interactions play a key role in life processes. Characterization of conformational properties of protein-protein interactions is important for understanding the mechanisms of protein association. The rapidly increasing amount of experimentally determined structures of proteins and protein-protein complexes provides foundation for research on protein interactions and complex formation. The knowledge of the conformations of the surface side chains is essential for modeling of protein complexes. The purpose of this study was to analyze and compare dihedral angle distribution functions of the side chains at the interface and non-interface areas in bound and unbound proteins. RESULTS: To calculate the dihedral angle distribution functions, the configuration space was divided into grid cells. Statistical analysis showed that the similarity between bound and unbound interface and non-interface surface depends on the amino acid type and the grid resolution. The correlation coefficients between the distribution functions increased with the grid spacing increase for all amino acid types. The Manhattan distance showing the degree of dissimilarity between the distribution functions decreased accordingly. Short residues with one or two dihedral angles had higher correlations and smaller Manhattan distances than the longer residues. Met and Arg had the slowest growth of the correlation coefficient with the grid spacing increase. The correlations between the interface and non-interface distribution functions had a similar dependence on the grid resolution in both bound and unbound states. The interface and non-interface differences between bound and unbound distribution functions, caused by biological protein-protein interactions or crystal contacts, disappeared at the 70° grid spacing for interfaces and 30° for non-interface surface, which agrees with an average span of the side-chain rotamers. CONCLUSIONS: The two-fold difference in the critical grid spacing indicates larger conformational changes upon binding at the interface than at the rest of the surface. At the same time, transitions between rotamers induced by interactions across the interface or the crystal packing are rare, with most side chains having local readjustments that do not change the rotameric state. The analysis is important for better understanding of protein interactions and development of flexible docking approaches.
Asunto(s)
Aminoácidos/química , Complejos Multiproteicos/química , Conformación Proteica , Aminoácidos/metabolismo , Modelos Moleculares , Complejos Multiproteicos/metabolismo , Unión Proteica , Proteínas/química , Proteínas/metabolismoRESUMEN
Conformational changes in the side chains are essential for protein-protein binding. Rotameric states and unbound- to-bound conformational changes in the surface residues were systematically studied on a representative set of protein complexes. The side-chain conformations were mapped onto dihedral angles space. The variable threshold algorithm was developed to cluster the dihedral angle distributions and to derive rotamers, defined as the most probable conformation in a cluster. Six rotamer libraries were generated: full surface, surface noninterface, and surface interface-each for bound and unbound states. The libraries were used to calculate the probabilities of the rotamer transitions upon binding. The stability of amino acids was quantified based on the transition maps. The noninterface residues' stability was higher than that of the interface. Long side chains with three or four dihedral angles were less stable than the shorter ones. The transitions between the rotamers at the interface occurred more frequently than on the noninterface surface. Most side chains changed conformation within the same rotamer or moved to an adjacent rotamer. The highest percentage of the transitions was observed primarily between the two most occupied rotamers. The probability of the transition between rotamers increased with the decrease of the rotamer stability. The analysis revealed characteristics of the surface side-chain conformational transitions that can be utilized in flexible docking protocols.
Asunto(s)
Aminoácidos/química , Conformación Proteica , Proteínas/química , Algoritmos , Cristalografía por Rayos X , Modelos Moleculares , Unión Proteica , Mapas de Interacción de Proteínas , Propiedades de SuperficieRESUMEN
Ferritin-like molecules are unique to cellular iron homeostasis because they can store iron at concentrations much higher than those dictated by the solubility of Fe(3+). Very little is known about the protein interactions that deliver iron for storage or promote the mobilization of stored iron from ferritin-like molecules. Here, we report the X-ray crystal structure of Pseudomonas aeruginosa bacterioferritin (Pa-BfrB) in complex with bacterioferritin-associated ferredoxin (Pa-Bfd) at 2.0 Å resolution. As the first example of a ferritin-like molecule in complex with a cognate partner, the structure provides unprecedented insight into the complementary interface that enables the [2Fe-2S] cluster of Pa-Bfd to promote heme-mediated electron transfer through the BfrB protein dielectric (~18 Å), a process that is necessary to reduce the core ferric mineral and facilitate mobilization of Fe(2+). The Pa-BfrB-Bfd complex also revealed the first structure of a Bfd, thus providing a first view to what appears to be a versatile metal binding domain ubiquitous to the large Fer2_BFD family of proteins and enzymes with diverse functions. Residues at the Pa-BfrB-Bfd interface are highly conserved in Bfr and Bfd sequences from a number of pathogenic bacteria, suggesting that the specific recognition between Pa-BfrB and Pa-Bfd is of widespread significance to the understanding of bacterial iron homeostasis.
Asunto(s)
Proteínas Bacterianas/química , Proteínas Bacterianas/metabolismo , Grupo Citocromo b/química , Grupo Citocromo b/metabolismo , Ferredoxinas/química , Ferritinas/química , Ferritinas/metabolismo , Hierro/metabolismo , Modelos Moleculares , Cristalografía por Rayos X , Ferredoxinas/metabolismo , Hierro/química , Pliegue de Proteína , Pseudomonas aeruginosa/metabolismoRESUMEN
MOTIVATION: Computational studies of the energetics of protein association are important for revealing the underlying fundamental principles and for designing better tools to model protein complexes. The interaction cutoff contribution to the ruggedness of protein-protein energy landscape is studied in terms of relative energy fluctuations for 1/r(n) potentials based on a simplistic model of a protein complex. This artificial ruggedness exists for short cutoffs and gradually disappears with the cutoff increase. RESULTS: The critical values of the cutoff were calculated for each of 11 popular power-type potentials with n=0/9, 12 and for two thresholds of 5% and 10%. The artificial ruggedness decreases to tolerable thresholds for cutoffs larger than the critical ones. The results showed that for both thresholds the critical cutoff is a non-monotonic function of the potential power n. The functions reach the maximum at n=3/4 and then decrease with the increase of the potential power. The difference between two cutoffs for 5% and 10% artificial ruggedness becomes negligible for potentials decreasing faster than 1/r(12). The analytical results obtained for the simple model of protein complexes agree with the analysis of artificial ruggedness in a dataset of 62 protein-protein complexes, with different parameterizations of soft Lennard-Jones potential and two types of protein representations: all-atom and coarse-grained. The results suggest that cutoffs larger than the critical ones can be recommended for protein-protein potentials.
Asunto(s)
Biología Computacional/métodos , Complejos Multiproteicos/química , Algoritmos , Sitios de Unión , Cinética , Conformación Proteica , Pliegue de Proteína , Mapeo de Interacción de Proteínas , TermodinámicaRESUMEN
Structure fluctuations in proteins affect a broad range of cell phenomena, including stability of proteins and their fragments, allosteric transitions, and energy transfer. This study presents a statistical-thermodynamic analysis of relationship between the sequence composition and the distribution of residue fluctuations in protein-protein complexes. A one-node-per-residue elastic network model accounting for the nonhomogeneous protein mass distribution and the interatomic interactions through the renormalized inter-residue potential is developed. Two factors, a protein mass distribution and a residue environment, were found to determine the scale of residue fluctuations. Surface residues undergo larger fluctuations than core residues in agreement with experimental observations. Ranking residues over the normalized scale of fluctuations yields a distinct classification of amino acids into three groups: (i) highly fluctuating-Gly, Ala, Ser, Pro, and Asp, (ii) moderately fluctuating-Thr, Asn, Gln, Lys, Glu, Arg, Val, and Cys, and (iii) weakly fluctuating-Ile, Leu, Met, Phe, Tyr, Trp, and His. The structural instability in proteins possibly relates to the high content of the highly fluctuating residues and a deficiency of the weakly fluctuating residues in irregular secondary structure elements (loops), chameleon sequences, and disordered proteins. Strong correlation between residue fluctuations and the sequence composition of protein loops supports this hypothesis. Comparing fluctuations of binding site residues (interface residues) with other surface residues shows that, on average, the interface is more rigid than the rest of the protein surface and Gly, Ala, Ser, Cys, Leu, and Trp have a propensity to form more stable docking patches on the interface. The findings have broad implications for understanding mechanisms of protein association and stability of protein structures.
Asunto(s)
Proteínas/química , Secuencia de Aminoácidos , Animales , Modelos Moleculares , Unión Proteica , Conformación Proteica , Proteínas/metabolismo , Porcinos , alfa-Amilasas/química , alfa-Amilasas/metabolismoRESUMEN
Studies of intermolecular energy landscapes are important for understanding protein association and adequate modeling of protein interactions. Landscape representation at different resolutions can be used for the refinement of docking predictions and detection of macro characteristics, like the binding funnel. A representative set of protein-protein complexes was used to systematically map the intermolecular landscape by grid-based docking. The change of the resolution was achieved by varying the range of the potential, according to the variable resolution GRAMM methodology. A formalism was developed to consistently parameterize the potential and describe essential characteristics of the landscape. The results of gradual landscape smoothing, from high to low resolution, indicate that i), the number of energy basins, the landscape ruggedness, and the slope decrease accordingly; ii), the number of near-native matches, defined as those inside the funnel, increases until the trend breaks down at critical resolution; the rate of the increase and the critical resolution are specific to the type of a complex (enzyme inhibitor, antigen-antibody, and other), reflect known underlying recognition factors, and correlate with earlier determined estimates of the funnel size; iii), the native/nonnative energy gap, a major characteristic of the energy minima hierarchy, remains constant; and iv), the putative funnel (defined as the deepest basin) has the largest average depth-related ruggedness and slope, at all resolutions. The results facilitate better understanding of the binding landscapes and suggest directions for implementation in practical docking protocols.
Asunto(s)
Modelos Químicos , Mapeo de Interacción de Proteínas/métodos , Proteínas/metabolismo , Algoritmos , Simulación por Computador , Cinética , Unión ProteicaRESUMEN
The concept of the energy landscape is important for better understanding of protein-protein interactions and for designing adequate docking procedures. The intermolecular landscape has a rugged terrain that impedes search procedures. Its inherent ruggedness is related to the conformational characteristics of the molecules and to the form of the potential function--more rugged for short-range potentials and less rugged for "soft," typically long-range potentials. Our study determined that the landscape ruggedness is further substantially exacerbated by truncation of the potentials. This additional ruggedness appears below certain critical interaction ranges that depend on the form of the potential. The theoretical model describing the cutoff effect on the landscape ruggedness is confirmed by the energy calculation on a dataset of protein-protein complexes. The negative effect of the potentials cutoff is well known. However, revealing its physical basis in terms of the energy landscape is important for better understanding of intermolecular interactions.
Asunto(s)
Proteínas/metabolismo , Termodinámica , Modelos Químicos , Modelos Teóricos , Unión Proteica , Proteínas/químicaRESUMEN
Mechanisms of protein-carbohydrate recognition attract a lot of interest due to their roles in various cellular processes and metabolism disorders. We have performed a large-scale analysis of protein structures solved in complex with glucose, galactose and their substituted analogues. We found that, on average, sugar molecules establish five hydrogen bonds (HBs) in the binding site, including one to three HBs with bridging water molecules. The free energy contribution of bridging and direct HBs was estimated using the free energy perturbation (FEP+) methodology for mono- and disaccharides that bind to l-ABP, ttGBP, TrmB, hGalectin-1 and hGalectin-3. We show that removing hydroxy groups that are engaged in direct HBs with the charged groups of Asp, Arg and Glu residues, protein backbone amide or buried water dramatically decreases binding affinity. In contrast, all solvent-exposed hydroxy groups and hydroxy groups engaged in HBs with the solvent-exposed bridging water molecules contribute weakly to binding affinity and so can be replaced to optimize ligand potency. Finally, we rationalize an effect of binding site water replacement on the binding affinity to l-ABP.
Asunto(s)
Carbohidratos/química , Modelos Moleculares , Proteínas/química , Sitios de Unión , Bases de Datos de Proteínas , Disacáridos/química , Glicosilación , Enlace de Hidrógeno , Ligandos , Monosacáridos/química , Unión Proteica , Conformación Proteica , Solventes/química , Termodinámica , Agua/químicaRESUMEN
Negamycin is a hydrophilic antimicrobial translation inhibitor that crosses the lipophilic inner membrane of Escherichia coli via at least two transport routes to reach its intracellular target. In a minimal salts medium, negamycin's peptidic nature allows illicit entry via a high-affinity route by hijacking the Dpp dipeptide transporter. Transport via a second, low-affinity route is energetically driven by the membrane potential, seemingly without the direct involvement of a transport protein. In mouse thigh models of E. coli infection, no evidence for Dpp-mediated transport of negamycin was found. The implication is that for the design of new negamycin-based analogs, the physicochemical properties required for cell entry via the low-affinity route need to be retained to achieve clinical success in the treatment of infectious diseases. Furthermore, clinical resistance to such analogs due to mutations affecting their ribosomal target or transport is expected to be rare and similar to that of aminoglycosides.
RESUMEN
Negamycin is a natural product with antibacterial activity against a broad range of Gram-negative pathogens. Recent revelation of its ribosomal binding site and mode of inhibition has reinvigorated efforts to identify improved analogues with clinical potential. Translation-inhibitory potency and antimicrobial activity upon modification of different moieties of negamycin were in line with its observed ribosomal binding conformation, reaffirming stringent structural requirements for activity. However, substitutions on the N6 amine were tolerated and led to N6-(3-aminopropyl)-negamycin (31f), an analogue showing 4-fold improvement in antibacterial activity against key bacterial pathogens. This represents the most potent negamycin derivative to date and may be a stepping stone toward clinical development of this novel antibacterial class.
RESUMEN
Physicochemical description of numerous cell processes is fundamentally based on the energy landscapes of protein molecules involved. Although the whole energy landscape is difficult to reconstruct, increased attention to particular targets has provided enough structures for mapping functionally important subspaces associated with the unbound and bound protein structures. The subspace mapping produces a discrete representation of the landscape, further called energy spectrum. We compiled and characterized ensembles of bound and unbound conformations of six small proteins and explored their spectra in implicit solvent. First, the analysis of the unbound-to-bound changes points to conformational selection as the binding mechanism for four proteins. Second, results show that bound and unbound spectra often significantly overlap. Moreover, the larger the overlap the smaller the root mean square deviation (RMSD) between the bound and unbound conformational ensembles. Third, the center of the unbound spectrum has a higher energy than the center of the corresponding bound spectrum of the dimeric and multimeric states for most of the proteins. This suggests that the unbound states often have larger entropy than the bound states. Fourth, the exhaustively long minimization, making small intrarotamer adjustments (all-atom RMSD ≤ 0.7 Å), dramatically reduces the distance between the centers of the bound and unbound spectra as well as the spectra extent. It condenses unbound and bound energy levels into a thin layer at the bottom of the energy landscape with the energy spacing that varies between 0.8-4.6 and 3.5-10.5 kcal/mol for the unbound and bound states correspondingly. Finally, the analysis of protein energy fluctuations showed that protein vibrations itself can excite the interstate transitions, including the unbound-to-bound ones.
Asunto(s)
Proteínas/química , Proteínas/metabolismo , Animales , Humanos , Unión Proteica , Conformación Proteica , TermodinámicaRESUMEN
Structure fluctuations and conformational changes accompany all biological processes involving macromolecules. The paper presents a classification of protein residues based on the normalized equilibrium fluctuations of the residue centers of mass in proteins and a statistical analysis of conformation changes in the side-chains upon binding. Normal mode analysis and an elastic network model were applied to a set of protein complexes to calculate the residue fluctuations and develop the residue classification. Comparison with a classification based on normalized B-factors suggests that the B-factors may underestimate protein flexibility in solvent. Our classification shows that protein loops and disordered fragments are enriched with highly fluctuating residues and depleted with weakly fluctuating residues. Strategies for engineering thermostable proteins are discussed. To calculate the dihedral angles distribution functions, the configuration space was divided into cells by a cubic grid. The effect of protein association on the distribution functions depends on the amino acid type and a grid step in the dihedral angles space. The changes in the dihedral angles increase from the near-backbone dihedral angle to the most distant one, for most residues. On average, one fifth of the interface residues change the rotamer state upon binding, whereas the rest of the interface residues undergo local readjustments within the same rotamer.
Asunto(s)
Conformación Proteica , Proteínas/química , Sitios de Unión , Bases de Datos de Proteínas , Modelos Moleculares , Proteínas/metabolismoRESUMEN
Conformational changes upon protein-protein association are the key element of the binding mechanism. The study presents a systematic large-scale analysis of such conformational changes in the side chains. The results indicate that short and long side chains have different propensities for the conformational changes. Long side chains with three or more dihedral angles are often subject to large conformational transition. Shorter residues with one or two dihedral angles typically undergo local conformational changes not leading to a conformational transition. A relationship between the local readjustments and the equilibrium fluctuations of a side chain around its unbound conformation is suggested. Most of the side chains undergo larger changes in the dihedral angle most distant from the backbone. The frequencies of the core-to-surface interface transitions of six nonpolar residues and Tyr are larger than the frequencies of the opposite surface-to-core transitions. The binding increases both polar and nonpolar interface areas. However, the increase of the nonpolar area is larger for all considered classes of protein complexes, suggesting that the protein association perturbs the unbound interfaces to increase the hydrophobic contribution to the binding free energy. To test modeling approaches to side-chain flexibility in protein docking, conformational changes in the X-ray set were compared with those in the docking decoy sets. The results lead to a better understanding of the conformational changes in proteins and suggest directions for efficient conformational sampling in docking protocols.
Asunto(s)
Conformación Proteica , Proteínas/química , Proteínas/metabolismo , Humanos , Modelos Moleculares , Unión Proteica , Dominios y Motivos de Interacción de ProteínasRESUMEN
In the context of virtual database screening, calculations of protein-ligand binding entropy of relative and overall molecular motions are challenging, owing to the inherent structural complexity of the ligand binding well in the energy landscape of protein-ligand interactions and computing time limitations. We describe a fast statistical thermodynamic method for estimation the binding entropy to address the challenges. The method is based on the integration of the configurational integral over clusters obtained from multiple docked positions. We apply the method in conjunction with 11 popular scoring functions (AutoDock, ChemScore, DrugScore, D-Score, F-Score, G-Score, LigScore, LUDI, PLP, PMF, X-Score) to evaluate the binding entropy of 100 protein-ligand complexes. The averaged values of binding entropy contribution vary from 6.2 to 9.1 kcal/mol, showing good agreement with literature. We calculate positional sizes and the angular volume of the native ligand wells. The averaged geometric mean of positional sizes in principal directions varies from 0.8 to 1.4 A. The calculated range of angular volumes is 3.3-11.8 rad(2). Then we demonstrate that the averaged six-dimensional volume of the native well is larger than the volume of the most populated non-native well in energy landscapes described by all of 11 scoring functions.
Asunto(s)
Simulación por Computador , Entropía , Ligandos , Proteínas/química , Algoritmos , Biología Computacional , Conformación Molecular , Movimiento (Física) , Unión ProteicaRESUMEN
We present results of testing the ability of eleven popular scoring functions to predict native docked positions using a recently developed method (Ruvinsky and Kozintsev, J Comput Chem 2005, 26, 1089) for estimation the entropy contributions of relative motions to protein-ligand binding affinity. The method is based on the integration of the configurational integral over clusters obtained from multiple docked positions. We use a test set of 100 PDB protein-ligand complexes and ensembles of 101 docked positions generated by (Wang et al. J Med Chem 2003, 46, 2287) for each ligand in the test set. To test the suggested method we compared the averaged root-mean square deviations (RMSD) of the top-scored ligand docked positions, accounting and not accounting for entropy contributions, relative to the experimentally determined positions. We demonstrate that the method increases docking accuracy by 10-21% when used in conjunction with the AutoDock scoring function, by 2-25% with G-Score, by 7-41% with D-Score, by 0-8% with LigScore, by 1-6% with PLP, by 0-12% with LUDI, by 2-8% with F-Score, by 7-29% with ChemScore, by 0-9% with X-Score, by 2-19% with PMF, and by 1-7% with DrugScore. We also compared the performance of the suggested method with the method based on ranking by cluster occupancy only. We analyze how the choice of a clustering-RMSD and a low bound of dense clusters impacts on docking accuracy of the scoring methods. We derive optimal intervals of the clustering-RMSD for 11 scoring functions.