RESUMO
Interactions in enzymes between catalytic and neighboring amino acids and how these interactions facilitate catalysis are examined. In examples from both natural and designed enzymes, it is shown that increases in catalytic rates may be achieved through elongation of the buffer range of the catalytic residues; such perturbations in the protonation equilibria are, in turn, achieved through enhanced coupling of the protonation equilibria of the active ionizable residues with those of other ionizable residues. The strongest coupling between protonation states for a pair of residues that deprotonate to form an anion (or a pair that accept a proton to form a cation) is achieved when the difference in the intrinsic pKas of the two residues is approximately within 1 pH unit. Thus, catalytic aspartates and glutamates are often coupled to nearby acidic residues. For an anion-forming residue coupled to a cation-forming residue, the elongated buffer range is achieved when the intrinsic pKa of the anion-forming residue is higher than the intrinsic pKa of the (conjugate acid of the) cation-forming residue. Therefore, the high pKa, anion-forming residues tyrosine and cysteine make good coupling partners for catalytic lysine residues. For the anion-cation pairs, the optimum difference in intrinsic pKas is a function of the energy of interaction between the residues. For the energy of interaction ε expressed in units of (ln 10)RT, the optimum difference in intrinsic pKas is within â¼1 pH unit of ε.
Assuntos
Aminoácidos/química , Glicosídeo Hidrolases/química , Aminoácidos/metabolismo , Biocatálise , Glicosídeo Hidrolases/metabolismo , Concentração de Íons de Hidrogênio , Eletricidade EstáticaRESUMO
The crystal structures of an unliganded and adenosine 5'-monophosphate (AMP) bound, metal-dependent phosphoesterase (YP_910028.1) from Bifidobacterium adolescentis are reported at 2.4 and 1.94 Å, respectively. Functional characterization of this enzyme was guided by computational analysis and then confirmed by experiment. The structure consists of a polymerase and histidinol phosphatase (PHP, Pfam: PF02811) domain with a second domain (residues 105-178) inserted in the middle of the PHP sequence. The insert domain functions in binding AMP, but the precise function and substrate specificity of this domain are unknown. Initial bioinformatics analyses yielded multiple potential functional leads, with most of them suggesting DNA polymerase or DNA replication activity. Phylogenetic analysis indicated a potential DNA polymerase function that was somewhat supported by global structural comparisons identifying the closest structural match to the alpha subunit of DNA polymerase III. However, several other functional predictions, including phosphoesterase, could not be excluded. Theoretical microscopic anomalous titration curve shapes, a computational method for the prediction of active sites from protein 3D structures, identified potential reactive residues in YP_910028.1. Further analysis of the predicted active site and local comparison with its closest structure matches strongly suggested phosphoesterase activity, which was confirmed experimentally. Primer extension assays on both normal and mismatched DNA show neither extension nor degradation and provide evidence that YP_910028.1 has neither DNA polymerase activity nor DNA-proofreading activity. These results suggest that many of the sequence neighbors previously annotated as having DNA polymerase activity may actually be misannotated.
Assuntos
Proteínas de Bactérias/química , Proteínas de Bactérias/metabolismo , Bifidobacterium/enzimologia , Esterases/química , Esterases/metabolismo , 4-Nitrofenilfosfatase/química , 4-Nitrofenilfosfatase/metabolismo , Difosfato de Adenosina/química , Difosfato de Adenosina/metabolismo , Sequência de Aminoácidos , Sítios de Ligação , Domínio Catalítico , Simulação por Computador , Cristalografia , DNA Polimerase III/química , DNA Polimerase III/metabolismo , Histidinol-Fosfatase/química , Histidinol-Fosfatase/metabolismo , Modelos Moleculares , Dados de Sequência Molecular , Filogenia , Reprodutibilidade dos Testes , Relação Estrutura-AtividadeRESUMO
Theoretical microscopic titration curves (THEMATICS) is a computational method for the identification of active sites in proteins through deviations in computed titration behavior of ionizable residues. While the sensitivity to catalytic sites is high, the previously reported sensitivity to catalytic residues was not as high, about 50%. Here THEMATICS is combined with support vector machines (SVM) to improve sensitivity for catalytic residue prediction from protein 3D structure alone. For a test set of 64 proteins taken from the Catalytic Site Atlas (CSA), the average recall rate for annotated catalytic residues is 61%; good precision is maintained selecting only 4% of all residues. The average false positive rate, using the CSA annotations is only 3.2%, far lower than other 3D-structure-based methods. THEMATICS-SVM returns higher precision, lower false positive rate, and better overall performance, compared with other 3D-structure-based methods. Comparison is also made with the latest machine learning methods that are based on both sequence alignments and 3D structures. For annotated sets of well-characterized enzymes, THEMATICS-SVM performance compares very favorably with methods that utilize sequence homology. However, since THEMATICS depends only on the 3D structure of the query protein, no decline in performance is expected when applied to novel folds, proteins with few sequence homologues, or even orphan sequences. An extension of the method to predict non-ionizable catalytic residues is also presented. THEMATICS-SVM predicts a local network of ionizable residues with strong interactions between protonation events; this appears to be a special feature of enzyme active sites.
Assuntos
Sítios de Ligação , Domínio Catalítico , Biologia Computacional/métodos , Enzimas/química , Proteínas/química , Software , Catálise , Modelos Moleculares , Conformação Proteica , Sensibilidade e EspecificidadeRESUMO
BACKGROUND: Methods are now available for the prediction of interaction sites in protein 3D structures. While many of these methods report high success rates for site prediction, often these predictions are not very selective and have low precision. Precision in site prediction is addressed using Theoretical Microscopic Titration Curves (THEMATICS), a simple computational method for the identification of active sites in enzymes. Recall and precision are measured and compared with other methods for the prediction of catalytic sites. RESULTS: Using a test set of 169 enzymes from the original Catalytic Residue Dataset (CatRes) it is shown that THEMATICS can deliver precise, localised site predictions. Furthermore, adjustment of the cut-off criteria can improve the recall rates for catalytic residues with only a small sacrifice in precision. Recall rates for CatRes/CSA annotated catalytic residues are 41.1%, 50.4%, and 54.2% for Z score cut-off values of 1.00, 0.99, and 0.98, respectively. The corresponding precision rates are 19.4%, 17.9%, and 16.4%. The success rate for catalytic sites is higher, with correct or partially correct predictions for 77.5%, 85.8%, and 88.2% of the enzymes in the test set, corresponding to the same respective Z score cut-offs, if only the CatRes annotations are used as the reference set. Incorporation of additional literature annotations into the reference set gives total success rates of 89.9%, 92.9%, and 94.1%, again for corresponding cut-off values of 1.00, 0.99, and 0.98. False positive rates for a 75-protein test set are 1.95%, 2.60%, and 3.12% for Z score cut-offs of 1.00, 0.99, and 0.98, respectively. CONCLUSION: With a preferred cut-off value of 0.99, THEMATICS achieves a high success rate of interaction site prediction, about 86% correct or partially correct using CatRes/CSA annotations only and about 93% with an expanded reference set. Success rates for catalytic residue prediction are similar to those of other structure-based methods, but with substantially better precision and lower false positive rates. THEMATICS performs well across the spectrum of E.C. classes. The method requires only the structure of the query protein as input. THEMATICS predictions may be obtained via the web from structures in PDB format at: http://pfweb.chem.neu.edu/thematics/submit.html.
Assuntos
Modelos Químicos , Estrutura Secundária de Proteína , Proteínas/química , Software , Sítios de Ligação/genética , Bases de Dados Factuais/estatística & dados numéricos , Valor Preditivo dos Testes , Estrutura Secundária de Proteína/genética , Proteínas/genética , Software/estatística & dados numéricosRESUMO
Theoretical Microscopic Titration Curves (THEMATICS) may be used to identify chemically important residues in active sites of enzymes by characteristic deviations from the normal, sigmoidal Henderson-Hasselbalch titration behavior. Clusters of such deviant residues in physical proximity constitute reliable predictors of the location of the active site. Originally the residues with deviant predicted behavior were identified by human observation of the computed titration curves. However, it is preferable to select the unusual residues by mathematically well-defined criteria, in order to reduce the chance of error, eliminate any possible biases, and substantially speed up the selection process. Here we present some simple statistical tests that constitute such selection criteria. The first derivatives of the predicted titration curves resemble distribution functions and are normalized. The moments of these first derivative functions are computed. It is shown that the third and fourth moments, measures of asymmetry and kurtosis, respectively, are good measures of the deviations from normal behavior. Results are presented for 44 different enzymes. Detailed results are given for 4 enzymes with 4 different types of chemistry: arginine kinase from Limulus polyphemus (horseshoe crab); beta-lactamase from Escherichia coli; glutamate racemase from Aquifex pyrophilus; and 3-isopropylmalate dehydrogenase from Thiobacillus ferrooxidans. The relationship between the statistical measures of nonsigmoidal behavior in the predicted titration curves and the catalytic activity of the residue is discussed.
Assuntos
Enzimas/química , Enzimas/metabolismo , Isomerases de Aminoácido/química , Isomerases de Aminoácido/metabolismo , Animais , Arginina Quinase/química , Arginina Quinase/metabolismo , Bactérias/enzimologia , Proteínas de Bactérias/química , Proteínas de Bactérias/metabolismo , Sítios de Ligação , Catálise , Escherichia coli/enzimologia , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Caranguejos Ferradura , Cinética , Microscopia/métodos , Modelos Estatísticos , beta-Lactamases/química , beta-Lactamases/metabolismoRESUMO
MOTIVATION: Identification of functional information for a protein from its three-dimensional (3D) structure is a major challenge in genomics. The power of theoretical microscopic titration curves (THEMATICS), when coupled with a statistical analysis, provides a method for high-throughput screening for identification of catalytic sites and binding sites with high accuracy and precision. The method requires only the 3D structure of the query protein as input, but it performs as well as other methods that depend on sequence alignments and structural similarities.