ABSTRACT
Functionally analogous enzymes are those that catalyze similar reactions on similar substrates but do not share common ancestry, providing a window on the different structural strategies nature has used to evolve required catalysts. Identification and use of this information to improve reaction classification and computational annotation of enzymes newly discovered in the genome projects would benefit from systematic determination of reaction similarities. Here, we quantified similarity in bond changes for overall reactions and catalytic mechanisms for 95 pairs of functionally analogous enzymes (non-homologous enzymes with identical first three numbers of their EC codes) from the MACiE database. Similarity of overall reactions was computed by comparing the sets of bond changes in the transformations from substrates to products. For similarity of mechanisms, sets of bond changes occurring in each mechanistic step were compared; these similarities were then used to guide global and local alignments of mechanistic steps. Using this metric, only 44% of pairs of functionally analogous enzymes in the dataset had significantly similar overall reactions. For these enzymes, convergence to the same mechanism occurred in 33% of cases, with most pairs having at least one identical mechanistic step. Using our metric, overall reaction similarity serves as an upper bound for mechanistic similarity in functional analogs. For example, the four carbon-oxygen lyases acting on phosphates (EC 4.2.3) show neither significant overall reaction similarity nor significant mechanistic similarity. By contrast, the three carboxylic-ester hydrolases (EC 3.1.1) catalyze overall reactions with identical bond changes and have converged to almost identical mechanisms. The large proportion of enzyme pairs that do not show significant overall reaction similarity (56%) suggests that at least for the functionally analogous enzymes studied here, more stringent criteria could be used to refine definitions of EC sub-subclasses for improved discrimination in their classification of enzyme reactions. The results also indicate that mechanistic convergence of reaction steps is widespread, suggesting that quantitative measurement of mechanistic similarity can inform approaches for functional annotation.
Subject(s)
Databases, Protein , Enzymes/chemistry , Models, Chemical , Sequence Analysis, Protein/methods , Amino Acid Sequence , Binding Sites , Catalysis , Computer Simulation , Enzyme Activation , Molecular Sequence Data , Protein Binding , Structure-Activity RelationshipABSTRACT
We present a probabilistic data fusion framework that combines multiple computational approaches for drawing relationships between drugs and targets. The approach has special relevance to identifying surprising unintended biological targets of drugs. Comparisons between molecules are made based on 2D topological structural considerations, based on 3D surface characteristics, and based on English descriptions of clinical effects. Similarity computations within each modality were transformed into probability scores. Given a new molecule along with a set of molecules sharing some biological effect, a single score based on comparison to the known set is produced, reflecting either 2D similarity, 3D similarity, clinical effects similarity or their combination. The methods were validated within acurated structural pharmacology database (SPDB) and further tested by blind application to data derived from the ChEMBL database. For prediction of off-target effects, 3D-similarity performed best as a single modality, but combining all methods produced performance gains. Striking examples of structurally surprising off-target predictions are presented.
Subject(s)
Databases, Pharmaceutical/statistics & numerical data , Drug Repositioning/statistics & numerical data , Pharmaceutical Preparations/chemistry , Computational Biology , Data Interpretation, Statistical , Drug-Related Side Effects and Adverse Reactions , Humans , Models, Statistical , Molecular Conformation , Molecular Targeted Therapy/statistics & numerical dataABSTRACT
Drug structures may be quantitatively compared based on 2D topological structural considerations and based on 3D characteristics directly related to binding. A framework for combining multiple similarity computations is presented along with its systematic application to 358 drugs with overlapping pharmacology. Given a new molecule along with a set of molecules sharing some biological effect, a single score based on comparison to the known set is produced, reflecting either 2D similarity, 3D similarity, or their combination. For prediction of primary targets, the benefit of 3D over 2D was relatively small, but for prediction of off-targets, the added benefit was large. In addition to assessing prediction, the relationship between chemical similarity and pharmacological novelty was studied. Drug pairs that shared high 3D similarity but low 2D similarity (i.e., a novel scaffold) were shown to be much more likely to exhibit pharmacologically relevant differences in terms of specific protein target modulation.
Subject(s)
Computer Simulation , Databases, Factual , Drug Design , Models, Molecular , Pharmaceutical Preparations/chemistry , Proteins/chemistry , Binding Sites , Ligands , Quantitative Structure-Activity RelationshipABSTRACT
Circulating tumor cells (CTCs) hold promise for studying advanced prostate cancer. A functional collagen adhesion matrix (CAM) assay was used to enrich CTCs from prostate cancer patients' blood. CAM ingestion and epithelial immuno-staining identified CTCs, which were genotyped using oligonucleotide array comparative genomic hybridization. The highest CTC counts were observed in men with metastatic castration resistant prostate cancer (CRPC) compared to castration sensitive prostate cancer. Copy number profiles for CRPC CTCs were similar to paired solid tumor DNA, and distinct from corresponding DNA from the residual CAM-depleted blood. CAM CTC enrichment may allow cellular and genetic analyses in prostate cancer.