Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
1.
PLoS Comput Biol ; 18(2): e1009901, 2022 02.
Article in English | MEDLINE | ID: mdl-35202398

ABSTRACT

Studying similarities in protein molecules has become a fundamental activity in much of biology and biomedical research, for which methods such as multiple sequence alignments are widely used. Most methods available for such comparisons cater to studying proteins which have clearly recognizable evolutionary relationships but not to proteins that recognize the same or similar ligands but do not share similarities in their sequence or structural folds. In many cases, proteins in the latter class share structural similarities only in their binding sites. While several algorithms are available for comparing binding sites, there are none for deriving structural motifs of the binding sites, independent of the whole proteins. We report the development of SiteMotif, a new algorithm that compares binding sites from multiple proteins and derives sequence-order independent structural site motifs. We have tested the algorithm at multiple levels of complexity and demonstrate its performance in different scenarios. We have benchmarked against 3 current methods available for binding site comparison and demonstrate superior performance of our algorithm. We show that SiteMotif identifies new structural motifs of spatially conserved residues in proteins, even when there is no sequence or fold-level similarity. We expect SiteMotif to be useful for deriving key mechanistic insights into the mode of ligand interaction, predict the ligand type that a protein can bind and improve the sensitivity of functional annotation.


Subject(s)
Algorithms , Proteins , Binding Sites , Ligands , Protein Binding , Proteins/chemistry
2.
J Chem Inf Model ; 62(19): 4810-4819, 2022 10 10.
Article in English | MEDLINE | ID: mdl-36122166

ABSTRACT

Protein function is a direct consequence of its sequence, structure, and the arrangement at the binding site. Bioinformatics using sequence analysis is typically used to gain a first insight into protein function. Protein structures, on the other hand, provide a higher resolution platform into understanding functions. As the protein structural information is increasingly becoming available through experimental structure determination and through advances in computational methods for structure prediction, the opportunity to utilize these data is also increasing. Structural analysis of small molecule ligand binding sites in particular provides a direct and more accurate window to infer protein function. However, it remains a poorly utilized resource due to the huge computational cost of existing methods that make large-scale structural comparisons of binding sites prohibitive. Here, we present an algorithm called FLAPP that produces very rapid atomic level alignments. By combining clique matching in graphs and the power of modern CPU architectures, FLAPP aligns a typical pair of binding sites at ∼12.5 ms using a single CPU core, ∼1 ms using 12 cores on a standard desktop machine, and performs a PDB-wide scan in 1-2 min. We perform rigorous validation of the algorithm at multiple levels of complexity and show that FLAPP provides accurate alignments. We also present a case study involving vitamin B12 binding sites to showcase the usefulness of FLAPP for performing an exhaustive alignment-based PDB-wide scan. We expect that this tool will be invaluable to the scientific community to quickly align millions of site pairs on a normal desktop machine to gain insights into protein function and drug discovery for drug target and off-target identification and polypharmacology.


Subject(s)
Computational Biology , Proteins , Algorithms , Binding Sites , Computational Biology/methods , Ligands , Proteins/chemistry , Vitamin B 12
3.
J Biol Chem ; 293(49): 19148-19156, 2018 12 07.
Article in English | MEDLINE | ID: mdl-30309984

ABSTRACT

About 1 billion years ago, in a single-celled holozoan ancestor of all animals, a gene fusion of two tRNA synthetases formed the bifunctional enzyme, glutamyl-prolyl-tRNA synthetase (EPRS). We propose here that a confluence of metabolic, biochemical, and environmental factors contributed to the specific fusion of glutamyl- (ERS) and prolyl- (PRS) tRNA synthetases. To test this idea, we developed a mathematical model that centers on the precursor-product relationship of glutamic acid and proline, as well as metabolic constraints on free glutamic acid availability near the time of the fusion event. Our findings indicate that proline content increased in the proteome during the emergence of animals, thereby increasing demand for free proline. Together, these constraints contributed to a marked cellular depletion of glutamic acid and its products, with potentially catastrophic consequences. In response, an ancient organism invented an elegant solution in which genes encoding ERS and PRS fused to form EPRS, forcing coexpression of the two enzymes and preventing lethal dysregulation. The substantial evolutionary advantage of this coregulatory mechanism is evidenced by the persistence of EPRS in nearly all extant animals.


Subject(s)
Amino Acyl-tRNA Synthetases/chemistry , Bacterial Proteins/chemistry , Evolution, Molecular , Models, Chemical , Amino Acyl-tRNA Synthetases/genetics , Amino Acyl-tRNA Synthetases/metabolism , Animals , Bacteria/genetics , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Citric Acid Cycle , Gene Fusion , Glutamate-tRNA Ligase/chemistry , Glutamate-tRNA Ligase/genetics , Glutamate-tRNA Ligase/metabolism , Glutamic Acid/chemistry , Glutamic Acid/metabolism , Ketoglutaric Acids/chemistry , Ketoglutaric Acids/metabolism , Proline/chemistry , Proline/metabolism , Protein Biosynthesis/genetics
4.
Structure ; 32(3): 362-375.e4, 2024 Mar 07.
Article in English | MEDLINE | ID: mdl-38194962

ABSTRACT

While predicting a ligand that binds to a protein is feasible with current methods, the opposite, i.e., the prediction of a receptor for a ligand remains challenging. We present an approach for predicting receptors of a given ligand that uses de novo design and structural bioinformatics. We have developed the algorithm CRD, comprising multiple modules combining fragment-based sub-site finding, a machine learning function to estimate the size of the site, a genetic algorithm that encodes knowledge on protein structures and a physics-based fitness scoring scheme. CRD includes a pseudo-receptor design component followed by a mapping component to identify proteins that might contain these sites. CRD recovers the sites and receptors of several natural ligands. It designs similar sites for similar ligands, yet to some extent can distinguish between closely related ligands. CRD correctly predicts receptor classes for several drugs and might become a valuable tool for drug discovery.


Subject(s)
Algorithms , Proteins , Binding Sites , Protein Binding , Ligands , Proteins/chemistry , Drug Design
5.
Curr Res Struct Biol ; 6: 100108, 2023.
Article in English | MEDLINE | ID: mdl-38106461

ABSTRACT

S-adenosylmethionine (SAM) is a ubiquitous co-factor that serves as a donor for methylation reactions and additionally serves as a donor of other functional groups such as amino and ribosyl moieties in a variety of other biochemical reactions. Such versatility in function is enabled by the ability of SAM to be recognized by a wide variety of protein molecules that vary in their sequences and structural folds. To understand what gives rise to specific SAM binding in diverse proteins, we set out to study if there are any structural patterns at their binding sites. A comprehensive analysis of structures of the binding sites of SAM by all-pair comparison and clustering, indicated the presence of 4 different site-types, only one among them being well studied. For each site-type we decipher the common minimum principle involved in SAM recognition by diverse proteins and derive structural motifs that are characteristic of SAM binding. The presence of the structural motifs with precise three-dimensional arrangement of amino acids in SAM sites that appear to have evolved independently, indicates that these are winning arrangements of residues to bring about SAM recognition. Further, we find high similarity between one of the SAM site types and a well known ATP binding site type. We demonstrate using in vitro experiments that a known SAM binding protein, HpyAII.M1, a type 2 methyltransferase can bind and hydrolyse ATP. We find common structural motifs that explain this, further supported through site-directed mutagenesis. Observation of similar motifs for binding two of the most ubiquitous ligands in multiple protein families with diverse sequences and structural folds presents compelling evidence at the molecular level in favour of convergent evolution.

6.
Front Chem ; 8: 593497, 2020.
Article in English | MEDLINE | ID: mdl-33381491

ABSTRACT

Tuberculosis is one of the deadliest infectious diseases worldwide and the prevalence of latent tuberculosis acts as a huge roadblock in the global effort to eradicate tuberculosis. Most of the currently available anti-tubercular drugs act against the actively replicating form of Mycobacterium tuberculosis (Mtb), and are not effective against the non-replicating dormant form present in latent tuberculosis. With about 30% of the global population harboring latent tuberculosis and the requirement for prolonged treatment duration with the available drugs in such cases, the rate of adherence and successful completion of therapy is low. This necessitates the discovery of new drugs effective against latent tuberculosis. In this work, we have employed a combination of bioinformatics and chemoinformatics approaches to identify potential targets and lead candidates against latent tuberculosis. Our pipeline adopts transcriptome-integrated metabolic flux analysis combined with an analysis of a transcriptome-integrated protein-protein interaction network to identify perturbations in dormant Mtb which leads to a shortlist of 6 potential drug targets. We perform a further selection of the candidate targets and identify potential leads for 3 targets using a range of bioinformatics methods including structural modeling, binding site association and ligand fingerprint similarities. Put together, we identify potential new strategies for targeting latent tuberculosis, new candidate drug targets as well as important lead clues for drug design.

7.
Structure ; 26(3): 499-512.e2, 2018 03 06.
Article in English | MEDLINE | ID: mdl-29514079

ABSTRACT

Protein-ligand interactions form the basis of most cellular events. Identifying ligand binding pockets in proteins will greatly facilitate rationalizing and predicting protein function. Ligand binding sites are unknown for many proteins of known three-dimensional (3D) structure, creating a gap in our understanding of protein structure-function relationships. To bridge this gap, we detect pockets in proteins of known 3D structures, using computational techniques. This augmented pocketome (PocketDB) consists of 249,096 pockets, which is about seven times larger than what is currently known. We deduce possible ligand associations for about 46% of the newly identified pockets. The augmented pocketome, when subjected to clustering based on similarities among pockets, yielded 2,161 site types, which are associated with 1,037 ligand types, together providing fold-site-type-ligand-type associations. The PocketDB resource facilitates a structure-based function annotation, delineation of the structural basis of ligand recognition, and provides functional clues for domains of unknown functions, allosteric proteins, and druggable pockets.


Subject(s)
Computational Biology/methods , Proteins/chemistry , Algorithms , Binding Sites , Databases, Protein , Models, Molecular , Protein Binding , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL