Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
Add more filters











Publication year range
1.
Sci Rep ; 12(1): 9093, 2022 05 31.
Article in English | MEDLINE | ID: mdl-35641576

ABSTRACT

Lysin of bacteriophages isolated from a particular ecosystem could be inducted as a bio-controlling tool against the inhabiting pathogenic bacterial strains. Our study aims at both experimental and computational characterization of the identical lysin gene product inherent in the genomes of two novel Myoviridae bacteriophages, Escherichia Phage C600M2 (GenBank accession number OK040807, Protein ID: UCJ01465) and Escherichia Phage CL1 (GenBank Genome accession number OK040806.1, Protein ID: UCJ01321) isolated from wastewater collected from the main water treatment plant in Qatar. The lysin protein, evinced to be a globular N-acetyl-muramidase with intrinsic "cd00737: endolysin_autolysin" domain, was further expressed and purified to be experimentally validated by turbidimetric assay for its utility as an anti-bacterial agent. Comprehensive computational analysis revealed that the scrutinized lysin protein shared 85-98% sequence identity with 61 bacteriophages, all native to wastewater allied environments. Despite varied Host Recognition Components encoded in their genomes, the similitude of lysins, suggests its apparent significance in host-pathogen interactions endemic to wastewater environment. The present study substantiates the identical lysin from Escherichia Phage C600M2 and Escherichia Phage CL1 as propitious "enzybiotic", a hybrid term to describe enzymes analogous to anti-biotics to combat antibiotic-resistant bacteria by in silico analysis and subsequent experimental validation.


Subject(s)
Bacteriophages , Water Purification , Ecosystem , N-Acetylmuramoyl-L-alanine Amidase/metabolism , Qatar , Wastewater
2.
Microbiol Resour Announc ; 11(1): e0109021, 2022 Jan 20.
Article in English | MEDLINE | ID: mdl-34989622

ABSTRACT

We report the genome sequences of Escherichia phage C600M2 (length, 88,162 bp; G+C content, 38.98%) and Escherichia phage CL1 (length, 87,820 bp; G+C content, 41.32%), which were isolated from a wastewater treatment plant in Qatar. Both Escherichia phage C600M2 and Escherichia phage CL1 genomes contain 128 protein-coding genes and 26 tRNAs.

3.
Sci Rep ; 11(1): 13452, 2021 06 29.
Article in English | MEDLINE | ID: mdl-34188094

ABSTRACT

The advent of personalised medicine promises a deeper understanding of mechanisms and therefore therapies. However, the connection between genomic sequences and clinical treatments is often unclear. We studied 50 breast cancer patients belonging to a population-cohort in the state of Qatar. From Sanger sequencing, we identified several new deleterious mutations in the estrogen receptor 1 gene (ESR1). The effect of these mutations on drug treatment in the protein target encoded by ESR1, namely the estrogen receptor, was achieved via rapid and accurate protein-ligand binding affinity interaction studies which were performed for the selected drugs and the natural ligand estrogen. Four nonsynonymous mutations in the ligand-binding domain were subjected to molecular dynamics simulation using absolute and relative binding free energy methods, leading to the ranking of the efficacy of six selected drugs for patients with the mutations. Our study shows that a personalised clinical decision system can be created by integrating an individual patient's genomic data at the molecular level within a computational pipeline which ranks the efficacy of binding of particular drugs to variant proteins.


Subject(s)
Antineoplastic Agents, Hormonal/pharmacology , Breast Neoplasms , Estrogen Receptor alpha/genetics , Mutation , Neoplasm Proteins/genetics , Precision Medicine , Breast Neoplasms/drug therapy , Breast Neoplasms/genetics , Female , Humans
4.
Environ Sci Technol ; 43(12): 4574-81, 2009 Jun 15.
Article in English | MEDLINE | ID: mdl-19603679

ABSTRACT

Examining global effects of toxins on gene expression profiles is proving to be a powerful method for toxicity assessment and for investigating mechanisms of toxicity. This study demonstrated the application of prokaryotic real-time gene expression profiling in Escherichia coli for toxicity assessment of environmental pollutants in water samples, by use of a cell-array library of 93 E. coli K12 strains with transcriptional green fluorescent protein (GFP) fusions covering most known stress response genes. The high-temporal-resolution gene expression data, for the first time, revealed complex and time-dependent transcriptional activities of various stress-associated genes in response to mercury and mitomycin (MMC) exposure and allowed for gene clustering analysis based on temporal response patterns. Compound-specific and distinctive gene expression profiles were obtained for MMC and mercury at different concentrations. MMC (genotoxin) induced not only the SOS response, which regulates DNA damage and repair, but also many other stress genes associated with drug resistance/sensitivity and chemical detoxification. A number of genes belonging to the P-type ATPase family and the MerR family were identified to be related to mercury resistance, among which zntA was found to be up-regulated at an increasing level as the mercury concentration increased. A mechanism-based evaluation of toxins based on real-time gene expression profiles promises, to be an efficient and informative method for toxicity assessment in environmental samples.


Subject(s)
Escherichia coli K12/drug effects , Escherichia coli Proteins/metabolism , Gene Expression Profiling , Gene Expression Regulation, Bacterial/physiology , Mercury/toxicity , Mitomycin/toxicity , Antibiotics, Antineoplastic/toxicity , Dose-Response Relationship, Drug , Environmental Monitoring , Environmental Pollutants/toxicity , Escherichia coli K12/genetics , Escherichia coli K12/metabolism , Escherichia coli Proteins/genetics , Sensitivity and Specificity
5.
BMC Genomics ; 10: 183, 2009 Apr 24.
Article in English | MEDLINE | ID: mdl-19393058

ABSTRACT

BACKGROUND: Embryonic genome activation (EGA) is a critical event for the preimplantation embryo, which is manifested by changes in chromatin structure, transcriptional machinery, expression of embryonic genes, and degradation of maternal transcripts. The objectives of this study were to determine transcript abundance of HMGN3a and SMARCAL1 in mature bovine oocytes and early bovine embryos, to perform comparative functional genomics analysis of these genes across mammals. RESULTS: New annotations of both HMGN3a and SMARCAL1 were submitted to the Bovine Genome Annotation Submission Database at BovineGenome.org. Careful analysis of the bovine SMARCAL1 consensus gene set for this protein (GLEAN_20241) showed that the NCBI protein contains sequencing errors, and that the actual bovine protein has a high degree of homology to the human protein. Our results showed that there was a high degree of structural conservation of HMGN3a and SMARCAL1 in the mammalian species studied. HMGN3a transcripts were present at similar levels in bovine matured oocytes and 2-4-cell embryos but at higher levels in 8-16-cell embryos, morulae and blastocysts. On the other hand, transcript levels of SMARCAL1 decreased throughout preimplantation development. CONCLUSION: The high levels of structural conservation of these proteins highlight the importance of chromatin remodeling in the regulation of gene expression, particularly during early mammalian embryonic development. The greater similarities of human and bovine HMGN3a and SMARCAL1 proteins may suggest the cow as a valuable model to study chromatin remodeling at the onset of mammalian development. Understanding the roles of chromatin remodeling proteins during embryonic development emphasizes the importance of epigenetics and could shed light on the underlying mechanisms of early mammalian development.


Subject(s)
DNA Helicases/genetics , Embryonic Development/genetics , Genomics , HMGN Proteins/genetics , Amino Acid Sequence , Animals , Blastocyst/physiology , Cattle/embryology , Cells, Cultured , Chromatin Assembly and Disassembly , Conserved Sequence , Gene Expression Regulation, Developmental , Humans , Models, Genetic , Models, Molecular , Molecular Sequence Data , Oocytes/physiology , Phylogeny , Protein Structure, Secondary , RNA, Messenger/metabolism , Sequence Alignment
6.
FEBS J ; 275(21): 5367-82, 2008 Nov.
Article in English | MEDLINE | ID: mdl-18959762

ABSTRACT

Gamma-tubulin belongs to the tubulin superfamily and plays an essential role in the nucleation of cellular microtubules. In the present study, we report the characterization of gamma-tubulin from the psychrophilic Antarctic ciliate Euplotes focardii. In this organism, gamma-tubulin is encoded by two genes, gamma-T1 and gamma-T2, that produce distinct isotypes. Comparison of the gamma-T1 and gamma-T2 primary sequences to a Euplotesgamma-tubulin consensus, derived from mesophilic (i.e. temperate) congeneric species, revealed the presence of numerous unique amino acid substitutions, particularly in gamma-T2. Structural models of gamma-T1 and gamma-T2, obtained using the 3D structure of human gamma-tubulin as a template, suggest that these substitutions are responsible for conformational and/or polarity differences located: (a) in the regions involved in longitudinal 'plus end' contacts; (b) in the T3 loop that participates in binding GTP; and (c) in the M loop that forms lateral interactions. Relative to gamma-T1, the gamma-T2 gene is amplified by approximately 18-fold in the macronuclear genome and is very strongly transcribed. Using confocal immunofluorescence microscopy, we found that the gamma-tubulins of E. focardii associate throughout the cell cycle with basal bodies of the non-motile dorsal cilia and of all of the cirri of the ventral surface (i.e. adoral membranelles, paraoral membrane, and frontoventral transverse, caudal and marginal cirri). By contrast, only gamma-T2 interacts with the centrosomes of the spindle during micronuclear mitosis. We also established that the gamma-T1 isotype associates only with basal bodies. Our results suggest that gamma-T1 and gamma-T2 perform different functions in the organization of the microtubule cytoskeleton of this protist and are consistent with the hypothesis that gamma-T1 and gamma-T2 have evolved sequence-based structural alterations that facilitate template nucleation of microtubules by the gamma-tubulin ring complex at cold temperatures.


Subject(s)
Cold Temperature , Cytoskeleton/chemistry , Euplotes/chemistry , Tubulin/physiology , Amino Acid Sequence , Animals , Ciliophora , Euplotes/ultrastructure , Microtubules/metabolism , Protein Conformation , Tubulin/chemistry , Tubulin/genetics
7.
PLoS Comput Biol ; 4(4): e1000066, 2008 Apr 25.
Article in English | MEDLINE | ID: mdl-18437203

ABSTRACT

Abasic (AP) sites in DNA arise through both endogenous and exogenous mechanisms. Since AP sites can prevent replication and transcription, the cell contains systems for their identification and repair. AP endonuclease (APEX1) cleaves the phosphodiester backbone 5' to the AP site. The cleavage, a key step in the base excision repair pathway, is followed by nucleotide insertion and removal of the downstream deoxyribose moiety, performed most often by DNA polymerase beta (pol-beta). While yeast two-hybrid studies and electrophoretic mobility shift assays provide evidence for interaction of APEX1 and pol-beta, the specifics remain obscure. We describe a theoretical study designed to predict detailed interacting surfaces between APEX1 and pol-beta based on published co-crystal structures of each enzyme bound to DNA. Several potentially interacting complexes were identified by sliding the protein molecules along DNA: two with pol-beta located downstream of APEX1 (3' to the damaged site) and three with pol-beta located upstream of APEX1 (5' to the damaged site). Molecular dynamics (MD) simulations, ensuring geometrical complementarity of interfaces, enabled us to predict interacting residues and calculate binding energies, which in two cases were sufficient (approximately -10.0 kcal/mol) to form a stable complex and in one case a weakly interacting complex. Analysis of interface behavior during MD simulation and visual inspection of interfaces allowed us to conclude that complexes with pol-beta at the 3'-side of APEX1 are those most likely to occur in vivo. Additional multiple sequence analyses of APEX1 and pol-beta in related organisms identified a set of correlated mutations of specific residues at the predicted interfaces. Based on these results, we propose that pol-beta in the open or closed conformation interacts and makes a stable interface with APEX1 bound to a cleaved abasic site on the 3' side. The method described here can be used for analysis in any DNA-metabolizing pathway where weak interactions are the principal mode of cross-talk among participants and co-crystal structures of the individual components are available.


Subject(s)
DNA Polymerase beta/chemistry , DNA Polymerase beta/ultrastructure , DNA-(Apurinic or Apyrimidinic Site) Lyase/chemistry , DNA-(Apurinic or Apyrimidinic Site) Lyase/ultrastructure , DNA/chemistry , DNA/ultrastructure , Models, Chemical , Models, Molecular , Binding Sites , Computer Simulation , Enzyme Activation , Nucleic Acid Conformation , Protein Binding , Protein Conformation
8.
Mol Cell ; 28(6): 1058-70, 2007 Dec 28.
Article in English | MEDLINE | ID: mdl-18158902

ABSTRACT

DinB is the only translesion Y family DNA polymerase conserved among bacteria, archaea, and eukaryotes. DinB and its orthologs possess a specialized lesion bypass function but also display potentially deleterious -1 frameshift mutagenic phenotypes when overproduced. We show that the DNA damage-inducible proteins UmuD(2) and RecA act in concert to modulate this mutagenic activity. Structural modeling suggests that the relatively open active site of DinB is enclosed by interaction with these proteins, thereby preventing the template bulging responsible for -1 frameshift mutagenesis. Intriguingly, residues that define the UmuD(2)-interacting surface on DinB statistically covary throughout evolution, suggesting a driving force for the maintenance of a regulatory protein-protein interaction at this site. Together, these observations indicate that proteins like RecA and UmuD(2) may be responsible for managing the mutagenic potential of DinB orthologs throughout evolution.


Subject(s)
DNA Polymerase beta/metabolism , DNA-Directed DNA Polymerase/metabolism , Escherichia coli Proteins/metabolism , Rec A Recombinases/metabolism , Amino Acid Sequence , Binding Sites/genetics , Blotting, Far-Western , DNA Polymerase beta/chemistry , DNA Polymerase beta/genetics , DNA-Directed DNA Polymerase/chemistry , DNA-Directed DNA Polymerase/genetics , Escherichia coli/enzymology , Escherichia coli/genetics , Escherichia coli/metabolism , Escherichia coli Proteins/chemistry , Escherichia coli Proteins/genetics , Models, Molecular , Molecular Sequence Data , Mutagenesis , Mutation , Protein Binding , Protein Structure, Tertiary , Rec A Recombinases/chemistry , Rec A Recombinases/genetics , Sequence Homology, Amino Acid
9.
BMC Struct Biol ; 7: 78, 2007 Nov 16.
Article in English | MEDLINE | ID: mdl-18005453

ABSTRACT

BACKGROUND: The majority of relations between proteins can be represented as a conventional sequential alignment. Nevertheless, unusual non-sequential alignments with different connectivity of the aligned fragments in compared proteins have been reported by many researchers. It is interesting to understand those non-sequential alignments; are they unique, sporadic cases or they occur frequently; do they belong to a few specific folds or spread among many different folds, as a common feature of protein structure. We present here a comprehensive large-scale study of non-sequential alignments between available protein structures in Protein Data Bank. RESULTS: The study has been conducted on a non-redundant set of 8,865 protein structures aligned with the aid of the TOPOFIT method. It has been estimated that between 17.4% and 35.2% of all alignments are non-sequential depending on variations in the parameters. Analysis of the data revealed that non-sequential relations between proteins do occur systematically and in large quantities. Various sizes and numbers of non-sequential fragments have been observed with all possible complexities of fragment rearrangements found for alignments consisting of up to 12 fragments. It has been found that non-sequential alignments are not limited to proteins of any particular fold and are present in more than two hundred of them. Moreover, many of them are found between proteins with different fold assignments. It has been shown that protein structure symmetry does not explain non-sequential alignments. Therefore, compelling evidences have been provided that non-sequential alignments between proteins are systematic and widespread across the protein universe. CONCLUSION: The phenomenon of the widespread occurrence of non-sequential alignments between proteins might represent a missing rule of protein structure organization. More detailed study of this phenomenon will enhance our understanding of protein stability, folding, and evolution.


Subject(s)
Protein Structure, Tertiary , Proteins/chemistry , Sequence Alignment , Sequence Analysis, Protein , Structural Homology, Protein , Animals , Databases, Protein , Humans , Proteins/genetics
10.
Nucleic Acids Res ; 35(Web Server issue): W384-92, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17537826

ABSTRACT

SNPs located within the open reading frame of a gene that result in an alteration in the amino acid sequence of the encoded protein [nonsynonymous SNPs (nsSNPs)] might directly or indirectly affect functionality of the protein, alone or in the interactions in a multi-protein complex, by increasing/decreasing the activity of the metabolic pathway. Understanding the functional consequences of such changes and drawing conclusions about the molecular basis of diseases, involves integrating information from multiple heterogeneous sources including sequence, structure data and pathway relations between proteins. The data from NCBI's SNP database (dbSNP), gene and protein databases from Entrez, protein structures from the PDB and pathway information from KEGG have all been cross referenced into the StSNP web server, in an effort to provide combined integrated, reports about nsSNPs. StSNP provides 'on the fly' comparative modeling of nsSNPs with links to metabolic pathway information, along with real-time visual comparative analysis of the modeled structures using the Friend software application. The use of metabolic pathways in StSNP allows a researcher to examine possible disease-related pathways associated with a particular nsSNP(s), and link the diseases with the current available molecular structure data. The server is publicly available at http://glinka.bio.neu.edu/StSNP/.


Subject(s)
Chromosome Mapping/methods , Computational Biology/methods , Databases, Genetic , Polymorphism, Single Nucleotide/genetics , Proteins/chemistry , Proteins/genetics , Sequence Analysis, DNA/methods , Sequence Analysis, Protein/methods , Software , Algorithms , Database Management Systems , Humans , Internet , Open Reading Frames/genetics , Protein Interaction Mapping/methods , Proteins/analysis , Sequence Alignment/methods
11.
Nucleic Acids Res ; 35(Database issue): D317-21, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17065464

ABSTRACT

TOPOFIT-DB (T-DB) is a public web-based database of protein structural alignments based on the TOPOFIT method, providing a comprehensive resource for comparative analysis of protein structure families. The TOPOFIT method is based on the discovery of a saturation point on the alignment curve (topomax point) which presents an ability to objectively identify a border between common and variable parts in a protein structural family, providing additional insight into protein comparison and functional annotation. TOPOFIT also effectively detects non-sequential relations between protein structures. T-DB provides users with the convenient ability to retrieve and analyze structural neighbors for a protein; do one-to-all calculation of a user provided structure against the entire current PDB release with T-Server, and pair-wise comparison using the TOPOFIT method through the T-Pair web page. All outputs are reported in various web-based tables and graphics, with automated viewing of the structure-sequence alignments in the Friend software package for complete, detailed analysis. T-DB presents researchers with the opportunity for comprehensive studies of the variability in proteins and is publicly available at http://mozart.bio.neu.edu/topofit/index.php.


Subject(s)
Databases, Protein , Structural Homology, Protein , Internet , Mathematical Computing , User-Computer Interface
12.
Bioinformatics ; 21(18): 3677-8, 2005 Sep 15.
Article in English | MEDLINE | ID: mdl-16076889

ABSTRACT

UNLABELLED: Friend is a bioinformatics application designed for simultaneous analysis and visualization of multiple structures and sequences of proteins and/or DNA/RNA. The application provides basic functionalities, such as structure visualization, with different rendering and coloring, sequence alignment and simple phylogeny analysis, along with a number of extended features to perform more complex analyses of sequence structure relationships, including structural alignment of proteins, investigation of specific interaction motifs, studies of protein-protein and protein-DNA interactions and protein super-families. It is also useful for functional annotation of proteins, protein modeling and protein folding studies. Friend provides three levels of usage: (1) an extensive GUI for a scientist with no programming experience, (2) a command line interface for scripting for a scientist with some programming experience and (3) the ability to extend Friend with user written libraries for an experienced programmer. The application is linked and communicates with local and remote sequence and structure databases. AVAILABILITY: http://mozart.bio.neu.edu/friend.


Subject(s)
Computational Biology/methods , Software , Computational Biology/instrumentation , Computer Graphics , DNA/chemistry , Database Management Systems , Databases, Genetic , Databases, Protein , Exons , Internet , Muramidase/chemistry , Nucleic Acid Conformation , Phylogeny , Protein Folding , Proteins , RNA/chemistry , Sequence Alignment , Sequence Analysis, Protein , User-Computer Interface
13.
J Bioinform Comput Biol ; 3(1): 127-43, 2005 Feb.
Article in English | MEDLINE | ID: mdl-15751116

ABSTRACT

THEMATICS (Theoretical Microscopic Titration Curves) is a simple, reliable computational predictor of the active sites of enzymes from structure. Our method, based on well-established Finite Difference Poisson-Boltzmann techniques, identifies the ionisable residues with anomalous predicted titration behavior. A cluster of two or more such perturbed residues is a very reliable predictor of the active site. The protein does not have to bear any resemblance in sequence or structure to any previously characterized protein, but the method does require the three-dimensional structure. We now present evidence that THEMATICS can also locate the active site in structures built by comparative modeling from similar structures. Results are given for a total of 21 sets of proteins, including 21 templates and 83 comparative model structures. Detailed results are presented for three sets of orthologous proteins (Triosephosphate isomerase, 6-Hydroxymethyl-7,8-dihydropterin pyrophosphokinase, and Aspartate aminotransferase) and for one set of human homologues of Aldose reductase with different functions. THEMATICS correctly locates the active site in the model structures. This suggests that the method can be applicable to a much larger set of proteins for which an experimentally determined structure is unavailable. With a few exceptions, the predicted active sites in the comparative model structures are similar to that of the corresponding template structure.


Subject(s)
Algorithms , Enzymes/chemistry , Models, Chemical , Models, Molecular , Sequence Analysis, Protein/methods , Software , Titrimetry/methods , Binding Sites , Enzyme Activation , Enzymes/analysis , Protein Binding , Structure-Activity Relationship
14.
Protein Sci ; 13(7): 1865-74, 2004 Jul.
Article in English | MEDLINE | ID: mdl-15215530

ABSTRACT

Similarity of protein structures has been analyzed using three-dimensional Delaunay triangulation patterns derived from the backbone representation. It has been found that structurally related proteins have a common spatial invariant part, a set of tetrahedrons, mathematically described as a common spatial subgraph volume of the three-dimensional contact graph derived from Delaunay tessellation (DT). Based on this property of protein structures, we present a novel common volume superimposition (TOPOFIT) method to produce structural alignments. Structural alignments usually evaluated by a number of equivalent (aligned) positions (N(e)) with corresponding root mean square deviation (RMSD). The superimposition of the DT patterns allows one to uniquely identify a maximal common number of equivalent residues in the structural alignment. In other words, TOPOFIT identifies a feature point on the RMSD N(e) curve, a topomax point, until which the topologies of two structures correspond to each other, including backbone and interresidue contacts, whereas the growing number of mismatches between the DT patterns occurs at larger RMSD (N(e)) after the topomax point. It has been found that the topomax point is present in all alignments from different protein structural classes; therefore, the TOPOFIT method identifies common, invariant structural parts between proteins. The alignments produced by the TOPOFIT method have a good correlation with alignments produced by other current methods. This novel method opens new opportunities for the comparative analysis of protein structures and for more detailed studies on understanding the molecular principles of tertiary structure organization and functionality. The TOPOFIT method also helps to detect conformational changes, topological differences in variable parts, which are particularly important for studies of variations in active/ binding sites and protein classification.


Subject(s)
Proteins/chemistry , Software , Structural Homology, Protein , Computer Graphics , Protein Structure, Tertiary
15.
Bioinformatics ; 20(11): 1801-3, 2004 Jul 22.
Article in English | MEDLINE | ID: mdl-14988102

ABSTRACT

UNLABELLED: Comparative analysis of exon/intron organization of genes and their resulting protein structures is important for understanding evolutionary relationships between species, rules of protein organization and protein functionality. We present Structural Exon Database (SEDB), with a Web interface, an application that allows users to retrieve the exon/intron organization of genes and map the location of the exon boundaries and the intron phase onto a multiple structural alignment. SEDB is linked with Friend, an integrated analytical multiple sequence/structure viewer, which allows simultaneous visualization of exon boundaries on structure and sequence alignments. With SEDB researchers can study the correlations of gene structure with the properties of the encoded three-dimensional protein structures across eukaryotic organisms. AVAILABILITY: SEDB is publicly available at http://glinka.bio.neu.edu/SEDB/SEDB.html SUPPLEMENTARY INFORMATION: On the SEDB Web site.


Subject(s)
Databases, Genetic , Exons/genetics , Proteins/chemistry , Proteins/genetics , Sequence Alignment/methods , Sequence Analysis, Protein/methods , User-Computer Interface , Internet , Models, Chemical , Online Systems , Protein Conformation
16.
Nucleic Acids Res ; 31(13): 3375-80, 2003 Jul 01.
Article in English | MEDLINE | ID: mdl-12824331

ABSTRACT

The following resources for comparative protein structure modeling and analysis are described (http://salilab.org): MODELLER, a program for comparative modeling by satisfaction of spatial restraints; MODWEB, a web server for automated comparative modeling that relies on PSI-BLAST, IMPALA and MODELLER; MODLOOP, a web server for automated loop modeling that relies on MODELLER; MOULDER, a CPU intensive protocol of MODWEB for building comparative models based on distant known structures; MODBASE, a comprehensive database of annotated comparative models for all sequences detectably related to a known structure; MODVIEW, a Netscape plugin for Linux that integrates viewing of multiple sequences and structures; and SNPWEB, a web server for structure-based prediction of the functional impact of a single amino acid substitution.


Subject(s)
Software , Structural Homology, Protein , Internet , Models, Molecular , Protein Folding , Proteins/chemistry , Reproducibility of Results , Sequence Alignment , Sequence Homology, Amino Acid , Systems Integration
17.
J Mol Biol ; 325(1): 39-63, 2003 Jan 03.
Article in English | MEDLINE | ID: mdl-12473451

ABSTRACT

Binding ATP to tryptophanyl-tRNA synthetase (TrpRS) in a catalytically competent configuration for amino acid activation destabilizes the enzyme structure prior to forming the transition state. This conclusion follows from monitoring the titration of TrpRS with ATP by small angle solution X-ray scattering, enzyme activity, and crystal structures. ATP induces a significantly smaller radius of gyration at pH=7 with a transition midpoint at approximately 8mM. A non-reciprocal dependence of Trp and ATP dissociation constants on concentrations of the second substrate show that Trp binding enhances affinity for ATP, while the affinity for Trp falls with the square of the [ATP] over the same concentration range ( approximately 5mM) that induces the more compact conformation. Two distinct TrpRS:ATP structures have been solved, a high-affinity complex grown with 1mM ATP and a low-affinity complex grown at 10mM ATP. The former is isomorphous with unliganded TrpRS and the Trp complex from monoclinic crystals. Reacting groups of the two individually-bound substrates are separated by 6.7A. Although it lacks tryptophan, the low-affinity complex has a closed conformation similar to that observed in the presence of both ATP and Trp analogs such as indolmycin, and resembles a complex previously postulated to form in the closely-related TyrRS upon induced-fit active-site assembly, just prior to catalysis. Titration of TrpRS with ATP therefore successively produces structurally distinct high- and low-affinity ATP-bound states. The higher quality X-ray data for the closed ATP complex (2.2A) provide new structural details likely related to catalysis, including an extension of the KMSKS loop that engages the second lysine and serine residues, K195 and S196, with the alpha and gamma-phosphates; interactions of the K111 side-chain with the gamma-phosphate; and a water molecule bridging the consensus sequence residue T15 to the beta-phosphate. Induced-fit therefore strengthens active-site interactions with ATP, substantially intensifying the interaction of the KMSKS loop with the leaving PP(i) group. Formation of this conformation in the absence of a Trp analog implies that ATP is a key allosteric effector for TrpRS. The paradoxical requirement for high [ATP] implies that Gibbs binding free energy is stored in an unfavorable protein conformation and can then be recovered for useful purposes, including catalysis in the case of TrpRS.


Subject(s)
Adenosine Triphosphate/metabolism , Tryptophan-tRNA Ligase/chemistry , Tryptophan-tRNA Ligase/metabolism , Adenosine Triphosphate/pharmacology , Allosteric Regulation/drug effects , Binding Sites , Catalysis , Crystallography, X-Ray , Dimerization , Diphosphates/metabolism , Geobacillus stearothermophilus/enzymology , Hydrogen-Ion Concentration , Kinetics , Ligands , Models, Molecular , Protein Binding , Protein Conformation/drug effects , Ribose/metabolism , Rotation , Solutions/chemistry , Static Electricity , Structure-Activity Relationship , Thermodynamics , Tryptophan/metabolism , Tryptophan/pharmacology
18.
Bioinformatics ; 19(1): 165-6, 2003 Jan.
Article in English | MEDLINE | ID: mdl-12499313

ABSTRACT

SUMMARY: We describe ModView, a web application for visualization of multiple protein sequences and structures. ModView integrates a multiple structure viewer, a multiple sequence alignment editor, and a database querying engine. It is possible to interactively manipulate hundreds of proteins, to visualize conservative and variable residues, active and binding sites, fragments, and domains in protein families, as well as to display large macromolecular complexes such as ribosomes or viruses. As a Netscape plug-in, ModView can be included in HTML pages along with text and figures, which makes it useful for teaching and presentations. ModView is also suitable as a graphical interface to various databases because it can be controlled through JavaScript commands and called from CGI scripts. AVAILABILITY: ModView is available at http://guitar.rockefeller.edu/modview.


Subject(s)
Database Management Systems , Proteins/chemistry , Sequence Alignment/methods , Sequence Analysis, Protein/methods , User-Computer Interface , Documentation/methods , Imaging, Three-Dimensional/methods , Protein Conformation , Proteins/genetics
20.
Bioinformatics ; 18(1): 200-1, 2002 Jan.
Article in English | MEDLINE | ID: mdl-11836232

ABSTRACT

A database comprising all ligand-binding sites of known structure aligned with all related protein sequences and structures is described. Currently, the database contains approximately 50000 ligand-binding sites for small molecules found in the Protein Data Bank (PDB). The structure-structure alignments are obtained by the Combinatorial Extension (CE) program (Shindyalov and Bourne, Protein Eng., 11, 739-747, 1998) and sequence-structure alignments are extracted from the ModBase database of comparative protein structure models for all known protein sequences (Sanchez et al., Nucleic Acids Res., 28, 250-253, 2000). It is possible to search for binding sites in LigBase by a variety of criteria. LigBase reports summarize ligand data including relevant structural information from the PDB file, such as ligand type and size, and contain links to all related protein sequences in the TrEMBL database. Residues in the binding sites are graphically depicted for comparison with other structurally defined family members. LigBase provides a resource for the analysis of families of related binding sites.


Subject(s)
Databases, Protein , Proteins/chemistry , Proteins/genetics , Binding Sites , Computational Biology , Internet , Ligands , Sequence Alignment/statistics & numerical data
SELECTION OF CITATIONS
SEARCH DETAIL