Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
Nucleic Acids Res ; 42(Database issue): D336-46, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24271400

RESUMEN

ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains almost 30 million reliable models for domains in 4.7 million unique protein sequences. ModBase allows users to compute or update comparative models on demand, through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the AllosMod server for modeling ligand-induced protein dynamics (http://salilab.org/allosmod), the AllosMod-FoXS server for predicting a structural ensemble that fits an SAXS profile (http://salilab.org/allosmod-foxs), the FoXSDock server for protein-protein docking filtered by an SAXS profile (http://salilab.org/foxsdock), the SAXS Merge server for automatic merging of SAXS profiles (http://salilab.org/saxsmerge) and the Pose & Rank server for scoring protein-ligand complexes (http://salilab.org/poseandrank). In this update, we also highlight two applications of ModBase: a PSI:Biology initiative to maximize the structural coverage of the human alpha-helical transmembrane proteome and a determination of structural determinants of human immunodeficiency virus-1 protease specificity.


Asunto(s)
Bases de Datos de Proteínas , Modelos Moleculares , Homología Estructural de Proteína , Proteasa del VIH/química , Humanos , Internet , Proteínas de la Membrana/química , Anotación de Secuencia Molecular , Estructura Terciaria de Proteína , Proteoma/química , Dispersión del Ángulo Pequeño , Difracción de Rayos X
2.
Bioinformatics ; 29(24): 3158-66, 2013 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-24078704

RESUMEN

MOTIVATION: Statistical potentials have been widely used for modeling whole proteins and their parts (e.g. sidechains and loops) as well as interactions between proteins, nucleic acids and small molecules. Here, we formulate the statistical potentials entirely within a statistical framework, avoiding questionable statistical mechanical assumptions and approximations, including a definition of the reference state. RESULTS: We derive a general Bayesian framework for inferring statistically optimized atomic potentials (SOAP) in which the reference state is replaced with data-driven 'recovery' functions. Moreover, we restrain the relative orientation between two covalent bonds instead of a simple distance between two atoms, in an effort to capture orientation-dependent interactions such as hydrogen bonds. To demonstrate this general approach, we computed statistical potentials for protein-protein docking (SOAP-PP) and loop modeling (SOAP-Loop). For docking, a near-native model is within the top 10 scoring models in 40% of the PatchDock benchmark cases, compared with 23 and 27% for the state-of-the-art ZDOCK and FireDock scoring functions, respectively. Similarly, for modeling 12-residue loops in the PLOP benchmark, the average main-chain root mean square deviation of the best scored conformations by SOAP-Loop is 1.5 Å, close to the average root mean square deviation of the best sampled conformations (1.2 Å) and significantly better than that selected by Rosetta (2.1 Å), DFIRE (2.3 Å), DOPE (2.5 Å) and PLOP scoring functions (3.0 Å). Our Bayesian framework may also result in more accurate statistical potentials for additional modeling applications, thus affording better leverage of the experimentally determined protein structures. AVAILABILITY AND IMPLEMENTATION: SOAP-PP and SOAP-Loop are available as part of MODELLER (http://salilab.org/modeller).


Asunto(s)
Teorema de Bayes , Modelos Estadísticos , Proteínas/química , Programas Informáticos , Biología Computacional , Enlace de Hidrógeno , Simulación del Acoplamiento Molecular , Conformación Proteica , Dominios y Motivos de Interacción de Proteínas , Proteínas/metabolismo
3.
J Chem Inf Model ; 54(6): 1687-99, 2014 Jun 23.
Artículo en Inglés | MEDLINE | ID: mdl-24802635

RESUMEN

Enzymes in the glutathione transferase (GST) superfamily catalyze the conjugation of glutathione (GSH) to electrophilic substrates. As a consequence they are involved in a number of key biological processes, including protection of cells against chemical damage, steroid and prostaglandin biosynthesis, tyrosine catabolism, and cell apoptosis. Although virtual screening has been used widely to discover substrates by docking potential noncovalent ligands into active site clefts of enzymes, docking has been rarely constrained by a covalent bond between the enzyme and ligand. In this study, we investigate the accuracy of docking poses and substrate discovery in the GST superfamily, by docking 6738 potential ligands from the KEGG and MetaCyc compound libraries into 14 representative GST enzymes with known structures and substrates using the PLOP program [ Jacobson Proteins 2004 , 55 , 351 ]. For X-ray structures as receptors, one of the top 3 ranked models is within 3 Å all-atom root mean square deviation (RMSD) of the native complex in 11 of the 14 cases; the enrichment LogAUC value is better than random in all cases, and better than 25 in 7 of 11 cases. For comparative models as receptors, near-native ligand-enzyme configurations are often sampled but difficult to rank highly. For models based on templates with the highest sequence identity, the enrichment LogAUC is better than 25 in 5 of 11 cases, not significantly different from the crystal structures. In conclusion, we show that covalent docking can be a useful tool for substrate discovery and point out specific challenges for future method improvement.


Asunto(s)
Glutatión Transferasa/metabolismo , Simulación del Acoplamiento Molecular , Animales , Sitios de Unión , Dominio Catalítico , Cristalografía por Rayos X , Bases de Datos de Proteínas , Glutatión Transferasa/química , Humanos , Ligandos , Especificidad por Sustrato
4.
BMC Bioinformatics ; 9: 373, 2008 Sep 12.
Artículo en Inglés | MEDLINE | ID: mdl-18789148

RESUMEN

BACKGROUND: Simulating the major molecular events inside an Escherichia coli cell can lead to a very large number of reactions that compose its overall behaviour. Not only should the model be accurate, but it is imperative for the experimenter to create an efficient model to obtain the results in a timely fashion. Here, we show that for many parameter regimes, the effect of the host cell genome on the transcription of a gene from a plasmid-borne promoter is negligible, allowing one to simulate the system more efficiently by removing the computational load associated with representing the presence of the rest of the genome. The key parameter is the on-rate of RNAP binding to the promoter (k_on), and we compare the total number of transcripts produced from a plasmid vector generated as a function of this rate constant, for two versions of our gene expression model, one incorporating the host cell genome and one excluding it. By sweeping parameters, we identify the k_on range for which the difference between the genome and no-genome models drops below 5%, over a wide range of doubling times, mRNA degradation rates, plasmid copy numbers, and gene lengths. RESULTS: We assess the effect of the simulating the presence of the genome over a four-dimensional parameter space, considering: 24 min

Asunto(s)
Algoritmos , Mapeo Cromosómico/métodos , Escherichia coli/genética , Genoma Bacteriano/genética , Análisis de Secuencia de ADN/métodos , Factores de Transcripción/genética , Secuencia de Bases , Datos de Secuencia Molecular , Reproducibilidad de los Resultados , Sensibilidad y Especificidad
5.
Phys Rev E Stat Nonlin Soft Matter Phys ; 77(2 Pt 1): 021908, 2008 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-18352052

RESUMEN

Fluorescent proteins are frequently used as reporters for gene expression in living cells, either by being expressed in tandem with a protein of interest or through the creation of fusion proteins. The data yielded by the fluorescence output are of considerable interest in efforts to formulate quantitative models of cellular behavior underway in fields such as systems biology and synthetic biology. An often neglected aspect of these proteins, however, is their maturation: Before a fluorescent protein can generate a fluorescent signal, it must mature through a series of steps (folding, cyclization, and oxidation) that may take from many minutes to over a day. The presence of these maturation steps creates a distinction between the observed gene expression profile and the actual profile. We examine this effect through a simplified gene expression model and conclude that fluorescent protein maturation can have significant effects on estimates of both the mean protein levels and the variability in gene expression. The model shows that in many regimes, the observed variability will be increased by the maturation process, but indicates the existence of regimes in which the observed variability will actually be less than the true variability of the target protein. The latter effect arises from a low-pass filtering effect introduced by the chain of maturation reactions. The results suggest that the maturation of fluorescent proteins should be taken into account when using such proteins as quantitative indicators of gene expression levels.


Asunto(s)
Regulación de la Expresión Génica/fisiología , Expresión Génica/fisiología , Modelos Biológicos , Modelos Estadísticos , Biosíntesis de Proteínas/fisiología , Proteínas/metabolismo , Simulación por Computador , Proteínas/genética
6.
Phys Rev E Stat Nonlin Soft Matter Phys ; 77(2 Pt 1): 021919, 2008 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-18352063

RESUMEN

One aim of synthetic biology is to exert systematic control over cellular behavior, either for medical purposes or to "program" microorganisms. An engineering approach to the design of biological controllers demands a quantitative understanding of the dynamics of both the system to be controlled and the controllers themselves. Here we focus on a widely used method of exerting control in bacterial cells: plasmid vectors bearing gene-promoter pairs. We study two variants of the simplest such element, an unregulated promoter constitutively expressing its gene, against the varying genomic background of four Escherichia coli cell strains. Absolute protein numbers and rates of expression vary with both cell strain and plasmid type, as does the variability of expression across the population. Total variability is most strongly coupled to the cell division process, and after cell size is scaled away, plasmid copy number regulation emerges as a significant effect. We present simple models that capture the main features of the system behavior. Our results confirm that complex interactions between plasmids and their hosts can have significant effects on both expression and variability, even in deliberately simplified systems.


Asunto(s)
Proteínas de Escherichia coli/fisiología , Escherichia coli/fisiología , Expresión Génica/genética , Variación Genética/genética , Modelos Genéticos , Plásmidos/genética , Transfección/métodos , Simulación por Computador , Regulación Bacteriana de la Expresión Génica/fisiología
7.
PLoS One ; 13(11): e0206654, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30399156

RESUMEN

Accurate predictions of T-cell epitopes would be useful for designing vaccines, immunotherapies for cancer and autoimmune diseases, and improved protein therapies. The humoral immune response involves uptake of antigens by antigen presenting cells (APCs), APC processing and presentation of peptides on MHC class II (pMHCII), and T-cell receptor (TCR) recognition of pMHCII complexes. Most in silico methods predict only peptide-MHCII binding, resulting in significant over-prediction of CD4 T-cell epitopes. We present a method, ITCell, for prediction of T-cell epitopes within an input protein antigen sequence for given MHCII and TCR sequences. The method integrates information about three stages of the immune response pathway: antigen cleavage, MHCII presentation, and TCR recognition. First, antigen cleavage sites are predicted based on the cleavage profiles of cathepsins S, B, and H. Second, for each 12-mer peptide in the antigen sequence we predict whether it will bind to a given MHCII, based on the scores of modeled peptide-MHCII complexes. Third, we predict whether or not any of the top scoring peptide-MHCII complexes can bind to a given TCR, based on the scores of modeled ternary peptide-MHCII-TCR complexes and the distribution of predicted cleavage sites. Our benchmarks consist of epitope predictions generated by this algorithm, checked against 20 peptide-MHCII-TCR crystal structures, as well as epitope predictions for four peptide-MHCII-TCR complexes with known epitopes and TCR sequences but without crystal structures. ITCell successfully identified the correct epitopes as one of the 20 top scoring peptides for 22 of 24 benchmark cases. To validate the method using a clinically relevant application, we utilized five factor VIII-specific TCR sequences from hemophilia A subjects who developed an immune response to factor VIII replacement therapy. The known HLA-DR1-restricted factor VIII epitope was among the six top-scoring factor VIII peptides predicted by ITCall to bind HLA-DR1 and all five TCRs. Our integrative approach is more accurate than current single-stage epitope prediction algorithms applied to the same benchmarks. It is freely available as a web server (http://salilab.org/itcell).


Asunto(s)
Presentación de Antígeno , Antígenos/inmunología , Linfocitos T CD4-Positivos/inmunología , Epítopos de Linfocito T/inmunología , Antígenos de Histocompatibilidad Clase II/inmunología , Antígenos de Histocompatibilidad Clase II/metabolismo , Modelos Inmunológicos , Receptores de Antígenos de Linfocitos T/inmunología , Algoritmos , Antígenos/metabolismo , Catepsinas/metabolismo , Simulación por Computador , Factor VIII/inmunología , Hemofilia A/inmunología , Hemofilia A/terapia , Humanos , Estructura Terciaria de Proteína
8.
J Biol Phys ; 33(1): 67-95, 2007 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-19669554

RESUMEN

Biological systems often involve chemical reactions occurring in low-molecule-number regimes, where fluctuations are not negligible and thus stochastic models are required to capture the system behaviour. The resulting models are generally quite large and complex, involving many reactions and species. For clarity and computational tractability, it is important to be able to simplify these systems to equivalent ones involving fewer elements. While many model simplification approaches have been developed for deterministic systems, there has been limited work on applying these approaches to stochastic modelling. Here, we describe a method that reduces the complexity of stochastic biochemical network models, and apply this method to the reduction of a mammalian signalling cascade and a detailed model of the process of bacterial gene expression. Our results indicate that the simplified model gives an accurate representation for not only the average numbers of all species, but also for the associated fluctuations and statistical parameters.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA