Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Proc Natl Acad Sci U S A ; 120(44): e2306177120, 2023 Oct 31.
Artigo em Inglês | MEDLINE | ID: mdl-37871210

RESUMO

Lepidopterans affect crop production worldwide. The use of transgenes encoding insecticidal proteins from Bacillus thuringiensis (Bt) in crop plants is a well-established technology that enhances protection against lepidopteran larvae. Concern about widespread field-evolved resistance to Bt proteins has highlighted an urgent need for new insecticidal proteins with different modes or sites of action. We discovered a new family of insecticidal proteins from ferns. The prototype protein from Pteris species (Order Polypodiales) and variants from two other orders of ferns, Schizaeales and Ophioglossales, were effective against important lepidopteran pests of maize and soybean in diet-based assays. Transgenic maize and soybean plants producing these proteins were more resistant to insect damage than controls. We report here the crystal structure of a variant of the prototype protein to 1.98 Å resolution. Remarkably, despite being derived from plants, the structure resembles the 3-domain Cry proteins from Bt but has only two out of three of their characteristic domains, lacking the C-terminal domain which is typically required for their activities. Two of the fern proteins were effective against strains of fall armyworm that were resistant to Bt 3-domain Cry proteins Cry1Fa or Cry2A.127. This therefore represents a novel family of insecticidal proteins that have the potential to provide future tools for pest control.


Assuntos
Bacillus thuringiensis , Gleiquênias , Inseticidas , Traqueófitas , Animais , Inseticidas/metabolismo , Bacillus thuringiensis/genética , Bacillus thuringiensis/metabolismo , Plantas Geneticamente Modificadas/genética , Plantas Geneticamente Modificadas/metabolismo , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Controle Biológico de Vetores , Endotoxinas/genética , Endotoxinas/metabolismo , Proteínas Hemolisinas/genética , Proteínas Hemolisinas/metabolismo , Traqueófitas/metabolismo , Zea mays/metabolismo
2.
Nat Commun ; 14(1): 4171, 2023 07 13.
Artigo em Inglês | MEDLINE | ID: mdl-37443175

RESUMO

The broad adoption of transgenic crops has revolutionized agriculture. However, resistance to insecticidal proteins by agricultural pests poses a continuous challenge to maintaining crop productivity and new proteins are urgently needed to replace those utilized for existing transgenic traits. We identified an insecticidal membrane attack complex/perforin (MACPF) protein, Mpf2Ba1, with strong activity against the devastating coleopteran pest western corn rootworm (WCR) and a novel site of action. Using an integrative structural biology approach, we determined monomeric, pre-pore and pore structures, revealing changes between structural states at high resolution. We discovered an assembly inhibition mechanism, a molecular switch that activates pre-pore oligomerization upon gut fluid incubation and solved the highest resolution MACPF pore structure to-date. Our findings demonstrate not only the utility of Mpf2Ba1 in the development of biotechnology solutions for protecting maize from WCR to promote food security, but also uncover previously unknown mechanistic principles of bacterial MACPF assembly.


Assuntos
Besouros , Inseticidas , Animais , Inseticidas/farmacologia , Inseticidas/metabolismo , Zea mays/metabolismo , Besouros/fisiologia , Controle Biológico de Vetores , Plantas Geneticamente Modificadas/metabolismo , Animais Geneticamente Modificados , Perforina/metabolismo , Endotoxinas/metabolismo , Larva/metabolismo , Resistência a Inseticidas
3.
Biol Chem ; 393(3): 177-86, 2012 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-22718633

RESUMO

Cathepsin E splice variant 2 appears in a number of gastric carcinomas. Here we report detecting this variant in HeLa cells using polyclonal antibodies and biotinylated inhibitor pepstatin A. An overexpression of GFP fusion proteins of cathepsin E and its splice variant within HEK-293T cells was performed to show their localization. Their distribution under a fluorescence microscope showed that they are colocalized. We also expressed variants 1 and 2 of cathepsins E, with propeptide and without it, in Escherichia coli. After refolding from the inclusion bodies, the enzymatic activity and circular dichroism spectra of the splice variant 2 were compared to those of the wild-type mature active cathepsins E. While full-length cathepsin E variant 1 is activated at acid pH, the splice variant remains inactive. In contrast to the active cathepsin E, the splice variant 2 predominantly assumes ß-sheet structure, prone to oligomerization, at least under in vitro conditions, as shown by atomic force microscopy as shallow disk-like particles. A comparative structure model of splice variant 2 was computed based on its alignment to the known structure of cathepsin E intermediate (Protein Data Bank code 1TZS) and used to rationalize its conformational properties and loss of activity.


Assuntos
Catepsina E/química , Sequência de Aminoácidos , Catepsina E/genética , Catepsina E/metabolismo , Escherichia coli/genética , Expressão Gênica , Células HEK293 , Células HeLa , Humanos , Microscopia de Força Atômica , Modelos Moleculares , Dados de Sequência Molecular , Isoformas de Proteínas/química , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Alinhamento de Sequência
4.
J Struct Funct Genomics ; 10(4): 269-80, 2009 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-19760129

RESUMO

Membrane proteins serve as cellular gatekeepers, regulators, and sensors. Prior studies have explored the functional breadth and evolution of proteins and families of particular interest, such as the diversity of transport-associated membrane protein families in prokaryotes and eukaryotes, the composition of integral membrane proteins, and family classification of all human G-protein coupled receptors. However, a comprehensive analysis of the content and evolutionary associations between membrane proteins and families in a diverse set of genomes is lacking. Here, a membrane protein annotation pipeline was developed to define the integral membrane genome and associations between 21,379 proteins from 34 genomes; most, but not all of these proteins belong to 598 defined families. The pipeline was used to provide target input for a structural genomics project that successfully cloned, expressed, and purified 61 of our first 96 selected targets in yeast. Furthermore, the methodology was applied (1) to explore the evolutionary history of the substrate-binding transmembrane domains of the human ABC transporter superfamily, (2) to identify the multidrug resistance-associated membrane proteins in whole genomes, and (3) to identify putative new membrane protein families.


Assuntos
Transportadores de Cassetes de Ligação de ATP/genética , Resistência a Múltiplos Medicamentos/genética , Evolução Molecular , Genoma Humano/genética , Proteínas de Membrana/genética , Receptores Acoplados a Proteínas G/genética , Animais , Genômica/métodos , Humanos , Estrutura Secundária de Proteína/genética
5.
Protein Eng Des Sel ; 22(9): 569-74, 2009 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-19587024

RESUMO

Comparing the structures of proteins is crucial to gaining insight into protein evolution and function. Here, we align the sequences of multiple protein structures by a dynamic programming optimization of a scoring function that is a sum of an affine gap penalty and terms dependent on various sequence and structure features (SALIGN). The features include amino acid residue type, residue position, residue accessible surface area, residue secondary structure state and the conformation of a short segment centered on the residue. The multiple alignment is built by following the 'guide' tree constructed from the matrix of all pairwise protein alignment scores. Importantly, the method does not depend on the exact values of various parameters, such as feature weights and gap penalties, because the optimal alignment across a range of parameter values is found. Using multiple structure alignments in the HOMSTRAD database, SALIGN was benchmarked against MUSTANG for multiple alignments as well as against TM-align and CE for pairwise alignments. On the average, SALIGN produces a 15% improvement in structural overlap over HOMSTRAD and 14% over MUSTANG, and yields more equivalent structural positions than TM-align and CE in 90% and 95% of cases, respectively. The utility of accurate multiple structure alignment is illustrated by its application to comparative protein structure modeling.


Assuntos
Proteínas/química , Proteínas/genética , Alinhamento de Sequência/métodos , Algoritmos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Conformação Proteica , Análise de Sequência de Proteína
6.
PLoS Negl Trop Dis ; 3(4): e418, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19381286

RESUMO

BACKGROUND: Conventional patent-based drug development incentives work badly for the developing world, where commercial markets are usually small to non-existent. For this reason, the past decade has seen extensive experimentation with alternative R&D institutions ranging from private-public partnerships to development prizes. Despite extensive discussion, however, one of the most promising avenues-open source drug discovery-has remained elusive. We argue that the stumbling block has been the absence of a critical mass of preexisting work that volunteers can improve through a series of granular contributions. Historically, open source software collaborations have almost never succeeded without such "kernels". METHODOLOGY/PRINCIPAL FINDINGS: HERE, WE USE A COMPUTATIONAL PIPELINE FOR: (i) comparative structure modeling of target proteins, (ii) predicting the localization of ligand binding sites on their surfaces, and (iii) assessing the similarity of the predicted ligands to known drugs. Our kernel currently contains 143 and 297 protein targets from ten pathogen genomes that are predicted to bind a known drug or a molecule similar to a known drug, respectively. The kernel provides a source of potential drug targets and drug candidates around which an online open source community can nucleate. Using NMR spectroscopy, we have experimentally tested our predictions for two of these targets, confirming one and invalidating the other. CONCLUSIONS/SIGNIFICANCE: The TDI kernel, which is being offered under the Creative Commons attribution share-alike license for free and unrestricted use, can be accessed on the World Wide Web at http://www.tropicaldisease.org. We hope that the kernel will facilitate collaborative efforts towards the discovery of new drugs against parasites that cause tropical diseases.


Assuntos
Antiparasitários/química , Antiparasitários/farmacologia , Descoberta de Drogas/métodos , Software , Medicina Tropical/métodos , Simulação por Computador , Humanos , Modelos Moleculares
8.
J Struct Funct Genomics ; 10(2): 107-25, 2009 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-19219566

RESUMO

To study the substrate specificity of enzymes, we use the amidohydrolase and enolase superfamilies as model systems; members of these superfamilies share a common TIM barrel fold and catalyze a wide range of chemical reactions. Here, we describe a collaboration between the Enzyme Specificity Consortium (ENSPEC) and the New York SGX Research Center for Structural Genomics (NYSGXRC) that aims to maximize the structural coverage of the amidohydrolase and enolase superfamilies. Using sequence- and structure-based protein comparisons, we first selected 535 target proteins from a variety of genomes for high-throughput structure determination by X-ray crystallography; 63 of these targets were not previously annotated as superfamily members. To date, 20 unique amidohydrolase and 41 unique enolase structures have been determined, increasing the fraction of sequences in the two superfamilies that can be modeled based on at least 30% sequence identity from 45% to 73%. We present case studies of proteins related to uronate isomerase (an amidohydrolase superfamily member) and mandelate racemase (an enolase superfamily member), to illustrate how this structure-focused approach can be used to generate hypotheses about sequence-structure-function relationships.


Assuntos
Amidoidrolases/química , Biologia Computacional/métodos , Genômica/métodos , Fosfopiruvato Hidratase/química , Sítios de Ligação , Bases de Dados de Proteínas , Conformação Proteica , Dobramento de Proteína , Especificidade por Substrato
9.
Nucleic Acids Res ; 37(Database issue): D347-54, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18948282

RESUMO

MODBASE (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by MODPIPE, an automated modeling pipeline that relies primarily on MODELLER for fold assignment, sequence-structure alignment, model building and model assessment (http:/salilab.org/modeller). MODBASE currently contains 5,152,695 reliable models for domains in 1,593,209 unique protein sequences; only models based on statistically significant alignments and/or models assessed to have the correct fold are included. MODBASE also allows users to calculate comparative models on demand, through an interface to the MODWEB modeling server (http://salilab.org/modweb). Other resources integrated with MODBASE include databases of multiple protein structure alignments (DBAli), structurally defined ligand binding sites (LIGBASE), predicted ligand binding sites (AnnoLyze), structurally defined binary domain interfaces (PIBASE) and annotated single nucleotide polymorphisms and somatic mutations found in human proteins (LS-SNP, LS-Mut). MODBASE models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/).


Assuntos
Bases de Dados de Proteínas , Modelos Moleculares , Estrutura Terciária de Proteína , Homologia Estrutural de Proteína , Genômica , Humanos , Ligantes , Mutação , Polimorfismo de Nucleotídeo Único , Dobramento de Proteína , Domínios e Motivos de Interação entre Proteínas , Proteínas/genética , Interface Usuário-Computador
10.
Protein Sci ; 17(11): 1881-93, 2008 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-18832340

RESUMO

Comparative structure models are available for two orders of magnitude more protein sequences than are experimentally determined structures. These models, however, suffer from two limitations that experimentally determined structures do not: They frequently contain significant errors, and their accuracy cannot be readily assessed. We have addressed the latter limitation by developing a protocol optimized specifically for predicting the Calpha root-mean-squared deviation (RMSD) and native overlap (NO3.5A) errors of a model in the absence of its native structure. In contrast to most traditional assessment scores that merely predict one model is more accurate than others, this approach quantifies the error in an absolute sense, thus helping to determine whether or not the model is suitable for intended applications. The assessment relies on a model-specific scoring function constructed by a support vector machine. This regression optimizes the weights of up to nine features, including various sequence similarity measures and statistical potentials, extracted from a tailored training set of models unique to the model being assessed: If possible, we use similarly sized models with the same fold; otherwise, we use similarly sized models with the same secondary structure composition. This protocol predicts the RMSD and NO3.5A errors for a diverse set of 580,317 comparative models of 6174 sequences with correlation coefficients (r) of 0.84 and 0.86, respectively, to the actual errors. This scoring function achieves the best correlation compared to 13 other tested assessment criteria that achieved correlations ranging from 0.35 to 0.71.


Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Proteínas/química , Sequência de Aminoácidos , Bases de Dados de Proteínas , Dobramento de Proteína , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Análise de Sequência de Proteína , Software
11.
Methods Mol Biol ; 426: 145-59, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18542861

RESUMO

Genome sequencing projects have resulted in a rapid increase in the number of known protein sequences. In contrast, only about one-hundredth of these sequences have been characterized using experimental structure determination methods. Computational protein structure modeling techniques have the potential to bridge this sequence-structure gap. This chapter presents an example that illustrates the use of MODELLER to construct a comparative model for a protein with unknown structure. Automation of similar protocols (correction of protcols) has resulted in models of useful accuracy for domains in more than half of all known protein sequences.


Assuntos
Modelos Moleculares , Conformação Proteica , Proteínas/química , Proteínas/ultraestrutura , Software , Sequência de Aminoácidos , Dados de Sequência Molecular , Dobramento de Proteína , Homologia de Sequência de Aminoácidos
12.
BMC Bioinformatics ; 9: 249, 2008 May 27.
Artigo em Inglês | MEDLINE | ID: mdl-18505562

RESUMO

BACKGROUND: A number of studies have used protein interaction data alone for protein function prediction. Here, we introduce a computational approach for annotation of enzymes, based on the observation that similar protein sequences are more likely to perform the same function if they share similar interacting partners. RESULTS: The method has been tested against the PSI-BLAST program using a set of 3,890 protein sequences from which interaction data was available. For protein sequences that align with at least 40% sequence identity to a known enzyme, the specificity of our method in predicting the first three EC digits increased from 80% to 90% at 80% coverage when compared to PSI-BLAST. CONCLUSION: Our method can also be used in proteins for which homologous sequences with known interacting partners can be detected. Thus, our method could increase 10% the specificity of genome-wide enzyme predictions based on sequence matching by PSI-BLAST alone.


Assuntos
Enzimas/metabolismo , Homologia de Sequência de Aminoácidos , Software , Sequência de Aminoácidos/fisiologia , Bases de Dados de Proteínas , Enzimas/análise , Lógica Fuzzy , Reconhecimento Automatizado de Padrão , Valor Preditivo dos Testes , Mapeamento de Interação de Proteínas , Proteínas/análise , Proteínas/metabolismo , Alinhamento de Sequência , Análise de Sequência de Proteína , Relação Estrutura-Atividade , Especificidade por Substrato/genética
13.
Structure ; 16(4): 535-48, 2008 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-18400176

RESUMO

In this paper, we present a structure of the mammalian ribosome determined at approximately 8.7 A resolution by electron cryomicroscopy and single-particle methods. A model of the ribosome was created by docking homology models of subunit rRNAs and conserved proteins into the density map. We then modeled expansion segments in the subunit rRNAs and found unclaimed density for approximately 20 proteins. In general, many conserved proteins and novel proteins interact with expansion segments to form an integrated framework that may stabilize the mature ribosome. Our structure provides a snapshot of the mammalian ribosome at the beginning of translation and lends support to current models in which large movements of the small subunit and L1 stalk occur during tRNA translocation. Finally, details are presented for intersubunit bridges that are specific to the eukaryotic ribosome. We suggest that these bridges may help reset the conformation of the ribosome to prepare for the next cycle of chain elongation.


Assuntos
Modelos Moleculares , RNA Ribossômico/química , Proteínas Ribossômicas/química , Subunidades Ribossômicas Menores de Eucariotos/química , Ribossomos/química , Animais , Microscopia Crioeletrônica , Cães , Processamento de Imagem Assistida por Computador , Biossíntese de Proteínas , RNA de Transferência/química , Receptores de Quinase C Ativada , Receptores de Superfície Celular/química
14.
Protein Sci ; 16(12): 2585-96, 2007 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-17965183

RESUMO

Pathogens have evolved numerous strategies to infect their hosts, while hosts have evolved immune responses and other defenses to these foreign challenges. The vast majority of host-pathogen interactions involve protein-protein recognition, yet our current understanding of these interactions is limited. Here, we present and apply a computational whole-genome protocol that generates testable predictions of host-pathogen protein interactions. The protocol first scans the host and pathogen genomes for proteins with similarity to known protein complexes, then assesses these putative interactions, using structure if available, and, finally, filters the remaining interactions using biological context, such as the stage-specific expression of pathogen proteins and tissue expression of host proteins. The technique was applied to 10 pathogens, including species of Mycobacterium, apicomplexa, and kinetoplastida, responsible for "neglected" human diseases. The method was assessed by (1) comparison to a set of known host-pathogen interactions, (2) comparison to gene expression and essentiality data describing host and pathogen genes involved in infection, and (3) analysis of the functional properties of the human proteins predicted to interact with pathogen proteins, demonstrating an enrichment for functionally relevant host-pathogen interactions. We present several specific predictions that warrant experimental follow-up, including interactions from previously characterized mechanisms, such as cytoadhesion and protease inhibition, as well as suspected interactions in hypothesized networks, such as apoptotic pathways. Our computational method provides a means to mine whole-genome data and is complementary to experimental efforts in elucidating networks of host-pathogen protein interactions.


Assuntos
Proteínas de Bactérias/metabolismo , Biologia Computacional , Interações Hospedeiro-Patógeno , Mapeamento de Interação de Proteínas , Proteínas/química , Proteínas/metabolismo , Proteínas de Protozoários/metabolismo , Algoritmos , Proteínas de Bactérias/química , Bases de Dados de Proteínas , Humanos , Mycobacterium/metabolismo , Ligação Proteica , Proteínas de Protozoários/química , Análise de Sequência de Proteína , Software
15.
Nucleic Acids Res ; 35(Web Server issue): W393-7, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17478513

RESUMO

The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools include (i) the DBAlit program that allows users to input the 3D coordinates of a protein structure for comparison by MAMMOTH against all chains in the PDB; (ii) the AnnoLite and AnnoLyze programs that annotate a target structure based on its stored relationships to other structures; (iii) the ModClus program that clusters structures by sequence and structure similarities; (iv) the ModDom program that identifies domains as recurrent structural fragments and (v) an implementation of the COMPARER method in the SALIGN command in MODELLER that creates a multiple structure alignment for a set of related protein structures. Thus, the DBAli tools, which are freely accessible via the World Wide Web at http://salilab.org/DBAli/, allow users to mine the protein structure space by establishing relationships between protein structures and their functions.


Assuntos
Algoritmos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Proteínas/química , Proteínas/metabolismo , Pseudomonas aeruginosa/metabolismo , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Software , Sequência de Aminoácidos , Interpretação Estatística de Dados , Internet , Dados de Sequência Molecular , Conformação Proteica , Proteínas/classificação , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade
16.
Curr Protoc Protein Sci ; Chapter 2: Unit 2.9, 2007 11.
Artigo em Inglês | MEDLINE | ID: mdl-18429317

RESUMO

Functional characterization of a protein sequence is a common goal in biology, and is usually facilitated by having an accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.


Assuntos
Modelos Moleculares , Conformação Proteica , Animais , Internet , L-Lactato Desidrogenase/química , Trichomonas vaginalis/enzimologia
17.
Proc Natl Acad Sci U S A ; 103(7): 2172-7, 2006 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-16461911

RESUMO

The nuclear pore complex (NPC) consists of multiple copies of approximately 30 different proteins [nucleoporins (nups)], forming a channel in the nuclear envelope that mediates macromolecular transport between the cytosol and the nucleus. With <5% of the nup residues currently available in experimentally determined structures, little is known about the detailed structure of the NPC. Here, we use a combined computational and biochemical approach to assign folds for approximately 95% of the residues in the yeast and vertebrate nups. These fold assignments suggest an underlying simplicity in the composition and modularity in the architecture of all eukaryotic NPCs. The simplicity in NPC composition is reflected in the presence of only eight fold types, with the three most frequent folds accounting for approximately 85% of the residues. The modularity in NPC architecture is reflected in its hierarchical and symmetrical organization that partitions the predicted nup folds into three groups: the transmembrane group containing transmembrane helices and a cadherin fold, the central scaffold group containing beta-propeller and alpha-solenoid folds, and the peripheral FG group containing predominantly the FG repeats and the coiled-coil fold. Moreover, similarities between structures in coated vesicles and those in the NPC support our prior hypothesis for their common evolutionary origin in a progenitor protocoatomer. The small number of predicted fold types in the NPC and their internal symmetries suggest that the bulk of the NPC structure has evolved through extensive motif and gene duplication from a simple precursor set of only a few proteins.


Assuntos
Complexo de Proteínas Formadoras de Poros Nucleares/química , Poro Nuclear/química , Proteínas de Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/metabolismo , Biologia Computacional , Evolução Molecular , Carioferinas/química , Dobramento de Proteína , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína
18.
Nucleic Acids Res ; 34(Database issue): D291-5, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16381869

RESUMO

MODBASE (http://salilab.org/modbase) is a database of annotated comparative protein structure models for all available protein sequences that can be matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on MODELLER for fold assignment, sequence-structure alignment, model building and model assessment (http:/salilab.org/modeller). MODBASE is updated regularly to reflect the growth in protein sequence and structure databases, and improvements in the software for calculating the models. MODBASE currently contains 3 094 524 reliable models for domains in 1 094 750 out of 1 817 889 unique protein sequences in the UniProt database (July 5, 2005); only models based on statistically significant alignments and models assessed to have the correct fold despite insignificant alignments are included. MODBASE also allows users to generate comparative models for proteins of interest with the automated modeling server MODWEB (http://salilab.org/modweb). Our other resources integrated with MODBASE include comprehensive databases of multiple protein structure alignments (DBAli, http://salilab.org/dbali), structurally defined ligand binding sites and structurally defined binary domain interfaces (PIBASE, http://salilab.org/pibase) as well as predictions of ligand binding sites, interactions between yeast proteins, and functional consequences of human nsSNPs (LS-SNP, http://salilab.org/LS-SNP).


Assuntos
Bases de Dados de Proteínas , Modelos Moleculares , Proteínas/química , Homologia Estrutural de Proteína , Sítios de Ligação , Humanos , Internet , Ligantes , Polimorfismo de Nucleotídeo Único , Estrutura Terciária de Proteína , Proteínas/genética , Proteínas/metabolismo , Software , Integração de Sistemas , Interface Usuário-Computador
19.
Curr Protoc Bioinformatics ; Chapter 5: Unit-5.6, 2006 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-18428767

RESUMO

Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.


Assuntos
Cristalografia/métodos , Modelos Químicos , Modelos Moleculares , Proteínas/química , Proteínas/ultraestrutura , Análise de Sequência de Proteína/métodos , Software , Algoritmos , Sequência de Aminoácidos , Simulação por Computador , Dados de Sequência Molecular , Conformação Proteica , Dobramento de Proteína
20.
Proc Natl Acad Sci U S A ; 102(20): 7151-6, 2005 May 17.
Artigo em Inglês | MEDLINE | ID: mdl-15883372

RESUMO

The function of an uncharacterized protein is usually inferred either from its homology to, or its interactions with, characterized proteins. Here, we use both sequence similarity and protein interactions to identify relationships between remotely related protein sequences. We rely on the fact that homologous sequences share similar interactions, and, therefore, the set of interacting partners of the partners of a given protein is enriched by its homologs. The approach was bench-marked by assigning the fold and functional family to test sequences of known structure. Specifically, we relied on 1,434 proteins with known folds, as defined in the Structural Classification of Proteins (SCOP) database, and with known interacting partners, as defined in the Database of Interacting Proteins (DIP). For this subset, the specificity of fold assignment was increased from 54% for position-specific iterative BLAST to 75% for our approach, with a concomitant increase in sensitivity for a few percentage points. Similarly, the specificity of family assignment at the e-value threshold of 10(-8) was increased from 70% to 87%. The proposed method would be a useful tool for large-scale automated discovery of remote relationships between protein sequences, given its unique reliance on sequence similarity and protein-protein interactions.


Assuntos
Evolução Molecular , Proteínas/genética , Proteínas/metabolismo , Proteômica/métodos , Homologia de Sequência de Aminoácidos , Biologia Computacional , Bases de Dados de Proteínas , Conformação Proteica , Dobramento de Proteína
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...