Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 114
Filtrar
1.
Proteins ; 72(1): 197-208, 2008 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-18214965

RESUMO

The FK506-binding proteins (FKBPs) are a unique group of chaperones found in a wide variety of organisms. They perform a number of cellular functions including protein folding, regulation of cytokines, transport of steroid receptor complexes, nucleic acid binding, histone assembly, and modulation of apoptosis. These functions are mediated by specific domains that adopt distinct tertiary conformations. Using the Threading/ASSEmbly/Refinement (TASSER) approach, tertiary structures were predicted for a total of 45 FKBPs in 23 species. These models were compared with previously characterized FKBP solution structures and the predicted structures were employed to identify groups of homologous proteins. The resulting classification may be utilized to infer functional roles of newly discovered FKBPs. The three-dimensional conformations revealed that this family may have undergone several modifications throughout evolution, including loss of N- and C-terminal regions, duplication of FKBP domains as well as insertions of entire functional motifs. Docking simulations suggest that additional sequence segments outside FKBP domains may modulate the binding affinity of FKBPs to immunosuppressive drugs. The docking models also indicate the presence of a helix-loop-helix (HLH) region within a subset of FKBPs, which may be responsible for the interaction between this group of proteins and nucleic acids.


Assuntos
Proteínas de Ligação a Tacrolimo/química , Proteínas de Ligação a Tacrolimo/classificação , Sequência de Aminoácidos , Animais , Sítios de Ligação , Bombyx , Simulação por Computador , Cristalografia por Raios X , Humanos , Dados de Sequência Molecular , Estrutura Terciária de Proteína , RNA/metabolismo , Homologia de Sequência de Aminoácidos , Proteínas de Ligação a Tacrolimo/metabolismo
2.
Curr Pharm Biotechnol ; 3(4): 329-47, 2002 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-12463416

RESUMO

Protein folding, the problem of how an amino acid sequence folds into a unique three-dimensional shape, has been a long-standing problem in biology. The success of genome-wide sequencing efforts has increased the interest in understanding the protein folding enigma, because realizing the value of the genomic sequences rests on the accuracy with which the encoded gene products are understood. Although a complete understanding of the kinetics and thermodynamics of protein folding has remained elusive, there has been considerable progress in techniques to predict protein structure from amino acid sequences. The prediction techniques fall into three general classes: comparative modeling, threading and ab initio folding. The current state of research in each of these three areas is reviewed here in detail. Efforts to apply each method to proteome-wide analysis are reviewed, and some of the key technical hurdles that remain are presented. Protein folding technologies, while not yet providing a full understanding of the protein folding process, have clearly progressed to the point of being useful in enabling structure-based annotation of genomic sequences.


Assuntos
Biologia Computacional/métodos , Dobramento de Proteína , Animais , Fenômenos Biofísicos , Biofísica , Biologia Computacional/tendências , Humanos
3.
J Struct Biol ; 134(2-3): 232-45, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11551182

RESUMO

In order to circumvent limitations of sequence based methods in the process of making functional predictions for proteins, we have developed a methodology that uses a sequence-to-structure-to-function paradigm. First, an approximate three-dimensional structure is predicted. Then, a three-dimensional descriptor of the functional site, termed a Fuzzy Functional Form, or FFF, is used to screen the structure for the presence of the functional site of interest (Fetrow et al., 1998; Fetrow and Skolnick, 1998). Previously, a disulfide oxidoreductase FFF was developed and applied to predicted structures obtained from a small structural database. Here, using a substantially larger structural database, we expand the analysis of the disulfide oxidoreductase FFF to the B. subtilis genome. To ascertain the performance of the FFF, its results are compared to those obtained using both the sequence alignment method BLAST and three local sequence motif databases: PRINTS, Prosite, and Blocks. The FFF method is then compared in detail to Blocks and it is shown that the FFF is more flexible and sensitive in finding a specific function in a set of unknown proteins. In addition, the estimated false positive rate of function prediction is significantly lower using the FFF structural motif, rather than the standard sequence motif methods. We also present a second FFF and describe a specific example of the results of its whole-genome application to D. melanogaster using a newer threading algorithm. Our results from all of these studies indicate that the addition of three-dimensional structural information adds significant value in the prediction of biochemical function of genomic sequences.


Assuntos
Proteínas/química , Proteínas/fisiologia , Algoritmos , Sequência de Aminoácidos , Animais , Bacillus subtilis/enzimologia , Bacillus subtilis/genética , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Proteínas de Bactérias/fisiologia , Drosophila melanogaster/enzimologia , Drosophila melanogaster/genética , Genoma , Genoma Bacteriano , Humanos , Proteínas de Insetos/genética , Proteínas de Insetos/fisiologia , Dados de Sequência Molecular , Proteína Dissulfeto Redutase (Glutationa)/química , Proteína Dissulfeto Redutase (Glutationa)/fisiologia , Proteínas Tirosina Fosfatases/química , Proteínas Tirosina Fosfatases/fisiologia , Proteínas/genética , Relação Estrutura-Atividade
4.
Biopolymers ; 59(5): 305-9, 2001 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-11514933

RESUMO

We introduce a new variant of the root mean square distance (RMSD) for comparing protein structures whose range of values is independent of protein size. This new dimensionless measure (relative RMSD, or RRMSD) is zero between identical structures and one between structures that are as globally dissimilar as an average pair of random polypeptides of respective sizes. The RRMSD probability distribution between random polypeptides converges to a universal curve as the chain length increases. The correlation coefficients between aligned random structures are computed as a function of polypeptide size showing two characteristic lengths of 4.7 and 37 residues. These lengths mark the separation between phases of different structural order between native protein fragments. The implications for threading are discussed.


Assuntos
Peptídeos/química , Proteínas/química , Modelos Estatísticos , Probabilidade
5.
Proc Natl Acad Sci U S A ; 98(18): 10125-30, 2001 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-11504922

RESUMO

The successful prediction of protein structure from amino acid sequence requires two features: an efficient conformational search algorithm and an energy function with a global minimum in the native state. As a step toward addressing both issues, a threading-based method of secondary and tertiary restraint prediction has been developed and applied to ab initio folding. Such restraints are derived by extracting consensus contacts and local secondary structure from at least weakly scoring structures that, in some cases, can lack any global similarity to the sequence of interest. Furthermore, to generate representative protein structures, a reduced lattice-based protein model is used with replica exchange Monte Carlo to explore conformational space. We report results on the application of this methodology, termed TOUCHSTONE, to 65 proteins whose lengths range from 39 to 146 residues. For 47 (40) proteins, a cluster centroid whose rms deviation from native is below 6.5 (5) A is found in one of the five lowest energy centroids. The number of correctly predicted proteins increases to 50 when atomic detail is added and a knowledge-based atomic potential is combined with clustered and nonclustered structures for candidate selection. The combination of the ratio of the relative number of contacts to the protein length and the number of clusters generated by the folding algorithm is a reliable indicator of the likelihood of successful fold prediction, thereby opening the way for genome-scale ab initio folding.


Assuntos
Simulação por Computador , Modelos Moleculares , Proteínas/química , Algoritmos , Método de Monte Carlo , Dobramento de Proteína , Estrutura Terciária de Proteína
6.
Proteins ; 44(3): 223-32, 2001 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-11455595

RESUMO

A heavy atom distance-dependent knowledge-based pairwise potential has been developed. This statistical potential is first evaluated and optimized with the native structure z-scores from gapless threading. The potential is then used to recognize the native and near-native structures from both published decoy test sets, as well as decoys obtained from our group's protein structure prediction program. In the gapless threading test, there is an average z-score improvement of 4 units in the optimized atomic potential over the residue-based quasichemical potential. Examination of the z-scores for individual pairwise distance shells indicates that the specificity for the native protein structure is greatest at pairwise distances of 3.5-6.5 A, i.e., in the first solvation shell. On applying the current atomic potential to test sets obtained from the web, composed of native protein and decoy structures, the current generation of the potential performs better than residue-based potentials as well as the other published atomic potentials in the task of selecting native and near-native structures. This newly developed potential is also applied to structures of varying quality generated by our group's protein structure prediction program. The current atomic potential tends to pick lower RMSD structures than do residue-based contact potentials. In particular, this atomic pairwise interaction potential has better selectivity especially for near-native structures. As such, it can be used to select near-native folds generated by structure prediction algorithms as well as for protein structure refinement.


Assuntos
Biologia Computacional/métodos , Simulação por Computador , Proteínas/química , Bases de Dados Factuais , Conformação Proteica , Software
7.
Proteins ; 44(2): 133-49, 2001 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-11391776

RESUMO

An improved generalized comparative modeling method, GENECOMP, for the refinement of threading models is developed and validated on the Fischer database of 68 probe-template pairs, a standard benchmark used to evaluate threading approaches. The basic idea is to perform ab initio folding using a lattice protein model, SICHO, near the template provided by the new threading algorithm PROSPECTOR. PROSPECTOR also provides predicted contacts and secondary structure for the template-aligned regions, and possibly for the unaligned regions by garnering additional information from other top-scoring threaded structures. Since the lowest-energy structure generated by the simulations is not necessarily the best structure, we employed two structure-selection protocols: distance geometry and clustering. In general, clustering is found to generate somewhat better quality structures in 38 of 68 cases. When applied to the Fischer database, the protocol does no harm and in a significant number of cases improves upon the initial threading model, sometimes dramatically. The procedure is readily automated and can be implemented on a genomic scale.


Assuntos
Técnicas de Química Combinatória/métodos , Modelos Moleculares , Proteínas/química , Alinhamento de Sequência/métodos , Algoritmos , Biologia Computacional/métodos , Simulação por Computador , Bases de Dados Factuais , Modelos Químicos , Método de Monte Carlo , Dobramento de Proteína
8.
Bioinformatics ; 17(5): 468-78, 2001 May.
Artigo em Inglês | MEDLINE | ID: mdl-11331242

RESUMO

MOTIVATION: Information about a particular protein or protein family is usually distributed among multiple databases and often in more than one entry in each database. Retrieval and organization of this information can be a laborious task. This task is complicated even further by the existence of alternative terms for the same concept. RESULTS: The PDB, SWISS-PROT, ENZYME, and CATH databases have been imported into a combined relational database, BIOMOLQUEST: A powerful search engine has been built using this database as a back end. The search engine achieves significant improvements in query performance by automatically utilizing cross-references between the legacy databases. The results of the queries are presented in an organized, hierarchical way.


Assuntos
Bases de Dados Factuais , Proteínas/química , Proteínas/fisiologia , Indexação e Redação de Resumos , Algoritmos , Biologia Computacional , Sistemas de Gerenciamento de Base de Dados , Software
9.
Protein Sci ; 10(5): 1005-14, 2001 May.
Artigo em Inglês | MEDLINE | ID: mdl-11316881

RESUMO

A function annotation method using the sequence-to-structure-to-function paradigm is applied to the identification of all disulfide oxidoreductases in the Saccharomyces cerevisiae genome. The method identifies 27 sequences as potential disulfide oxidoreductases. All previously known thioredoxins, glutaredoxins, and disulfide isomerases are correctly identified. Three of the 27 predictions are probable false-positives. Three novel predictions, which subsequently have been experimentally validated, are presented. Two additional novel predictions suggest a disulfide oxidoreductase regulatory mechanism for two subunits (OST3 and OST6) of the yeast oligosaccharyltransferase complex. Based on homology, this prediction can be extended to a potential tumor suppressor gene, N33, in humans, whose biochemical function was not previously known. Attempts to obtain a folded, active N33 construct to test the prediction were unsuccessful. The results show that structure prediction coupled with biochemically relevant structural motifs is a powerful method for the function annotation of genome sequences and can provide more detailed, robust predictions than function prediction methods that rely on sequence comparison alone.


Assuntos
Biologia Computacional/métodos , Oxirredutases/química , Oxirredutases/metabolismo , Relação Quantitativa Estrutura-Atividade , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/genética , Algoritmos , Motivos de Aminoácidos , Animais , Sítios de Ligação , Caenorhabditis elegans/enzimologia , Caenorhabditis elegans/genética , Dicroísmo Circular , Bases de Dados como Assunto , Genoma Fúngico , Humanos , Espectroscopia de Ressonância Magnética , Modelos Moleculares , Oxirredutases/genética , Conformação Proteica , Subunidades Proteicas , Reprodutibilidade dos Testes , Homologia de Sequência de Aminoácidos
10.
Proteins ; 42(3): 319-31, 2001 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-11151004

RESUMO

PROSPECTOR (PROtein Structure Predictor Employing Combined Threading to Optimize Results) is a new threading approach that uses sequence profiles to generate an initial probe-template alignment and then uses this "partly thawed" alignment in the evaluation of pair interactions. Two types of sequence profiles are used: the close set, composed of sequences in which sequence identity lies between 35% and 90%; and the distant set, composed of sequences with a FASTA E-score less than 10. Thus, a total of four scoring functions are used in a hierarchical method: the close (distant) sequence profiles screen a structural database to provide an initial alignment of the probe sequence in each of the templates. The same database is then screened with a scoring function composed of sequence plus secondary structure plus pair interaction profiles. This combined hierarchical threading method is called PROSPECTOR1. For the original Fischer database, 59 of 68 pairs are correctly identified in the top position. Next, the set of the top 20 scoring sequences (four scoring functions times the top five structures) is used to construct a protein-specific pair potential based on consensus side-chain contacts occurring in 25% of the structures. In subsequent threading iterations, this protein-specific pair potential, when combined in a composite manner, is found to be more sensitive in identifying the correct pairs than when the original statistical potential is used, and it increases the number of recognized structures for the combined scoring functions, termed PROSPECTOR2, to a total of 61 Fischer pairs identified in the top position. Application to a second, smaller Fischer database of 27 probe-template pairs places 18 (17) structures in the top position for PROSPECTOR1 (PROSPECTOR2). Overall, these studies show that the use of pair interactions as assessed by the improved Z-score enhances the specificity of probe-template matches. Thus, when the hierarchy of scoring functions is combined, the ability to identify correct probe-template pairs is significantly enhanced. Finally, a web server has been established for use by the academic community (http://bioinformatics.danforthcenter.org/services/threading.html).


Assuntos
Biologia Computacional , Conformação Proteica , Software , Benchmarking , Bases de Dados Factuais , Internet
11.
Proteins ; Suppl 5: 149-56, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11835492

RESUMO

A combination of sequence comparison, threading, lattice, and off-lattice Monte Carlo (MC) simulations and clustering of MC trajectories was used to predict the structure of all (but one) targets of the CASP4 experiment on protein structure prediction. Although this method is automated and is operationally the same regardless of the level of uniqueness of the query proteins, here we focus on the more difficult targets at the border of the fold recognition and new fold categories. For a few targets (T0110 is probably the best example), the ab initio method produced more accurate models than models obtained by the fold recognition techniques. For the most difficult targets from the new fold categories, substantial fragments of structures have been correctly predicted. Possible improvements of the method are briefly discussed.


Assuntos
Conformação Proteica , Dobramento de Proteína , Análise por Conglomerados , Modelos Moleculares , Método de Monte Carlo , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Análise de Sequência de Proteína , Software
13.
Biophys J ; 79(4): 1787-99, 2000 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-11023886

RESUMO

The impact on protein evolution of the physical laws that govern folding remains obscure. Here, by analyzing in silico-evolved sequences subjected to evolutionary pressure for fast folding, it is shown that: First, a subset of residues in the thermodynamic folding nucleus is mainly responsible for modulating the protein folding rate. Second and most important, the protein topology itself is of paramount importance in determining the location of these residues in the structure. Further stabilization of the interactions in this nucleus leads to fast folding sequences. Third, these nucleation points restrict the sequence space available to the protein during evolution. Correlated mutations between positions around these hot spots arise in a statistically significant manner, and most involve contacting residues. When a similar analysis is carried out on real proteins, qualitatively similar results are obtained.


Assuntos
Evolução Molecular , Dobramento de Proteína , Proteínas/química , Proteínas/genética , Fenômenos Biofísicos , Biofísica , Bases de Dados Factuais , Cinética , Modelos Moleculares , Mutação , Alinhamento de Sequência , Termodinâmica
14.
Proteins ; 41(1): 86-97, 2000 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-10944396

RESUMO

A procedure for the reconstruction of all-atom protein structures from side-chain center-based low-resolution models is introduced and applied to a set of test proteins with high-resolution X-ray structures. The accuracy of the rebuilt all-atom models is measured by root mean square deviations to the corresponding X-ray structures and percentages of correct chi(1) and chi(2) side-chain dihedrals. The benefit of including C(alpha) positions in the low-resolution model is examined, and the effect of lattice-based models on the reconstruction accuracy is discussed. Programs and scripts implementing the reconstruction procedure are made available through the NIH research resource for Multiscale Modeling Tools in Structural Biology (http://mmtsb.scripps.edu).


Assuntos
Proteínas/química , Modelos Moleculares
15.
Nat Biotechnol ; 18(3): 283-7, 2000 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-10700142

RESUMO

Structural genomics projects aim to solve the experimental structures of all possible protein folds. Such projects entail a conceptual shift from traditional structural biology in which structural information is obtained on known proteins to one in which the structure of a protein is determined first and the function assigned only later. Whereas the goal of converting protein structure into function can be accomplished by traditional sequence motif-based approaches, recent studies have shown that assignment of a protein's biochemical function can also be achieved by scanning its structure for a match to the geometry and chemical identity of a known active site. Importantly, this approach can use low-resolution structures provided by contemporary structure prediction methods. When applied to genomes, structural information (either experimental or predicted) is likely to play an important role in high-throughput function assignment.


Assuntos
Genoma , Biologia Molecular/métodos , Dobramento de Proteína , Animais , Simulação por Computador , Bases de Dados Factuais , Evolução Molecular , Humanos , Internet , Relação Estrutura-Atividade
16.
Proteins ; 38(1): 3-16, 2000 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-10651034

RESUMO

A method is presented for the derivation of knowledge-based pair potentials that corrects for the various compositions of different proteins. The resulting statistical pair potential is more specific than that derived from previous approaches as assessed by gapless threading results. Additionally, a methodology is presented that interpolates between statistical potentials when no homologous examples to the protein of interest are in the structural database used to derive the potential, to a Go-like potential (in which native interactions are favorable and all nonnative interactions are not) when homologous proteins are present. For cases in which no protein exceeds 30% sequence identity, pairs of weakly homologous interacting fragments are employed to enhance the specificity of the potential. In gapless threading, the mean z score increases from -10.4 for the best statistical pair potential to -12.8 when the local sequence similarity, fragment-based pair potentials are used. Examination of the ab initio structure prediction of four representative globular proteins consistently reveals a qualitative improvement in the yield of structures in the 4 to 6 A rmsd from native range when the fragment-based pair potential is used relative to that when the quasichemical pair potential is employed. This suggests that such protein-specific potentials provide a significant advantage relative to generic quasichemical potentials.


Assuntos
Proteínas/química , Aminoácidos/química , Simulação por Computador , Bases de Dados Factuais , Método de Monte Carlo , Fragmentos de Peptídeos/química , Conformação Proteica , Dobramento de Proteína , Estrutura Terciária de Proteína , Termodinâmica
17.
Proteins ; 38(1): 17-28, 2000 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-10651035

RESUMO

Reduced lattice models of the three de novo designed helical proteins alpha2, alpha2C, and alpha2D were studied. Low temperature stable folds were obtained for all three proteins. In all cases, the lowest energy folds were four-helix bundles. The folding pathway is qualitatively the same for all proteins studied. The energies of various topologies are similar, especially for the alpha2 polypeptide. The simulated crossover from molten globule to native-like behavior is very similar to that seen in experimental studies. Simulations on a reduced protein model reproduce most of the experimental properties of the alpha2, alpha2C, and alpha2D proteins. Stable four-helix bundle structures were obtained, with increasing native-like behavior on-going from alpha2 to alpha2D that mimics experiment.


Assuntos
Proteínas/química , Sequência de Aminoácidos , Simulação por Computador , Desenho de Fármacos , Dados de Sequência Molecular , Dobramento de Proteína , Estrutura Secundária de Proteína , Termodinâmica
18.
Trends Biotechnol ; 18(1): 34-9, 2000 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-10631780

RESUMO

The genome-sequencing projects are providing a detailed 'parts list' of life. A key to comprehending this list is understanding the function of each gene and each protein at various levels. Sequence-based methods for function prediction are inadequate because of the multifunctional nature of proteins. However, just knowing the structure of the protein is also insufficient for prediction of multiple functional sites. Structural descriptors for protein functional sites are crucial for unlocking the secrets in both the sequence and structural-genomics projects.


Assuntos
Genes , Proteínas/química , Proteínas/metabolismo , Proteínas/genética , Relação Estrutura-Atividade
20.
Biophys J ; 77(6): 2942-52, 1999 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-10585918

RESUMO

Small peptides that might have some features of globular proteins can provide important insights into the protein folding problem. Two simulation methods, Monte Carlo Dynamics (MCD), based on the Metropolis sampling scheme, and Entropy Sampling Monte Carlo (ESMC), were applied in a study of a high-resolution lattice model of the C-terminal fragment of the B1 domain of protein G. The results provide a detailed description of folding dynamics and thermodynamics and agree with recent experimental findings (. Nature. 390:196-197). In particular, it was found that the folding is cooperative and has features of an all-or-none transition. Hairpin assembly is usually initiated by turn formation; however, hydrophobic collapse, followed by the system rearrangement, was also observed. The denatured state exhibits a substantial amount of fluctuating helical conformations, despite the strong beta-type secondary structure propensities encoded in the sequence.


Assuntos
Proteínas/química , Sequência de Aminoácidos , Animais , Fenômenos Biofísicos , Biofísica , Modelos Moleculares , Dados de Sequência Molecular , Método de Monte Carlo , Proteínas do Tecido Nervoso/química , Conformação Proteica , Dobramento de Proteína , Estrutura Secundária de Proteína , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA