Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Curr Pharm Biotechnol ; 3(4): 329-47, 2002 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-12463416

RESUMO

Protein folding, the problem of how an amino acid sequence folds into a unique three-dimensional shape, has been a long-standing problem in biology. The success of genome-wide sequencing efforts has increased the interest in understanding the protein folding enigma, because realizing the value of the genomic sequences rests on the accuracy with which the encoded gene products are understood. Although a complete understanding of the kinetics and thermodynamics of protein folding has remained elusive, there has been considerable progress in techniques to predict protein structure from amino acid sequences. The prediction techniques fall into three general classes: comparative modeling, threading and ab initio folding. The current state of research in each of these three areas is reviewed here in detail. Efforts to apply each method to proteome-wide analysis are reviewed, and some of the key technical hurdles that remain are presented. Protein folding technologies, while not yet providing a full understanding of the protein folding process, have clearly progressed to the point of being useful in enabling structure-based annotation of genomic sequences.


Assuntos
Biologia Computacional/métodos , Dobramento de Proteína , Animais , Fenômenos Biofísicos , Biofísica , Biologia Computacional/tendências , Humanos
2.
Biochemistry ; 40(51): 15846-55, 2001 Dec 25.
Artigo em Inglês | MEDLINE | ID: mdl-11747462

RESUMO

We report the application of our newly developed dielectric resonator-based flow and stopped-flow kinetic EPR systematically to probe protein folding in yeast iso-1-cytochrome c at cysteine-directed spin-labeled locations. The locations studied have not been previously directly probed by other techniques, and we observe them on a time scale stretching from 50 micros to seconds. On the basis of crystal structure and homology information, the following mutation-tolerant, externally located cysteine labeling sites were chosen (in helices, T8C, E66C, and N92C; in loops, E21C, V28C, H39C, D50C, and K79C), and labeling at these sites was not destabilizing. Dilution of denaturant was used to induce folding and thereby to cause a change in the spin label EPR signal as folding altered the motion of the spin label. Under folding conditions, including the presence of imidazole to eliminate kinetic trapping due to heme misligation, a phase of folding on the 20-30 ms time scale was found. This phase occurred not only at the T8C and N92C labeling sites in the N- and C-terminal helices, where such a phase has been associated with folding in these helices, but overall at labeling sites throughout the protein. In the absence of imidazole the 20-30 ms phase disappeared, and another phase having the time scale of 1 s appeared throughout the protein. There was evidence under all conditions for a burst phase on a scale of less than several milliseconds which occurred at labeling positions V28C, H39C, D50C, E66C, and K79C in the middle of the protein sequence. At spin-labeled D50C rapid-mix flow EPR indicated a very short approximately 50 micros phase possibly associated with the prefolding or compaction of the loop to which D50 belongs. Spin labels have been criticized as perturbing the phenomena which they measure, but our spin labeling strategy has reported common kinetic themes and not perturbed, disconnected kinetic events.


Assuntos
Cisteína/genética , Grupo dos Citocromos c/química , Grupo dos Citocromos c/genética , Citocromos c , Mutagênese Sítio-Dirigida , Dobramento de Proteína , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/genética , Marcadores de Spin , Dicroísmo Circular , Espectroscopia de Ressonância de Spin Eletrônica/métodos , Isoenzimas/química , Isoenzimas/genética , Cinética , Espectrometria de Massas , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Serina/genética
3.
J Struct Biol ; 134(2-3): 232-45, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11551182

RESUMO

In order to circumvent limitations of sequence based methods in the process of making functional predictions for proteins, we have developed a methodology that uses a sequence-to-structure-to-function paradigm. First, an approximate three-dimensional structure is predicted. Then, a three-dimensional descriptor of the functional site, termed a Fuzzy Functional Form, or FFF, is used to screen the structure for the presence of the functional site of interest (Fetrow et al., 1998; Fetrow and Skolnick, 1998). Previously, a disulfide oxidoreductase FFF was developed and applied to predicted structures obtained from a small structural database. Here, using a substantially larger structural database, we expand the analysis of the disulfide oxidoreductase FFF to the B. subtilis genome. To ascertain the performance of the FFF, its results are compared to those obtained using both the sequence alignment method BLAST and three local sequence motif databases: PRINTS, Prosite, and Blocks. The FFF method is then compared in detail to Blocks and it is shown that the FFF is more flexible and sensitive in finding a specific function in a set of unknown proteins. In addition, the estimated false positive rate of function prediction is significantly lower using the FFF structural motif, rather than the standard sequence motif methods. We also present a second FFF and describe a specific example of the results of its whole-genome application to D. melanogaster using a newer threading algorithm. Our results from all of these studies indicate that the addition of three-dimensional structural information adds significant value in the prediction of biochemical function of genomic sequences.


Assuntos
Proteínas/química , Proteínas/fisiologia , Algoritmos , Sequência de Aminoácidos , Animais , Bacillus subtilis/enzimologia , Bacillus subtilis/genética , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Proteínas de Bactérias/fisiologia , Drosophila melanogaster/enzimologia , Drosophila melanogaster/genética , Genoma , Genoma Bacteriano , Humanos , Proteínas de Insetos/genética , Proteínas de Insetos/fisiologia , Dados de Sequência Molecular , Proteína Dissulfeto Redutase (Glutationa)/química , Proteína Dissulfeto Redutase (Glutationa)/fisiologia , Proteínas Tirosina Fosfatases/química , Proteínas Tirosina Fosfatases/fisiologia , Proteínas/genética , Relação Estrutura-Atividade
4.
Curr Opin Drug Discov Devel ; 4(3): 291-5, 2001 May.
Artigo em Inglês | MEDLINE | ID: mdl-11560060

RESUMO

Existing functional annotation transfer is fraught with inaccuracies that may hinder forward interpretation and mining of genomic data. Hand-curation of the annotation placed into databases is not practical. In lieu of experimental evidence, computational biological approaches offer high-throughput tools to predict function accurately; however, these methods are still notably deficient in defining and describing the complexity of protein function. Enriching genomic sequences obtained from sequencing efforts and expression array methods with protein function information and classification will be an efficient first step for incorporating genomic data into drug discovery programs.


Assuntos
Biologia Computacional , Biblioteca Genômica , Proteínas/química , Proteínas/fisiologia , Bases de Dados Factuais , Modelos Moleculares , Conformação Proteica , Proteínas/genética , Relação Estrutura-Atividade
5.
Protein Sci ; 10(5): 1005-14, 2001 May.
Artigo em Inglês | MEDLINE | ID: mdl-11316881

RESUMO

A function annotation method using the sequence-to-structure-to-function paradigm is applied to the identification of all disulfide oxidoreductases in the Saccharomyces cerevisiae genome. The method identifies 27 sequences as potential disulfide oxidoreductases. All previously known thioredoxins, glutaredoxins, and disulfide isomerases are correctly identified. Three of the 27 predictions are probable false-positives. Three novel predictions, which subsequently have been experimentally validated, are presented. Two additional novel predictions suggest a disulfide oxidoreductase regulatory mechanism for two subunits (OST3 and OST6) of the yeast oligosaccharyltransferase complex. Based on homology, this prediction can be extended to a potential tumor suppressor gene, N33, in humans, whose biochemical function was not previously known. Attempts to obtain a folded, active N33 construct to test the prediction were unsuccessful. The results show that structure prediction coupled with biochemically relevant structural motifs is a powerful method for the function annotation of genome sequences and can provide more detailed, robust predictions than function prediction methods that rely on sequence comparison alone.


Assuntos
Biologia Computacional/métodos , Oxirredutases/química , Oxirredutases/metabolismo , Relação Quantitativa Estrutura-Atividade , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/genética , Algoritmos , Motivos de Aminoácidos , Animais , Sítios de Ligação , Caenorhabditis elegans/enzimologia , Caenorhabditis elegans/genética , Dicroísmo Circular , Bases de Dados como Assunto , Genoma Fúngico , Humanos , Espectroscopia de Ressonância Magnética , Modelos Moleculares , Oxirredutases/genética , Conformação Proteica , Subunidades Proteicas , Reprodutibilidade dos Testes , Homologia de Sequência de Aminoácidos
6.
Nat Biotechnol ; 18(3): 283-7, 2000 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-10700142

RESUMO

Structural genomics projects aim to solve the experimental structures of all possible protein folds. Such projects entail a conceptual shift from traditional structural biology in which structural information is obtained on known proteins to one in which the structure of a protein is determined first and the function assigned only later. Whereas the goal of converting protein structure into function can be accomplished by traditional sequence motif-based approaches, recent studies have shown that assignment of a protein's biochemical function can also be achieved by scanning its structure for a match to the geometry and chemical identity of a known active site. Importantly, this approach can use low-resolution structures provided by contemporary structure prediction methods. When applied to genomes, structural information (either experimental or predicted) is likely to play an important role in high-throughput function assignment.


Assuntos
Genoma , Biologia Molecular/métodos , Dobramento de Proteína , Animais , Simulação por Computador , Bases de Dados Factuais , Evolução Molecular , Humanos , Internet , Relação Estrutura-Atividade
7.
Trends Biotechnol ; 18(1): 34-9, 2000 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-10631780

RESUMO

The genome-sequencing projects are providing a detailed 'parts list' of life. A key to comprehending this list is understanding the function of each gene and each protein at various levels. Sequence-based methods for function prediction are inadequate because of the multifunctional nature of proteins. However, just knowing the structure of the protein is also insufficient for prediction of multiple functional sites. Structural descriptors for protein functional sites are crucial for unlocking the secrets in both the sequence and structural-genomics projects.


Assuntos
Genes , Proteínas/química , Proteínas/metabolismo , Proteínas/genética , Relação Estrutura-Atividade
8.
FASEB J ; 13(13): 1866-74, 1999 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-10506591

RESUMO

In previous work, 3-dimensional descriptors of protein function ('fuzzy functional forms') were used to identify disulfide oxidoreductase active sites in high-resolution protein structures. During this analysis, a potential disulfide oxidoreductase active site in the serine/threonine protein phosphatase-1 (PP1) crystal structure was discovered. In PP1, the potential redox active site is located in close proximity to the phosphatase active site. This result is interesting in view of literature suggesting that serine/threonine phosphatases could be subject to redox control mechanisms within the cell; however, the actual source of this control is unknown. Additional analysis presented here shows that the putative oxidoreductase active site is highly conserved in the serine/threonine phosphatase-1 subfamily, but not in the serine/threonine phosphatase-2A or -2B subfamilies. These results demonstrate the significant advantages of using structure-based motifs for protein functional site identification. First, a putative disulfide oxidoreductase active site has been identified in serine-threonine phosphatases using a descriptor built from the glutaredoxin/thioredoxin family, proteins that have no apparent evolutionary relationship whatsoever to the PP1 proteins. Second, the proximity of the putative disulfide oxidoreductase active site to the phosphatase active site provides evidence toward a regulatory control mechanism. No sequence-based method could provide either piece of information.


Assuntos
Domínio Catalítico , Fosfoproteínas Fosfatases/química , Fosfoproteínas Fosfatases/classificação , Proteína Dissulfeto Redutase (Glutationa)/química , Sequência de Aminoácidos , Regulação Enzimológica da Expressão Gênica , Modelos Moleculares , Dados de Sequência Molecular , Oxirredução , Proteína Fosfatase 1 , Proteína Fosfatase 2 , Alinhamento de Sequência , Análise de Sequência
9.
J Mol Biol ; 293(2): 351-66, 1999 Oct 22.
Artigo em Inglês | MEDLINE | ID: mdl-10529350

RESUMO

The majority of cell surface receptors involved in antigen recognition by T cells and in the orchestration of the subsequent cell signalling events are glycoproteins. The length of a typical N-linked sugar is comparable with that of an immunoglobulin domain (30 A). Thus, by virtue of their size alone, oligosaccharides may be expected to play a significant role in the functions and properties of the cell surface proteins to which they are attached. A databank of oligosaccharide structures has been constructed from NMR and crystallographic data to aid in the interpretation of crystal structures of glycoproteins. As unambiguous electron density can usually only be assigned to the glycan cores, the remainder of the sugar is then modelled into the crystal lattice by superimposing the appropriate oligosaccharide from the database. This approach provides insights into the roles that glycosylation might play in cell surface receptors, by providing models that delineate potential close packing interactions on the cell surface. It has been proposed that the specific recognition of antigen by T cells results in the formation of an immunological synapse between the T cell and the antigen-presenting cell. The cell adhesion glycoproteins, such as CD2 and CD48, help to form a cell junction, providing a molecular spacer between opposing cells. The oligosaccharides located on the membrane proximal domains of CD2 and CD48 provide a scaffold to orient the binding faces, which leads to increased affinity. In the next step, recruitment of the peptide major histocompatibility complex (pMHC) by the T-cell receptors (TCRs) requires mobility on the membrane surface. The TCR sugars are located such that they could prevent non-specific aggregation. Importantly, the sugars limit the possible geometry and spacing of TCR/MHC clusters which precede cell signalling. We postulate that, in the final stage, the sugars could play a general role in controlling the assembly and stabilisation of the complexes in the synapse and in protecting them from proteolysis during prolonged T-cell engagement.


Assuntos
Antígenos CD/química , Antígenos CD/metabolismo , Receptores de Antígenos de Linfócitos T/química , Receptores de Antígenos de Linfócitos T/metabolismo , Linfócitos T/imunologia , Linfócitos T/metabolismo , Animais , Células Apresentadoras de Antígenos/citologia , Células Apresentadoras de Antígenos/imunologia , Células Apresentadoras de Antígenos/metabolismo , Antígenos CD/imunologia , Configuração de Carboidratos , Sequência de Carboidratos , Glicosilação , Antígenos de Histocompatibilidade Classe I/química , Antígenos de Histocompatibilidade Classe I/imunologia , Antígenos de Histocompatibilidade Classe I/metabolismo , Humanos , Dados de Sequência Molecular , Oligossacarídeos/química , Oligossacarídeos/metabolismo , Conformação Proteica , Receptores de Antígenos de Linfócitos T/imunologia , Linfócitos T/citologia
10.
Pac Symp Biocomput ; : 278-89, 1999.
Artigo em Inglês | MEDLINE | ID: mdl-10380204

RESUMO

An understanding of the regularities in the side chain conformations of proteins and how these are related to local backbone structures is important for protein modeling and design. Previous work using regular secondary structures and regular divisions of the backbone dihedral angle data has shown that these rotamers are sensitive to the protein's local backbone conformation. In this preliminary study, we demonstrate a method for combining a more general backbone structure model with an objective clustering algorithm to investigate the effects of backbone structures on side chain rotamer classes and distributions. For the local structure classification, we use the Structural Building Blocks (SBB) categories, which represent all types of secondary structure, including regular structures, capping structures, and loops. For classification of side chain data, we use Minimum Message Length (MML) clustering from information theory. We show an example of how MML clustering on data classified by backbone SBBs can reveal different distributions of rotamer classes among the SBBs. Using these preliminary results, some of the characteristics of a rotamer library created using MML clustering on SBB dependent rotamer data are demonstrated.


Assuntos
Bases de Dados Factuais , Teoria da Informação , Biblioteca de Peptídeos , Proteínas/química , Biologia Computacional/métodos , Internet , Conformação Proteica , Software
11.
Protein Sci ; 8(5): 1104-15, 1999 May.
Artigo em Inglês | MEDLINE | ID: mdl-10338021

RESUMO

A database of functional sites for proteins with known structures, SITE, is constructed and used in conjunction with a simple pattern matching program SiteMatch to evaluate possible function conservation in a recently constructed database of fold predictions for Escherichia coli proteins (Rychlewski L et al., 1999, Protein Sci 8:614-624). In this and other prediction databases, fold predictions are based on algorithms that can recognize weak sequence similarities and putatively assign new proteins into already characterized protein families. It is not clear whether such sequence similarities arise from distant homologies or general similarity of physicochemical features along the sequence. Leaving aside the important question of nature of relations within fold superfamilies, it is possible to assess possible function conservation by looking at the pattern of conservation of crucial functional residues. SITE consists of a multilevel function description based on structure annotations and structure analyses. In particular, active site residues, ligand binding residues, and patterns of hydrophobic residues on the protein surface are used to describe different functional features. SiteMatch, a simple pattern matching program, is designed to check the conservation of residues involved in protein activity in alignments generated by any alignment method. Here, this procedure is used to study conservation of functional features in alignments between protein sequences from the E. coli genome and their optimal structural templates. The optimal templates were identified and alignments taken from the database of genomic structural predictions was described in a previous publication (Rychlewski L et al., 1999, Protein Sci 8:614-624). An automated assessment of function conservation is used to analyze the relation between fold and function similarity for a large number of fold predictions. For instance, it is shown that identifying low significance predictions with a high level of functional residue conservations can be used to extend the prediction sensitivity for fold prediction methods. Over 100 new fold/function predictions in this class were obtained in the E. coli genome. At the same time, about 30% of our previous fold predictions are not confirmed as function predictions, further highlighting the problem of function divergence in fold superfamilies.


Assuntos
Algoritmos , Dobramento de Proteína , Sequência de Aminoácidos , Simulação por Computador , Bases de Dados Factuais , Modelos Moleculares , Dados de Sequência Molecular , Relação Estrutura-Atividade
12.
Biochemistry ; 38(14): 4480-92, 1999 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-10194370

RESUMO

A protocol for complete isotopic labeling of iso-1-cytochrome c from the eukaryote Saccharomyces cerevisiae is reported. Assignments are reported for the vast majority of the 15N amide resonances in both oxidized and reduced states. 15N heteronuclear relaxation experiments were collected to study the picosecond-nanosecond backbone dynamics of this protein. Relaxation rates were computed and fit to spectral density functions by a model-free analysis. Backbone amides in the overlapping loop B/C region are the most flexible on the picosecond-nanosecond time scale in both forms of the protein. The results show that, on average, the protein backbone is slightly more dynamic in the oxidized than the reduced state, though not significantly so. Exchange terms, which suggest significant motion on a time scale at least an order of magnitude slower than the overall correlation time of 5.2 ns, were required for only two residues in the reduced state and 27 residues in the oxidized state. When analyzed on a per-residue basis, the lower order parameters found in the oxidized state were scattered throughout the protein, with a few continuous segments found in loop C and the C-terminal helix, suggesting greater flexibility of these regions in the oxidized state. The results provide dynamic interpretations for previously presented structural and functional data, including redox-dependent changes that occur in the protein. The way is now paved for extensive dynamic analysis of variant cytochromes c.


Assuntos
Grupo dos Citocromos c/química , Citocromos c , Ressonância Magnética Nuclear Biomolecular , Proteínas de Saccharomyces cerevisiae , Cristalografia por Raios X , Cisteína/química , Cisteína/genética , Grupo dos Citocromos c/biossíntese , Grupo dos Citocromos c/genética , Fermentação , Isoenzimas/biossíntese , Isoenzimas/química , Isoenzimas/genética , Modelos Químicos , Isótopos de Nitrogênio , Oxirredução , Proteínas Recombinantes/biossíntese , Proteínas Recombinantes/química , Proteínas Recombinantes/isolamento & purificação , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/crescimento & desenvolvimento , Termodinâmica , Treonina/química , Treonina/genética
13.
Biochemistry ; 38(14): 4493-503, 1999 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-10194371

RESUMO

Heteronuclear NMR spectroscopy was used to measure the hydrogen-deuterium exchange rates of backbone amide hydrogens in both oxidized and reduced [U-15N]iso-1-cytochrome c from the yeast Saccharomyces cerevisiae. The exchange data confirm previously reported data [Marmorino et al. (1993) Protein Sci. 2, 1966-1974], resolve several inconsistencies, and provide more thorough coverage of exchange rates throughout the cytochrome c protein in both oxidation states. Combining the data previously collected on unlabeled C102T with the current data collected on [U-15N]C102T, exchange rates for 53 protons in the oxidized state and 52 protons in the reduced state can now be reported. Most significantly, hydrogen exchange measurements on [U-15N]iso-1-cytochrome c allowed the observation of exchange behavior of the secondary structures, such as large loops, that are not extensively hydrogen-bonded. For the helices, the most slowly exchanging protons are found in the middle of the helix, with more rapidly exchanging protons at the helix ends. The observation for the Omega-loops in cytochrome c is just the opposite. In the loops, the ends contain the most slowly exchanging protons and the loop middles allow more rapid exchange. This is found to be true in cytochrome c loops, even though the loop ends are not attached to any regular secondary structures. Some of the exchange data are strikingly inconsistent with data collected on the C102S variant at a different pH, which suggests pH-dependent dynamic differences in the protein structure. This new hydrogen exchange data for loop residues could have implications for the substructure model of eukaryotic cytochrome c folding. Isotopic labeling of variant forms of cytochrome c can now be used to answer many questions about the structure and folding of this model protein.


Assuntos
Grupo dos Citocromos c/química , Citocromos c , Ressonância Magnética Nuclear Biomolecular , Prótons , Proteínas de Saccharomyces cerevisiae , Cisteína/química , Estabilidade Enzimática , Ferro/química , Cinética , Metionina/química , Modelos Moleculares , Isótopos de Nitrogênio , Oxirredução , Desnaturação Proteica , Estrutura Secundária de Proteína , Saccharomyces cerevisiae/enzimologia , Soluções , Treonina/química
14.
J Mol Biol ; 282(4): 703-11, 1998 Oct 02.
Artigo em Inglês | MEDLINE | ID: mdl-9743619

RESUMO

The application of an automated method for the screening of protein activity based on the sequence-to-structure-to-function paradigm is presented for the complete Escherichia coli genome. First, the structure of the protein is identified from its sequence using a threading algorithm, which aligns the sequences to the best matching structure in a structural database and extends sequence analysis well beyond the limits of local sequence identity. Then, the active site is identified in the resulting sequence-to-structure alignment using a "fuzzy functional form" (FFF), a three-dimensional descriptor of the active site of a protein. Here, this sequence-to-structure-to-function concept is applied to analysis of the complete E. coli genome, i.e. all E. coli open reading frames (ORFs) are screened for the thiol-disulfide oxidoreductase activity of the glutaredoxin/thioredoxin protein family. We show that the method can identify the active sites in ten sequences that are known to or proposed to exhibit this activity. Furthermore, oxidoreductase activity is predicted in two other sequences that have not been identified previously. This method distinguishes protein pairs with similar active sites from proteins pairs that are just topological cousins, i.e. those having similar global folds, but not necessarily similar active sites. Thus, this method provides a novel approach for extraction of active site and functional information based on three-dimensional structures, rather than simple sequence analysis. Prediction of protein activity is fully automated and easily extendible to new functions. Finally, it is demonstrated here that the method can be applied to complete genome database analysis.


Assuntos
Escherichia coli/genética , Genoma Bacteriano , Oxirredutases , Proteína Dissulfeto Redutase (Glutationa)/metabolismo , Proteínas/química , Tiorredoxinas/química , Algoritmos , Automação , Sítios de Ligação , Bases de Dados Factuais , Escherichia coli/química , Escherichia coli/enzimologia , Glutarredoxinas , Fases de Leitura Aberta/genética , Conformação Proteica , Proteína Dissulfeto Redutase (Glutationa)/química , Proteína Dissulfeto Redutase (Glutationa)/genética , Dobramento de Proteína , Proteínas/genética , Proteínas/metabolismo , Alinhamento de Sequência , Software , Relação Estrutura-Atividade , Tiorredoxinas/genética , Tiorredoxinas/metabolismo
15.
Pac Symp Biocomput ; : 485-96, 1998.
Artigo em Inglês | MEDLINE | ID: mdl-9697206

RESUMO

We introduce a hypothesis that present day proteins evolved from "proto-proteins," small 15-20 residue peptides with some elements of secondary structure and primitive function. Increasingly stable and functional proteins arose by adding structural elements to produce the small domains or protein modules that we would recognize today. From this point of view, the surprising similarities between small structural fragments of large proteins, that are usually taken as examples of convergent, function-driven evolution, are interpreted in exactly the opposite way--as traces of common evolutionary origin. As an example, a hypothetical evolutionary tree for two families of RNA binding proteins, the OB fold, a family of all beta proteins, and RBD fold, an alpha/beta protein family is presented. We argue that both protein families could have evolved from the same RNA-binding proto-protein, which had a form of beta-loop-beta RNA binding motif.


Assuntos
Evolução Molecular , Estrutura Secundária de Proteína , Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/genética , Sequência de Aminoácidos , Bactérias , Sítios de Ligação , Gráficos por Computador , Simulação por Computador , Proteínas de Ligação a DNA/química , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Humanos , Modelos Genéticos , Modelos Moleculares , Dados de Sequência Molecular , Dobramento de Proteína , Proteínas de Ligação a RNA/metabolismo , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos
16.
J Mol Biol ; 281(5): 949-68, 1998 Sep 04.
Artigo em Inglês | MEDLINE | ID: mdl-9719646

RESUMO

The practical exploitation of the vast numbers of sequences in the genome sequence databases is crucially dependent on the ability to identify the function of each sequence. Unfortunately, current methods, including global sequence alignment and local sequence motif identification, are limited by the extent of sequence similarity between sequences of unknown and known function; these methods increasingly fail as the sequence identity diverges into and beyond the twilight zone of sequence identity. To address this problem, a novel method for identification of protein function based directly on the sequence-to-structure-to-function paradigm is described. Descriptors of protein active sites, termed "fuzzy functional forms" or FFFs, are created based on the geometry and conformation of the active site. By way of illustration, the active sites responsible for the disulfide oxidoreductase activity of the glutaredoxin/thioredoxin family and the RNA hydrolytic activity of the T1 ribonuclease family are presented. First, the FFFs are shown to correctly identify their corresponding active sites in a library of exact protein models produced by crystallography or NMR spectroscopy, most of which lack the specified activity. Next, these FFFs are used to screen for active sites in low-to-moderate resolution models produced by ab initio folding or threading prediction algorithms. Again, the FFFs can specifically identify the functional sites of these proteins from their predicted structures. The results demonstrate that low-to-moderate resolution models as produced by state-of-the-art tertiary structure prediction algorithms are sufficient to identify protein active sites. Prediction of a novel function for the gamma subunit of a yeast glycosyl transferase and prediction of the function of two hypothetical yeast proteins whose models were produced via threading are presented. This work suggests a means for the large-scale functional screening of genomic sequence databases based on the prediction of structure from sequence, then on the identification of functional active sites in the predicted structure.


Assuntos
Oxirredutases , Proteínas/química , Ribonucleases/química , Tiorredoxinas/química , Algoritmos , Sequência de Aminoácidos , Sítios de Ligação/fisiologia , Bases de Dados como Assunto , Proteínas Fúngicas/química , Proteínas Fúngicas/fisiologia , Glutarredoxinas , Modelos Moleculares , Conformação Molecular , Dados de Sequência Molecular , Proteína Dissulfeto Redutase (Glutationa)/química , Dobramento de Proteína , Estrutura Terciária de Proteína , Proteínas/fisiologia , Ribonucleases/fisiologia , Alinhamento de Sequência , Relação Estrutura-Atividade
17.
Protein Sci ; 7(4): 994-1005, 1998 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-9568906

RESUMO

In yeast iso-1-cytochrome c, the side chain of histidine 26 (His26) attaches omega loop A to the main body of the protein by forming a hydrogen bond to the backbone atom carbonyl of glutamic acid 44. The His26 side chain also forms a stabilizing intra-loop interaction through a hydrogen bond to the backbone amide of asparagine 31. To investigate the importance of loop-protein attachment and intra-loop interactions to the structure and function of this protein, a series of site-directed and random-directed mutations were produced at His26. Yeast strains expressing these variant proteins were analyzed for their ability to grow on non-fermentable carbon sources and for their intracellular production of cytochrome c. While the data show that mutations at His26 lead to slightly decreased intracellular amounts of cytochrome c, the level of cytochrome c function is decreased more. The data suggest that cytochrome c reductase binding is affected more than cytochrome c oxidase or lactate dehydrogenase binding. We propose that mutations at this residue increase loop mobility, which, in turn, decreases the protein's ability to bind redox partners.


Assuntos
Grupo dos Citocromos c/química , Citocromos c , Histidina/genética , Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae/química , Sequência de Aminoácidos , Divisão Celular/genética , Complexo IV da Cadeia de Transporte de Elétrons/metabolismo , Proteínas Fúngicas/química , Ligação de Hidrogênio , L-Lactato Desidrogenase/metabolismo , Modelos Moleculares , Dados de Sequência Molecular , Mutagênese Sítio-Dirigida/genética , NADH Desidrogenase/metabolismo , Ligação Proteica/fisiologia , Dobramento de Proteína , Estrutura Secundária de Proteína , Espectrofotometria , Relação Estrutura-Atividade
18.
Biochemistry ; 37(8): 2477-87, 1998 Feb 24.
Artigo em Inglês | MEDLINE | ID: mdl-9485396

RESUMO

Residues 75-78 form a tight turn within Omega-loop D in Saccharomyces cerevisiae iso-1-cytochrome c. Directed, random mutagenesis of invariant residues proline 76 and glycine 77 in this turn were analyzed for the in vivo functionality and level of protein within the cell. All proteins, except Pro76Val, also exhibit a significant decrease in intracellular cytochrome c levels, ranging from 15% to 80% of wild type. Furthermore, all isolated mutant strains, except the one expressing Pro76Val, exhibit a significant decrease in growth on lactate medium, suggesting that the variant cytochromes are much less functional than wild type. This requirement for protein function is clearly the cause for the strict invariance of these residues in eukaryotic cytochromes c. Seven proteins with mutations just at Pro76 were purified and studied by circular dichroism spectroscopy. All proteins with mutations at Pro76 exhibit melting temperatures about 7 degreesC less than that of the wild-type protein, suggesting that mutation of Pro76 affects the entropy of the denatured state. It is proposed that the functional significance of Pro76 and Gly77 is the requirement for a type II (betagammaL) beta-turn in this loop, the conformation of which requires a glycine at the third position, and that a change occurs in this turn conformation upon a change in the redox state of the protein.


Assuntos
Grupo dos Citocromos c/química , Grupo dos Citocromos c/metabolismo , Citocromos c , Proteínas de Saccharomyces cerevisiae , Animais , Sítios de Ligação/genética , Dicroísmo Circular , Grupo dos Citocromos c/genética , Células Eucarióticas , Glicina/química , Modelos Moleculares , Mutagênese Sítio-Dirigida , Mutação Puntual , Prolina/química , Conformação Proteica , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/crescimento & desenvolvimento , Saccharomyces cerevisiae/metabolismo , Temperatura
19.
Fold Des ; 3(6): 535-48, 1998.
Artigo em Inglês | MEDLINE | ID: mdl-9889164

RESUMO

BACKGROUND: Database-searching methods based on sequence similarity have become the most commonly used tools for characterizing newly sequenced proteins. Due to the often underestimated functional diversity in protein families and superfamilies, however, it is difficult to make the characterization specific and accurate. In this work, we have extended a method for active-site identification from predicted protein structures. RESULTS: The structural conservation and variation of the active sites of the alpha/beta hydrolases with known structures were studied. The similarities were incorporated into a three-dimensional motif that specifies essential requirements for the enzymatic functions. A threading algorithm was used to align 651 Escherichia coli open reading frames (ORFs) to one of the members of the alpha/beta hydrolase fold family. These ORFs were then screened according to our three-dimensional motif and with an extra requirement that demands conservation of the key active-site residues among the proteins that bear significant sequence similarity to the ORFs. 17 ORFs from E. coli were predicted to have hydrolase activity and their putative active-site residues were identified. Most were in agreement with the experiments and results of other database-searching methods. The study further suggests that YHET_ECOLI, a hypothetical protein classified as a member of the UPF0017 family (an uncharacterized protein family), bears all the hallmarks of the alpha/beta hydrolase family. CONCLUSIONS: The novel feature of our method is that it uses three-dimensional structural information for function prediction. The results demonstrate the importance and necessity of such a method to fill the gap between sequence alignment and function prediction; furthermore, the method provides a way to verify the structure predictions, which enables an expansion of the applicable scope of the threading algorithms.


Assuntos
Escherichia coli/genética , Genoma Bacteriano , Hidrolases/química , Hidrolases/genética , Sequência de Aminoácidos , Bases de Dados Factuais , Dados de Sequência Molecular , Valor Preditivo dos Testes , Dobramento de Proteína
20.
Biochemistry ; 36(10): 2884-97, 1997 Mar 11.
Artigo em Inglês | MEDLINE | ID: mdl-9062118

RESUMO

The kinetics of chemically induced folding and unfolding processes in spin-labeled yeast iso-1-cytochrome c were measured by stopped-flow electron paramagnetic resonance (EPR). Stopped-flow EPR, based on a new dielectric resonator structure [Sienkiewicz, A., Qu, K., & Scholes, C. P. (1994) Rev. Sci. Instrum. 65, 68-74], gives a new temporal component to probing nanosecond molecular tumbling motions that are modulated by macromolecular processes requiring time resolution of milliseconds to seconds. The stopped-flow EPR technique presented in this work is a kinetic technique that has not been previously used with such a time resolution on spin-labeled systems, and it has the potential for application to numerous spin-labeled sites in this and other proteins. The cysteine-specific spin-label, methanethiosulfonate spin-label (MTSSL), was attached to yeast iso-1-cytochrome c at the single naturally occurring cysteine102, and the emphasis for this work was on this disulfide-attached spin-labeled prototype. This probe has the advantage of reflecting the protein tertiary fold, as shown by recent, systematic site-directed spin labeling of T4 lysozyme [Mchaourab, H. S. Lietzow, M. A., Hideg, K., & Hubbell, W. L. (1996) Biochemistry 35, 7692-7704], and protein backbone dynamics, as also shown by model peptide studies [Todd, A. P., & Millhauser, G. L. (1991) Biochemistry 30, 5515-5523]. The C-terminal cytochrome c helix where the label is attached is thought to be critical in the initial steps of protein folding and unfolding. Stopped-flow EPR resolved the monoexponential, guanidinium-induced unfolding process at pH 6.5 with an approximately 20 ms time constant; this experiment required less than 150 microL of 80 microM spin-labeled protein. We observed an approximately 50-fold decrease of this unfolding time from the 1 s range to the 20 ms time range as the guanidinium denaturant concentration was increased from 0.6 to 2.0 M. The more complex refolding kinetics of our labeled cytochrome were studied by stopped-flow EPR at pH 5.0 and 6.5. The spin probe showed a fast kinetic process compatible with the time range over which hydrogen/deuterium amide protection indicates helix formation; this process was monoexponential at pH 5.0. At pH 6.5, there was evidence of an additional slower kinetic phase resolved by stopped-flow EPR and by heme-ligation-sensitive UV-Vis that indicated a slower folding where heme misligation may be involved. Since the disulfide-attached probe has reported folding and backbone dynamics in other systems, the implication is that our kinetic experiments were directly sensing events of the C-terminal helix formation and possibly the N- and C-terminal helical interaction. The cysteine-labeled protein was also studied under equilibrium conditions to characterize probe mobility and the effect of the probe on protein thermodynamics. The difference in spin probe mobility between folded and denatured protein was marked, and in the folded protein, the motion of the probe was anisotropically restricted. The motion of the attached nitroxide in the folded protein appears to be restricted about the carbon and sulfur bonds which tether it to the cysteine. The original point of cysteine sulfur attachment is approximately 11 A from the heme iron within the C-terminal helix near its interface with the N-terminal helix, but the low-temperature EPR spin probe line width showed that the probe lies more distant (> 15 A) from the heme iron. By all physical evidence, the protein labeled at cysteine102 folded, but the spin probe in this prototype system perturbed packing which lowered the thermal melting temperature, the free energy of folding, the guanidinium concentration at the midpoint of the unfolding transition, the m parameter of the denaturant, and the helical CD signature. This study prepares the way for study of protein folding/unfolding kinetics using EPR spectroscopy of spin-labels placed at specific cysteine-mutated sites within


Assuntos
Grupo dos Citocromos c/química , Grupo dos Citocromos c/metabolismo , Citocromos c , Dobramento de Proteína , Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae/química , Dicroísmo Circular , Cisteína/metabolismo , Espectroscopia de Ressonância de Spin Eletrônica , Guanidina , Guanidinas/farmacologia , Concentração de Íons de Hidrogênio , Cinética , Mesilatos/metabolismo , Modelos Moleculares , Conformação Proteica , Desnaturação Proteica , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Espectrofotometria , Marcadores de Spin , Temperatura , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...