Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
J Mol Evol ; 91(6): 854-864, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-38060007

RESUMO

Folds are the architecture and topology of a protein domain. Categories of folds are very few compared to the astronomical number of sequences. Eukaryotes have more protein folds than Archaea and Bacteria. These folds are of two types: shared with Archaea and/or Bacteria on one hand and specific to eukaryotic clades on the other hand. The first kind of folds is inherited from the first endosymbiosis and confirms the mixed origin of eukaryotes. In a dataset of 1073 folds whose presence or absence has been evidenced among 210 species equally distributed in the three super-kingdoms, we have identified 28 eukaryotic folds unambiguously inherited from Bacteria and 40 eukaryotic folds unambiguously inherited from Archaea. Compared to previous studies, the repartition of informational function is higher than expected for folds originated from Bacteria and as high as expected for folds inherited from Archaea. The second type of folds is specifically eukaryotic and associated with an increase of new folds within eukaryotes distributed in particular clades. Reconstructed ancestral states coupled with dating of each node on the tree of life provided fold appearance rates. The rate is on average twice higher within Eukaryota than within Bacteria or Archaea. The highest rates are found in the origins of eukaryotes, holozoans, metazoans, metazoans stricto sensu, and vertebrates: the roots of these clades correspond to bursts of fold evolution. We could correlate the functions of some of the fold synapomorphies within eukaryotes with significant evolutionary events. Among them, we find evidence for the rise of multicellularity, adaptive immune system, or virus folds which could be linked to an ecological shift made by tetrapods.


Assuntos
Archaea , Bactérias , Animais , Filogenia , Bactérias/genética , Archaea/genética , Proteínas , Eucariotos/genética , Evolução Biológica
2.
J Struct Biol ; 211(2): 107543, 2020 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-32522553

RESUMO

The effects of a single residue substitution on the protein backbone are frequently quite small and there are many other potential sources of structural variation for protein. We present here a methodology considering different sources of distortions in order to isolate the very effect of the mutation. To validate our methodology, we consider a well-studied family with many single mutants: the human lysozyme. Most of the perturbations are expected to be at the very localisation of the mutation, but in many cases the effects are propagated at long range. We show that the distances between the mutated residue and the 5% most disturbed residues exponentially decreases. One third of the affected residues are in direct contact with the mutated position; the remaining two thirds are potential allosteric effects. We confirm the reliability of the residues identified as significantly perturbed by comparing our results to experimental studies. We confirm with the present method all the previously identified perturbations. This study shows that mutations have long-range impact on protein backbone that can be detected, although the displacement of the affected atoms is small.


Assuntos
Muramidase/ultraestrutura , Proteínas Mutantes/ultraestrutura , Conformação Proteica , Proteínas/ultraestrutura , Sequência de Aminoácidos/genética , Humanos , Muramidase/química , Muramidase/genética , Proteínas Mutantes/química , Proteínas Mutantes/genética , Mutação/genética , Mutação Puntual/genética , Proteínas/química , Proteínas/genética
3.
Bioinformatics ; 35(20): 3970-3980, 2019 10 15.
Artigo em Inglês | MEDLINE | ID: mdl-30942864

RESUMO

MOTIVATION: Multiple sequence alignment programs have proved to be very useful and have already been evaluated in the literature yet not alignment programs based on structure or both sequence and structure. In the present article we wish to evaluate the added value provided through considering structures. RESULTS: We compared the multiple alignments resulting from 25 programs either based on sequence, structure or both, to reference alignments deposited in five databases (BALIBASE 2 and 3, HOMSTRAD, OXBENCH and SISYPHUS). On the whole, the structure-based methods compute more reliable alignments than the sequence-based ones, and even than the sequence+structure-based programs whatever the databases. Two programs lead, MAMMOTH and MATRAS, nevertheless the performances of MUSTANG, MATT, 3DCOMB, TCOFFEE+TM_ALIGN and TCOFFEE+SAP are better for some alignments. The advantage of structure-based methods increases at low levels of sequence identity, or for residues in regular secondary structures or buried ones. Concerning gap management, sequence-based programs set less gaps than structure-based programs. Concerning the databases, the alignments of the manually built databases are more challenging for the programs. AVAILABILITY AND IMPLEMENTATION: All data and results presented in this study are available at: http://wwwabi.snv.jussieu.fr/people/mathilde/download/AliMulComp/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Alinhamento de Sequência , Algoritmos , Bases de Dados de Proteínas , Estrutura Secundária de Proteína , Proteínas , Software
4.
Proteins ; 86(8): 853-867, 2018 08.
Artigo em Inglês | MEDLINE | ID: mdl-29569365

RESUMO

A structural database of 11 families of chains differing by a single amino acid substitution has been built. Another structural dataset of 5 families with identical sequences has been used for comparison. The RMSD computed after a global superimposition of the mutated protein on each native one is smaller than the RMSD calculated among proteins of identical sequences. The effect of the perturbation is very local, and not necessarily the highest at the position of the mutation. A RMSD between mutated and native proteins is computed over a 3-residue or a 7-residue window at each position. To separate the effects of structural fluctuations due to point mutations from other sources, pair RMSD have been translated into P values which themselves are included in a score called P-RANK. This score allows highlighting small backbone distortions by comparing these RMSD between mutated and native positions to the RMSD at the same positions in the absence of a mutation. It results from the P-RANK that 38% of all mutations produce a significant effect on the displacement. When compared with a random distribution of RMSD at un-mutated positions, we show that, even if the RMSD is greater when the mutation is in loops than in regular secondary structure, the relative effect is more important for regular secondary structures and for buried positions. We confirm the absence of correlation between RMSD and the predicted variation of free energy of folding but we found a small correlation between high RMSD and the error in the prediction of ΔΔG.


Assuntos
Mutação Puntual , Proteínas/química , Bases de Dados de Proteínas , Modelos Moleculares , Estrutura Secundária de Proteína , Termodinâmica
5.
J Chem Inf Model ; 58(11): 2355-2368, 2018 11 26.
Artigo em Inglês | MEDLINE | ID: mdl-30299093

RESUMO

Small cyclic peptides represent a promising class of therapeutic molecules with unique chemical properties. However, the poor knowledge of their structural characteristics makes their computational design and structure prediction a real challenge. In order to better describe their conformational space, we developed a method, named EGSCyP, for the exhaustive exploration of the energy landscape of small head-to-tail cyclic peptides. The method can be summarized by (i) a global exploration of the conformational space based on a mechanistic representation of the peptide and the use of robotics-based algorithms to deal with the closure constraint and (ii) an all-atom refinement of the obtained conformations. EGSCyP can handle D-form residues and N-methylations. Two strategies for the side-chains placement were implemented and compared. To validate our approach, we applied it to a set of three variants of cyclic RGDFV pentapeptides, including the drug candidate Cilengitide. A comparative analysis was made with respect to replica exchange molecular dynamics simulations in implicit solvent. Its results show that the EGSCyP method provides a very complete characterization of the conformational space of small cyclic pentapeptides.


Assuntos
Algoritmos , Simulação de Dinâmica Molecular , Peptídeos Cíclicos/química , Venenos de Serpentes/química , Fenômenos Biomecânicos , Análise por Conglomerados , Preparações Farmacêuticas/química , Conformação Proteica , Estrutura Secundária de Proteína
6.
Nat Genet ; 39(3): 366-72, 2007 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-17322883

RESUMO

Autosomal recessive hereditary spastic paraplegia (ARHSP) with thin corpus callosum (TCC) is a common and clinically distinct form of familial spastic paraplegia that is linked to the SPG11 locus on chromosome 15 in most affected families. We analyzed 12 ARHSP-TCC families, refined the SPG11 candidate interval and identified ten mutations in a previously unidentified gene expressed ubiquitously in the nervous system but most prominently in the cerebellum, cerebral cortex, hippocampus and pineal gland. The mutations were either nonsense or insertions and deletions leading to a frameshift, suggesting a loss-of-function mechanism. The identification of the function of the gene will provide insight into the mechanisms leading to the degeneration of the corticospinal tract and other brain structures in this frequent form of ARHSP.


Assuntos
Corpo Caloso/patologia , Mutação , Proteínas/genética , Paraplegia Espástica Hereditária/genética , Paraplegia Espástica Hereditária/patologia , Adolescente , Adulto , Idade de Início , Animais , Sequência de Bases , Células COS , Córtex Cerebral/metabolismo , Criança , Chlorocebus aethiops , Cromossomos Humanos Par 15 , Análise Mutacional de DNA , Ligação Genética , Genótipo , Humanos , Escore Lod , Dados de Sequência Molecular , Linhagem , Proteínas/metabolismo , Ratos , Ratos Sprague-Dawley
7.
BMC Genomics ; 16: 573, 2015 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-26238241

RESUMO

BACKGROUND: The co-chaperone Hop [heat shock protein (HSP) organizing protein] has been shown to act as an adaptor for protein folding and maturation, in concert with Hsp70 and Hsp90. The hop gene is of eukaryotic origin. Likewise, the chloroplast elongation factor G (cEF-G) catalyzes the translocation step in chloroplast protein synthesis. The chl-fus gene, which encodes the cEF-G protein, is of plastid origin. Both proteins, Hop and cEF-G, derived from domain duplications. It was demonstrated that the nuclear chl-fus gene locates in opposite orientation to a hop gene in Glycine max. We explored 53 available plant genomes from Chlorophyta to higher plants, to determine whether the chl-fus gene was transferred directly downstream of the primordial hop in the proto-eukaryote host cell. Since both genes came from exon/module duplication events, we wanted to explore the involvement of introns in the early origin and the ensuing evolutionary changes in gene structure. RESULTS: We reconstructed the evolutionary history of the two convergent plant genes, on the basis of their gene structure, microsynteny and microcolinearity, from 53 plant nuclear genomes. Despite a high degree (72%) of microcolinearity among vascular plants, our results demonstrate that their adjacency was a product of chromosomal rearrangements. Based on predicted exon--intron structures, we inferred the molecular events giving rise to the current form of genes. Therefore, we propose a simple model of exon/module shuffling by intronic recombinations in which phase-0 introns were essential for domain duplication, and a phase-1 intron for transit peptide recruiting. Finally, we demonstrate a natural susceptibility of the intergenic region to recombine or delete, seriously threatening the integrity of the chl-fus gene for the future. CONCLUSIONS: Our results are consistent with the interpretation that the chl-fus gene was transferred from the chloroplast to a chromosome different from that of hop, in the primitive photosynthetic eukaryote, and much later before the appearance of angiosperms, it was recombined downstream of hop. Exon/module shuffling mediated by symmetric intron phases (i.e., phase-0 introns) was essential for gene evolution. The intergenic region is prone to recombine, risking the integrity of both genes.


Assuntos
Proteínas de Cloroplastos/genética , Cloroplastos/genética , Instabilidade Cromossômica/genética , Recombinação Genética , Sequência de Aminoácidos , Evolução Biológica , Genoma de Planta , Proteínas de Choque Térmico/genética , Íntrons/genética , Plantas/genética , Dobramento de Proteína
8.
Methods Mol Biol ; 2627: 195-210, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36959449

RESUMO

Evaluation of the structural perturbations introduced by a single amino acid mutation is the main issue for protein structural biology. We propose here to present some recent advances in methods, allowing the splitting of distortion between the actual substitution effect and the contribution of the local flexibility of the position where the mutation occurs. Its main drawback is the need of many structures with a single mutation in each of them. To bypass this difficulty, we propose to use molecular modeling tools, with several software enabling us to build a model from a template, given the sequence. As a proof of concept, we rely on a gold standard, the human lysozyme. Both wild-type and three mutant structures are available in the PDB. Two of these mutations result in amyloid fibril formation, and the last one is neutral. As a conclusion, irrespective of the algorithm used for modeling, side chain conformations at the site of mutation are reliable, although long-range effects are out of reach of these tools.


Assuntos
Proteínas , Software , Humanos , Mutação , Proteínas/química , Modelos Moleculares , Algoritmos , Conformação Proteica
9.
Evolution ; 76(8): 1706-1719, 2022 08.
Artigo em Inglês | MEDLINE | ID: mdl-35765784

RESUMO

Several studies showed that folds (topology of protein secondary structures) distribution in proteomes may be a global proxy to build phylogeny. Then, some folds should be synapomorphies (derived characters exclusively shared among taxa). However, previous studies used methods that did not allow synapomorphy identification, which requires congruence analysis of folds as individual characters. Here, we map SCOP folds onto a sample of 210 species across the tree of life (TOL). Congruence is assessed using retention index of each fold for the TOL, and principal component analysis for deeper branches. Using a bicluster mapping approach, we define synapomorphic blocks of folds (SBF) sharing similar presence/absence patterns. Among the 1232 folds, 20% are universally present in our TOL, whereas 54% are reliable synapomorphies. These results are similar with CATH and ECOD databases. Eukaryotes are characterized by a large number of them, and several SBFs clearly support nested eukaryotic clades (divergence times from 1100 to 380 mya). Although clearly separated, the three superkingdoms reveal a strong mosaic pattern. This pattern is consistent with the dual origin of eukaryotes and witness secondary endosymbiosis in their phothosynthetic clades. Our study unveils direct analysis of folds synapomorphies as key characters to unravel evolutionary history of species.


Assuntos
Evolução Biológica , Eucariotos , Filogenia , Simbiose
10.
Nucleic Acids Res ; 37(Database issue): D374-9, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18945702

RESUMO

SPROUTS (Structural Prediction for pRotein fOlding UTility System) is a new database that provides access to various structural data sets and integrated functionalities not yet available to the community. The originality of the SPROUTS database is the ability to gain access to a variety of structural analyses at one place and with a strong interaction between them. SPROUTS currently combines data pertaining to 429 structures that capture representative folds and results related to the prediction of critical residues expected to belong to the folding nucleus: the MIR (Most Interacting Residues), the description of the structures in terms of modular fragments: the TEF (Tightened End Fragments), and the calculation at each position of the free energy change gradient upon mutation by one of the 19 amino acids. All database results can be displayed and downloaded in textual files and Excel spreadsheets and visualized on the protein structure. SPROUTS is a unique resource to access as well as visualize state-of-the-art characteristics of protein folding and analyse the effect of point mutations on protein structure. It is available at http://bioinformatics.eas.asu.edu/sprouts.html.


Assuntos
Bases de Dados de Proteínas , Mutação Puntual , Conformação Proteica , Estabilidade Proteica , Proteínas/genética , Gráficos por Computador , Dobramento de Proteína , Proteínas/química
11.
Cell Stress Chaperones ; 14(3): 281-9, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-18987995

RESUMO

The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR-DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR-DP domains.


Assuntos
Sequência de Aminoácidos , Íntrons/genética , Chaperonas Moleculares , Análise de Sequência de Proteína , Animais , Células Eucarióticas/fisiologia , Evolução Molecular , Proteínas de Choque Térmico HSP70/genética , Proteínas de Choque Térmico HSP70/metabolismo , Proteínas de Choque Térmico HSP90/genética , Proteínas de Choque Térmico HSP90/metabolismo , Humanos , Chaperonas Moleculares/genética , Chaperonas Moleculares/metabolismo , Dados de Sequência Molecular , Células Procarióticas/fisiologia , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína
12.
Biochimie ; 151: 37-41, 2018 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-29857183

RESUMO

Evaluating the model quality of protein structures that evolve in environments with particular physicochemical properties requires scoring functions that are adapted to their specific residue compositions and/or structural characteristics. Thus, computational methods developed for structures from the cytosol cannot work properly on membrane or secreted proteins. Here, we present MyPMFs, an easy-to-use tool that allows users to train statistical potentials of mean force (PMFs) on the protein structures of their choice, with all parameters being adjustable. We demonstrate its use by creating an accurate statistical potential for transmembrane protein domains. We also show its usefulness to study the influence of the physical environment on residue interactions within protein structures. Our open-source software is freely available for download at https://github.com/bibip-impmc/mypmfs.


Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Conformação Proteica , Proteínas/química , Aminoácidos/química
13.
Biochem Biophys Res Commun ; 364(2): 325-31, 2007 Dec 14.
Artigo em Inglês | MEDLINE | ID: mdl-17950698

RESUMO

The evolutionary origin of some nuclear encoded proteins that translocate proteins across the chloroplast envelope remains unknown. Therefore, sequences of GTPase proteins constituting the Arabidopsis thaliana translocon at the outer membrane of chloroplast (atToc) complexes were analyzed by means of HCA. In particular, atToc159 and related proteins (atToc132, atToc120, and atToc90) do not have proven homologues of prokaryotic or eukaryotic ancestry. We established that the three domains commonly referred to as A, G, and M originate from the GTPase G domain, tandemly repeated, and probably evolving toward an unstructured conformation in the case of the A domain. It resulted from this study a putative common ancestor for these proteins and a new domain definition, in particular the splitting of A into three domains (A1, A2, and A3), has been proposed. The family of Toc159, previously containing A. thaliana and Pisum sativum, has been extended to Medicago truncatula and Populus trichocarpa and it has been revised for Oryza sativa. They have also been compared to GTPase subunits involved in the cpSRP system. A distant homology has been revealed among Toc and cpSRP GTP-hydrolyzing proteins of A. thaliana, and repetitions of a GTPase domain were also found in cpSRP protein receptors, by means of HCA analysis.


Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , GTP Fosfo-Hidrolases/genética , Genoma de Planta , Proteínas de Membrana/genética , Partícula de Reconhecimento de Sinal/genética , Tilacoides/metabolismo , Sequência de Aminoácidos , Medicago truncatula/genética , Dados de Sequência Molecular , Oryza/genética , Pisum sativum/genética , Populus/genética , Estrutura Terciária de Proteína , Subunidades Proteicas/genética , Homologia de Sequência de Aminoácidos
14.
J Biomol Struct Dyn ; 22(4): 487-92, 2005 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-15588112

RESUMO

Database scanning programs such as BLAST and FASTA are used nowadays by most biologists for the post-genomic processing of DNA or protein sequence information (in particular to retrieve the structure/function of uncharacterized proteins). Unfortunately, their results can be polluted by identical alignments (called redundancies) coming from the same protein or DNA sequences present in different entries of the database. This makes the efficient use of the listed alignments difficult. Pretreatment of databases has been proposed to suppress strictly identical entries. However, there still remain many identical alignments since redundancies may occur locally for entries corresponding to various fragments of the same sequence or for entries corresponding to very homologous sequences but differing at the level of a few residues such as ortholog proteins. In the present work, we show that redundant alignments can be indeed numerous even when working with a pretreated non-redundant data bank, going as high as 60% of the output results according to the query and the bank. Therefore the accuracy and the efficiency of the post-genomic work will be greatly increased if these redundancies are removed. To solve this up to now unaddressed problem, we have developed an algorithm that allows for the efficient and safe suppression of all the redundancies with no loss of information. This algorithm is based on various filtering steps that we describe here in the context of the Automat similarity search program, and such an algorithm should also be added to the other similarity search programs (BLAST, FASTA, etc...).


Assuntos
Algoritmos , Bases de Dados Factuais , Alinhamento de Sequência/métodos , Homologia de Sequência de Aminoácidos , Sequência de Aminoácidos , Arabidopsis/enzimologia , Arabidopsis/genética , Sequência Consenso , DNA/química , Sistemas de Gerenciamento de Base de Dados , Genoma Bacteriano , Genoma de Planta , Hidrolases/química , Armazenamento e Recuperação da Informação , Dados de Sequência Molecular , Proteínas/química , Proteínas/genética
15.
J Leukoc Biol ; 76(1): 125-34, 2004 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-15075357

RESUMO

Recent reports suggest that immunity against the prion protein (PrP) retards transmissible spongiform encephalopathies progression in infected mice. A major obstacle to the development of vaccines comes from the fact that PrP is poorly immunogenic, as it is seen as self by the host immune system. Additional questions concern the immune mechanisms involved in protection and the risk of eliciting adverse reactions in the central nervous system of treated patients. Peptide-based vaccines offer an attractive strategy to overcome these difficulties. We have undertaken the identification of the immunogenic regions of PrP, which trigger helper T cells (Th) associated with antibody production. Our results identify two main regions, one between the structured and flexible portion of PrP (98-127) and a second between alpha 1 and alpha 2 helix (143-187). Peptides (30-mer) corresponding to these regions elicit class II-restricted Th cells and antibody production against native PrP and could therefore be of potential interest for a peptide-based vaccination.


Assuntos
Epitopos de Linfócito T , Proteínas PrPC/química , Proteínas PrPC/imunologia , Subpopulações de Linfócitos T/imunologia , Linfócitos T Auxiliares-Indutores/imunologia , Sequência de Aminoácidos , Animais , Citocinas/biossíntese , Ensaio de Imunoadsorção Enzimática , Imunofluorescência , Antígenos de Histocompatibilidade Classe II , Camundongos , Dados de Sequência Molecular , Biblioteca de Peptídeos , Peptídeos/imunologia , Proteínas Recombinantes/imunologia , Transfecção
16.
J Integr Bioinform ; 12(3): 277, 2015 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-26673793

RESUMO

Scientific legacy workflows are often developed over many years, poorly documented and implemented with scripting languages. In the context of our cross-disciplinary projects we face the problem of maintaining such scientific workflows. This paper presents the Workflow Instrumentation for Structure Extraction (WISE) method used to process several ad-hoc legacy workflows written in Python and automatically produce their workflow structural skeleton. Unlike many existing methods, WISE does not assume input workflows to be preprocessed in a known workflow formalism. It is also able to identify and analyze calls to external tools. We present the method and report its results on several scientific workflows.


Assuntos
Mineração de Dados/métodos , Processamento Eletrônico de Dados/métodos , Linguagens de Programação
17.
PLoS One ; 10(4): e0125098, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25915049

RESUMO

BACKGROUND: Folding nucleus of globular proteins formation starts by the mutual interaction of a group of hydrophobic amino acids whose close contacts allow subsequent formation and stability of the 3D structure. These early steps can be predicted by simulation of the folding process through a Monte Carlo (MC) coarse grain model in a discrete space. We previously defined MIRs (Most Interacting Residues), as the set of residues presenting a large number of non-covalent neighbour interactions during such simulation. MIRs are good candidates to define the minimal number of residues giving rise to a given fold instead of another one, although their proportion is rather high, typically [15-20]% of the sequences. Having in mind experiments with two sequences of very high levels of sequence identity (up to 90%) but different folds, we combined the MIR method, which takes sequence as single input, with the "fuzzy oil drop" (FOD) model that requires a 3D structure, in order to estimate the residues coding for the fold. FOD assumes that a globular protein follows an idealised 3D Gaussian distribution of hydrophobicity density, with the maximum in the centre and minima at the surface of the "drop". If the actual local density of hydrophobicity around a given amino acid is as high as the ideal one, then this amino acid is assigned to the core of the globular protein, and it is assumed to follow the FOD model. Therefore one obtains a distribution of the amino acids of a protein according to their agreement or rejection with the FOD model. RESULTS: We compared and combined MIR and FOD methods to define the minimal nucleus, or keystone, of two populated folds: immunoglobulin-like (Ig) and flavodoxins (Flav). The combination of these two approaches defines some positions both predicted as a MIR and assigned as accordant with the FOD model. It is shown here that for these two folds, the intersection of the predicted sets of residues significantly differs from random selection. It reduces the number of selected residues by each individual method and allows a reasonable agreement with experimentally determined key residues coding for the particular fold. In addition, the intersection of the two methods significantly increases the specificity of the prediction, providing a robust set of residues that constitute the folding nucleus.


Assuntos
Flavodoxina/química , Imunoglobulinas/química , Modelos Moleculares , Algoritmos , Sequência de Aminoácidos , Animais , Bactérias/química , Bactérias/metabolismo , Sítios de Ligação , Humanos , Interações Hidrofóbicas e Hidrofílicas , Método de Monte Carlo , Dobramento de Proteína , Estrutura Secundária de Proteína
18.
Proteins ; 49(4): 446-56, 2002 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-12402355

RESUMO

A three-dimensional Voronoi tessellation of folded proteins is used to analyze geometrical and topological properties of a set of proteins. To each amino acid is associated a central point surrounded by a Voronoi cell. Voronoi cells describe the packing of the amino acids. Special attention is given to reproduction of the protein surface. Once the Voronoi cells are built, a lot of tools from geometrical analysis can be applied to investigate the protein structure; volume of cells, number of faces per cell, and number of sides per face are the usual signatures of the protein structure. A distinct difference between faces related to primary, secondary, and tertiary structures has been observed. Faces threaded by the main-chain have on average more than six edges, whereas those related to helical packing of the amino acid chain have less than five edges. The faces on the protein surface have on average five edges within 1% error. The average number of faces on the protein surface for a given type of amino acid brings a new point of view in the characterization of the exposition to the solvent and the classification of amino acid as hydrophilic or hydrophobic. It may be a convenient tool for model validation.


Assuntos
Proteínas de Bactérias , Biologia Computacional/métodos , Dobramento de Proteína , Proteínas/química , Solventes , Algoritmos , Proteínas de Escherichia coli/química , Interações Hidrofóbicas e Hidrofílicas , Proteínas de Membrana/química , Proteínas Quimiotáticas Aceptoras de Metil , Modelos Moleculares , Conformação Proteica , Reprodutibilidade dos Testes , Eletricidade Estática , Distribuições Estatísticas
19.
C R Biol ; 327(5): 431-43, 2004 May.
Artigo em Inglês | MEDLINE | ID: mdl-15255474

RESUMO

The folding process of a set of 42 proteins, representative of the various folds, has been simulated by means of a Monte Carlo method on a discrete lattice, using two different potentials of mean force. Multiple compact fragments of contiguous residues are formed in the simulation, stable in composition, but not in geometry. During time, the number of fragments decreases until one final compact globular state is reached. We focused on the early steps of the folding in order to evidence the maximum number of fragments, provided they are sufficiently stable in sequence. A correlation has been established between these proto fragments and regular secondary-structure elements, whatever their nature, alpha helices or beta strands. Quantitatively, this is revealed by an overall mean one-residue quality factor of nearly 60%, which is better for proteins mainly composed of alpha helices. The correspondence between the number of fragments and the number of secondary-structure elements is of 77% and the regions separating successive fragments are mainly located in loops. Besides, hydrophobic clusters deduced from HCA correspond to fragments with an equivalent accuracy. These results suggest that folding pathways do not contain structurally static intermediate. However, since the beginning of folding, most residues that will later form one given secondary structure are kept close in space by being involved in the same fragment. This aggregation may be a way to accelerate the formation of the native state and enforces the key role played by hydrophobic residues in the formation of the fragments, thus in the folding process itself.


Assuntos
Fragmentos de Peptídeos/química , Dobramento de Proteína , Proteínas/química , Proteínas/metabolismo , Simulação por Computador , Modelos Moleculares , Fragmentos de Peptídeos/metabolismo , Conformação Proteica , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa