Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Database (Oxford) ; 20202020 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-32449511

RESUMEN

Determining the molecular function of enzymes discovered by genome sequencing represents a primary foundation for understanding many aspects of biology. Historically, classification of enzyme reactions has used the enzyme nomenclature system developed to describe the overall reactions performed by biochemically characterized enzymes, irrespective of their associated sequences. In contrast, functional classification and assignment for the millions of protein sequences of unknown function now available is largely done in two computational steps, first by similarity-based assignment of newly obtained sequences to homologous groups, followed by transferring to them the known functions of similar biochemically characterized homologs. Due to the fundamental differences in their etiologies and practice, `how' these chemistry- and evolution-centric functional classification systems relate to each other has been difficult to explore on a large scale. To investigate this issue in a new way, we integrated two published ontologies that had previously described each of these classification systems independently. The resulting infrastructure was then used to compare the functional assignments obtained from each classification system for the well-studied and functionally diverse enolase superfamily. Mapping these function assignments to protein structure and reaction similarity networks shows a profound and complex disconnect between the homology- and chemistry-based classification systems. This conclusion mirrors previous observations suggesting that except for closely related sequences, facile annotation transfer from small numbers of characterized enzymes to the huge number uncharacterized homologs to which they are related is problematic. Our extension of these comparisons to large enzyme superfamilies in a computationally intelligent manner provides a foundation for new directions in protein function prediction for the huge proportion of sequences of unknown function represented in major databases. Interactive sequence, reaction, substrate and product similarity networks computed for this work for the enolase and two other superfamilies are freely available for download from the Structure Function Linkage Database Archive (http://sfld.rbvi.ucsf.edu).


Asunto(s)
Biología Computacional/métodos , Bases de Datos de Proteínas , Enzimas , Enzimas/química , Enzimas/clasificación , Enzimas/fisiología , Anotación de Secuencia Molecular , Relación Estructura-Actividad
2.
J Biol Chem ; 295(2): 314-324, 2020 01 10.
Artículo en Inglés | MEDLINE | ID: mdl-31796628

RESUMEN

The catalytic residues of an enzyme comprise the amino acids located in the active center responsible for accelerating the enzyme-catalyzed reaction. These residues lower the activation energy of reactions by performing several catalytic functions. Decades of enzymology research has established general themes regarding the roles of specific residues in these catalytic reactions, but it has been more difficult to explore these roles in a more systematic way. Here, we review the data on the catalytic residues of 648 enzymes, as annotated in the Mechanism and Catalytic Site Atlas (M-CSA), and compare our results with those in previous studies. We structured this analysis around three key properties of the catalytic residues: amino acid type, catalytic function, and sequence conservation in homologous proteins. As expected, we observed that catalysis is mostly accomplished by a small set of residues performing a limited number of catalytic functions. Catalytic residues are typically highly conserved, but to a smaller degree in homologues that perform different reactions or are nonenzymes (pseudoenzymes). Cross-analysis yielded further insights revealing which residues perform particular functions and how often. We obtained more detailed specificity rules for certain functions by identifying the chemical group upon which the residue acts. Finally, we show the mutation tolerance of the catalytic residues based on their roles. The characterization of the catalytic residues, their functions, and conservation, as presented here, is key to understanding the impact of mutations in evolution, disease, and enzyme design. The tools developed for this analysis are available at the M-CSA website and allow for user specific analysis of the same data.


Asunto(s)
Aminoácidos/química , Dominio Catalítico , Enzimas/química , Secuencia de Aminoácidos , Aminoácidos/metabolismo , Animales , Biocatálisis , Secuencia Conservada , Bases de Datos de Proteínas , Enzimas/metabolismo , Humanos
3.
Methods Enzymol ; 606: 1-71, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30097089

RESUMEN

The radical SAM superfamily contains over 100,000 homologous enzymes that catalyze a remarkably broad range of reactions required for life, including metabolism, nucleic acid modification, and biogenesis of cofactors. While the highly conserved SAM-binding motif responsible for formation of the key 5'-deoxyadenosyl radical intermediate is a key structural feature that simplifies identification of superfamily members, our understanding of their structure-function relationships is complicated by the modular nature of their structures, which exhibit varied and complex domain architectures. To gain new insight about these relationships, we classified the entire set of sequences into similarity-based subgroups that could be visualized using sequence similarity networks. This superfamily-wide analysis reveals important features that had not previously been appreciated from studies focused on one or a few members. Functional information mapped to the networks indicates which members have been experimentally or structurally characterized, their known reaction types, and their phylogenetic distribution. Despite the biological importance of radical SAM chemistry, the vast majority of superfamily members have never been experimentally characterized in any way, suggesting that many new reactions remain to be discovered. In addition to 20 subgroups with at least one known function, we identified additional subgroups made up entirely of sequences of unknown function. Importantly, our results indicate that even general reaction types fail to track well with our sequence similarity-based subgroupings, raising major challenges for function prediction for currently identified and new members that continue to be discovered. Interactive similarity networks and other data from this analysis are available from the Structure-Function Linkage Database.


Asunto(s)
Enzimas/clasificación , Radicales Libres/metabolismo , Dominios Proteicos/genética , S-Adenosilmetionina/metabolismo , Secuencia de Aminoácidos/genética , Biología Computacional , Enzimas/química , Enzimas/genética , Enzimas/metabolismo , Evolución Molecular , Radicales Libres/química , Filogenia , S-Adenosilmetionina/química , Alineación de Secuencia , Relación Estructura-Actividad
4.
Nucleic Acids Res ; 46(D1): D618-D623, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29106569

RESUMEN

M-CSA (Mechanism and Catalytic Site Atlas) is a database of enzyme active sites and reaction mechanisms that can be accessed at www.ebi.ac.uk/thornton-srv/m-csa. Our objectives with M-CSA are to provide an open data resource for the community to browse known enzyme reaction mechanisms and catalytic sites, and to use the dataset to understand enzyme function and evolution. M-CSA results from the merging of two existing databases, MACiE (Mechanism, Annotation and Classification in Enzymes), a database of enzyme mechanisms, and CSA (Catalytic Site Atlas), a database of catalytic sites of enzymes. We are releasing M-CSA as a new website and underlying database architecture. At the moment, M-CSA contains 961 entries, 423 of these with detailed mechanism information, and 538 with information on the catalytic site residues only. In total, these cover 81% (195/241) of third level EC numbers with a PDB structure, and 30% (840/2793) of fourth level EC numbers with a PDB structure, out of 6028 in total. By searching for close homologues, we are able to extend M-CSA coverage of PDB and UniProtKB to 51 993 structures and to over five million sequences, respectively, of which about 40% and 30% have a conserved active site.


Asunto(s)
Bases de Datos de Proteínas , Enzimas/química , Enzimas/metabolismo , Biocatálisis , Dominio Catalítico , Curaduría de Datos , Humanos , Internet , Interfaz Usuario-Computador , Navegador Web
5.
J Biol Chem ; 293(7): 2342-2357, 2018 02 16.
Artículo en Inglés | MEDLINE | ID: mdl-29184004

RESUMEN

The tautomerase superfamily (TSF) consists of more than 11,000 nonredundant sequences present throughout the biosphere. Characterized members have attracted much attention because of the unusual and key catalytic role of an N-terminal proline. These few characterized members catalyze a diverse range of chemical reactions, but the full scale of their chemical capabilities and biological functions remains unknown. To gain new insight into TSF structure-function relationships, we performed a global analysis of similarities across the entire superfamily and computed a sequence similarity network to guide classification into distinct subgroups. Our results indicate that TSF members are found in all domains of life, with most being present in bacteria. The eukaryotic members of the cis-3-chloroacrylic acid dehalogenase subgroup are limited to fungal species, whereas the macrophage migration inhibitory factor subgroup has wide eukaryotic representation (including mammals). Unexpectedly, we found that 346 TSF sequences lack Pro-1, of which 85% are present in the malonate semialdehyde decarboxylase subgroup. The computed network also enabled the identification of similarity paths, namely sequences that link functionally diverse subgroups and exhibit transitional structural features that may help explain reaction divergence. A structure-guided comparison of these linker proteins identified conserved transitions between them, and kinetic analysis paralleled these observations. Phylogenetic reconstruction of the linker set was consistent with these findings. Our results also suggest that contemporary TSF members may have evolved from a short 4-oxalocrotonate tautomerase-like ancestor followed by gene duplication and fusion. Our new linker-guided strategy can be used to enrich the discovery of sequence/structure/function transitions in other enzyme superfamilies.


Asunto(s)
Enzimas/química , Enzimas/metabolismo , Eucariontes/enzimología , Familia de Multigenes , Secuencia de Aminoácidos , Animales , Sitios de Unión , Cristalografía por Rayos X , Enzimas/genética , Eucariontes/química , Eucariontes/clasificación , Eucariontes/genética , Evolución Molecular , Humanos , Cinética , Datos de Secuencia Molecular , Filogenia , Plantas/química , Plantas/enzimología , Plantas/genética , Alineación de Secuencia
7.
Database (Oxford) ; 2017(1)2017 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-28365730

RESUMEN

With ever-increasing amounts of sequence data available in both the primary literature and sequence repositories, there is a bottleneck in annotating molecular function to a sequence. This article describes the biocuration process and methods used in the structure-function linkage database (SFLD) to help address some of the challenges. We discuss how the hierarchy within the SFLD allows us to infer detailed functional properties for functionally diverse enzyme superfamilies in which all members are homologous, conserve an aspect of their chemical function and have associated conserved structural features that enable the chemistry. Also presented is the Enzyme Structure-Function Ontology (ESFO), which has been designed to capture the relationships between enzyme sequence, structure and function that underlie the SFLD and is used to guide the biocuration processes within the SFLD. Database URL: http://sfld.rbvi.ucsf.edu/.


Asunto(s)
Bases de Datos de Proteínas , Enzimas/química , Enzimas/genética , Ontología de Genes , Anotación de Secuencia Molecular , Homología Estructural de Proteína , Relación Estructura-Actividad
8.
Nucleic Acids Res ; 45(D1): D190-D199, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899635

RESUMEN

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences.


Asunto(s)
Biología Computacional/métodos , Bases de Datos de Proteínas , Dominios y Motivos de Interacción de Proteínas , Programas Informáticos , Humanos , Anotación de Secuencia Molecular , Filogenia
9.
Methods Mol Biol ; 1446: 111-132, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-27812939

RESUMEN

The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. From identifying the nearest well annotated homologue of a protein of interest to predicting where misannotation has occurred to knowing how confident you can be in the annotations assigned to those proteins is critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene products, we focus here a describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.


Asunto(s)
Enzimas/clasificación , Enzimas/genética , Ontología de Genes , Anotación de Secuencia Molecular/métodos , Animales , Biología Computacional/métodos , Bases de Datos de Proteínas , Enzimas/metabolismo , Humanos
10.
Bioinformatics ; 32(13): 2065-6, 2016 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-27153692

RESUMEN

UNLABELLED: Extracting chemical features like Atom-Atom Mapping (AAM), Bond Changes (BCs) and Reaction Centres from biochemical reactions helps us understand the chemical composition of enzymatic reactions. Reaction Decoder is a robust command line tool, which performs this task with high accuracy. It supports standard chemical input/output exchange formats i.e. RXN/SMILES, computes AAM, highlights BCs and creates images of the mapped reaction. This aids in the analysis of metabolic pathways and the ability to perform comparative studies of chemical reactions based on these features. AVAILABILITY AND IMPLEMENTATION: This software is implemented in Java, supported on Windows, Linux and Mac OSX, and freely available at https://github.com/asad/ReactionDecoder CONTACT: : asad@ebi.ac.uk or s9asad@gmail.com.


Asunto(s)
Bioquímica/métodos , Biología Computacional/métodos , Redes y Vías Metabólicas , Programas Informáticos , Minería de Datos , Bases de Datos de Compuestos Químicos
11.
Database (Oxford) ; 2015: bav063, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26284514

RESUMEN

During 11-12 August 2014, a Protein Bioinformatics and Community Resources Retreat was held at the Wellcome Trust Genome Campus in Hinxton, UK. This meeting brought together the principal investigators of several specialized protein resources (such as CAZy, TCDB and MEROPS) as well as those from protein databases from the large Bioinformatics centres (including UniProt and RefSeq). The retreat was divided into five sessions: (1) key challenges, (2) the databases represented, (3) best practices for maintenance and curation, (4) information flow to and from large data centers and (5) communication and funding. An important outcome of this meeting was the creation of a Specialist Protein Resource Network that we believe will improve coordination of the activities of its member resources. We invite further protein database resources to join the network and continue the dialogue.


Asunto(s)
Biología Computacional , Bases de Datos de Ácidos Nucleicos , Bases de Datos de Proteínas , Anotación de Secuencia Molecular , Proteínas , Congresos como Asunto , Humanos , Proteínas/química , Proteínas/genética
12.
Database (Oxford) ; 2015: bav043, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25957950

RESUMEN

Biocuration has become a cornerstone for analyses in biology, and to meet needs, the amount of annotations has considerably grown in recent years. However, the reliability of these annotations varies; it has thus become necessary to be able to assess the confidence in annotations. Although several resources already provide confidence information about the annotations that they produce, a standard way of providing such information has yet to be defined. This lack of standardization undermines the propagation of knowledge across resources, as well as the credibility of results from high-throughput analyses. Seeded at a workshop during the Biocuration 2012 conference, a working group has been created to address this problem. We present here the elements that were identified as essential for assessing confidence in annotations, as well as a draft ontology--the Confidence Information Ontology--to illustrate how the problems identified could be addressed. We hope that this effort will provide a home for discussing this major issue among the biocuration community. Tracker URL: https://github.com/BgeeDB/confidence-information-ontology Ontology URL: https://raw.githubusercontent.com/BgeeDB/confidence-information-ontology/master/src/ontology/cio-simple.obo


Asunto(s)
Ontologías Biológicas , Curaduría de Datos/normas , Congresos como Asunto
13.
Proteins ; 83(6): 1005-13, 2015 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-25820941

RESUMEN

As the volume of data relating to proteins increases, researchers rely more and more on the analysis of published data, thus increasing the importance of good access to these data that vary from the supplemental material of individual articles, all the way to major reference databases with professional staff and long-term funding. Specialist protein resources fill an important middle ground, providing interactive web interfaces to their databases for a focused topic or family of proteins, using specialized approaches that are not feasible in the major reference databases. Many are labors of love, run by a single lab with little or no dedicated funding and there are many challenges to building and maintaining them. This perspective arose from a meeting of several specialist protein resources and major reference databases held at the Wellcome Trust Genome Campus (Cambridge, UK) on August 11 and 12, 2014. During this meeting some common key challenges involved in creating and maintaining such resources were discussed, along with various approaches to address them. In laying out these challenges, we aim to inform users about how these issues impact our resources and illustrate ways in which our working together could enhance their accuracy, currency, and overall value.


Asunto(s)
Bases de Datos de Proteínas/normas , Anotación de Secuencia Molecular , Proteínas , Curaduría de Datos
14.
Biochemistry ; 54(9): 1807-18, 2015 Mar 10.
Artículo en Inglés | MEDLINE | ID: mdl-25654171

RESUMEN

HydE and HydG are radical S-adenosyl-l-methionine enzymes required for the maturation of [FeFe]-hydrogenase (HydA) and produce the nonprotein organic ligands characteristic of its unique catalytic cluster. The catalytic cluster of HydA (the H-cluster) is a typical [4Fe-4S] cubane bridged to a 2Fe-subcluster that contains two carbon monoxides, three cyanides, and a bridging dithiomethylamine as ligands. While recent studies have shed light on the nature of diatomic ligand biosynthesis by HydG, little information exists on the function of HydE. Herein, we present biochemical, spectroscopic, bioinformatic, and molecular modeling data that together map the active site and provide significant insight into the role of HydE in H-cluster biosynthesis. Electron paramagnetic resonance and UV-visible spectroscopic studies demonstrate that reconstituted HydE binds two [4Fe-4S] clusters and copurifies with S-adenosyl-l-methionine. Incorporation of deuterium from D2O into 5'-deoxyadenosine, the cleavage product of S-adenosyl-l-methionine, coupled with molecular docking experiments suggests that the HydE substrate contains a thiol functional group. This information, along with HydE sequence similarity and genome context networks, has allowed us to redefine the presumed mechanism for HydE away from BioB-like sulfur insertion chemistry; these data collectively suggest that the source of the sulfur atoms in the dithiomethylamine bridge of the H-cluster is likely derived from HydE's thiol containing substrate.


Asunto(s)
Clostridium acetobutylicum/enzimología , Dimetilaminas/metabolismo , Hidrogenasas/metabolismo , Proteínas Hierro-Azufre/metabolismo , Procesamiento Proteico-Postraduccional , Azufre/metabolismo , Catálisis , Dominio Catalítico , Desoxiadenosinas/química , Desoxiadenosinas/metabolismo , Deuterio/química , Espectroscopía de Resonancia por Spin del Electrón , Hidrogenasas/química , Hierro/metabolismo , Proteínas Hierro-Azufre/química , Simulación del Acoplamiento Molecular , Espectrofotometría Ultravioleta , Azufre/química
15.
J Mol Biol ; 426(10): 2098-111, 2014 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-24657765

RESUMEN

Using a novel method to map and cluster chemical reactions, we have re-examined the chemistry of the ligases [Enzyme Commission (EC) Class 6] and their associated protein families in detail. The type of bond formed by the ligase can be automatically extracted from the equation of the reaction, replicating the EC subclass division. However, this subclass division hides considerable complexities, especially for the C-N forming ligases, which fall into at least three distinct types. The lower levels of the EC classification for ligases are somewhat arbitrary in their definition and add little to understanding their chemistry or evolution. By comparing the multi-domain architecture of the enzymes and using sequence similarity networks, we examined the links between overall reaction and evolution of the ligases. These show that, whilst many enzymes that perform the same overall chemistry group together, both convergent (similar function, different ancestral lineage) and divergent (different function, common ancestor) evolution of function are observed. However, a common theme is that a single conserved domain (often the nucleoside triphosphate binding domain) is combined with ancillary domains that provide the variation in substrate binding and function.


Asunto(s)
Ligasas/química , Ligasas/fisiología , Secuencia de Aminoácidos , Animales , Catálisis , Análisis por Conglomerados , Evolución Molecular , Humanos , Ligasas/clasificación , Estructura Terciaria de Proteína , Relación Estructura-Actividad
16.
Nat Methods ; 11(2): 171-4, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24412978

RESUMEN

We present EC-BLAST (http://www.ebi.ac.uk/thornton-srv/software/rbl/), an algorithm and Web tool for quantitative similarity searches between enzyme reactions at three levels: bond change, reaction center and reaction structure similarity. It uses bond changes and reaction patterns for all known biochemical reactions derived from atom-atom mapping across each reaction. EC-BLAST has the potential to improve enzyme classification, identify previously uncharacterized or new biochemical transformations, improve the assignment of enzyme function to sequences, and assist in enzyme engineering.


Asunto(s)
Algoritmos , Bases de Datos de Proteínas , Enzimas/química , Enzimas/metabolismo , Programas Informáticos , Animales , Fenómenos Bioquímicos , Catálisis , Enzimas/clasificación , Humanos , Internet
17.
Nucleic Acids Res ; 42(Database issue): D485-9, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24319146

RESUMEN

Understanding which are the catalytic residues in an enzyme and what function they perform is crucial to many biology studies, particularly those leading to new therapeutics and enzyme design. The original version of the Catalytic Site Atlas (CSA) (http://www.ebi.ac.uk/thornton-srv/databases/CSA) published in 2004, which catalogs the residues involved in enzyme catalysis in experimentally determined protein structures, had only 177 curated entries and employed a simplistic approach to expanding these annotations to homologous enzyme structures. Here we present a new version of the CSA (CSA 2.0), which greatly expands the number of both curated (968) and automatically annotated catalytic sites in enzyme structures, utilizing a new method for annotation transfer. The curated entries are used, along with the variation in residue type from the sequence comparison, to generate 3D templates of the catalytic sites, which in turn can be used to find catalytic sites in new structures. To ease the transfer of CSA annotations to other resources a new ontology has been developed: the Enzyme Mechanism Ontology, which has permitted the transfer of annotations to Mechanism, Annotation and Classification in Enzymes (MACiE) and UniProt Knowledge Base (UniProtKB) resources. The CSA database schema has been re-designed and both the CSA data and search capabilities are presented in a new modern web interface.


Asunto(s)
Dominio Catalítico , Bases de Datos de Proteínas , Enzimas/química , Ontologías Biológicas , Internet , Análisis de Secuencia de Proteína
18.
Nucleic Acids Res ; 42(Database issue): D521-30, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24271399

RESUMEN

The Structure-Function Linkage Database (SFLD, http://sfld.rbvi.ucsf.edu/) is a manually curated classification resource describing structure-function relationships for functionally diverse enzyme superfamilies. Members of such superfamilies are diverse in their overall reactions yet share a common ancestor and some conserved active site features associated with conserved functional attributes such as a partial reaction. Thus, despite their different functions, members of these superfamilies 'look alike', making them easy to misannotate. To address this complexity and enable rational transfer of functional features to unknowns only for those members for which we have sufficient functional information, we subdivide superfamily members into subgroups using sequence information, and lastly into families, sets of enzymes known to catalyze the same reaction using the same mechanistic strategy. Browsing and searching options in the SFLD provide access to all of these levels. The SFLD offers manually curated as well as automatically classified superfamily sets, both accompanied by search and download options for all hierarchical levels. Additional information includes multiple sequence alignments, tab-separated files of functional and other attributes, and sequence similarity networks. The latter provide a new and intuitively powerful way to visualize functional trends mapped to the context of sequence similarity.


Asunto(s)
Bases de Datos de Proteínas , Enzimas/química , Enzimas/clasificación , Enzimas/metabolismo , Internet , Anotación de Secuencia Molecular , Alineación de Secuencia , Relación Estructura-Actividad
19.
Nucleic Acids Res ; 41(Database issue): D773-80, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23175605

RESUMEN

The availability of comprehensive information about enzymes plays an important role in answering questions relevant to interdisciplinary fields such as biochemistry, enzymology, biofuels, bioengineering and drug discovery. At the EMBL European Bioinformatics Institute, we have developed an enzyme portal (http://www.ebi.ac.uk/enzymeportal) to provide this wealth of information on enzymes from multiple in-house resources addressing particular data classes: protein sequence and structure, reactions, pathways and small molecules. The fact that these data reside in separate databases makes information discovery cumbersome. The main goal of the portal is to simplify this process for end users.


Asunto(s)
Bases de Datos de Proteínas , Enzimas/química , Enzimas/metabolismo , Enfermedad , Enzimas/genética , Internet , Conformación Proteica , Interfaz Usuario-Computador
20.
PLoS Comput Biol ; 8(3): e1002403, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22396634

RESUMEN

In order to understand the evolution of enzyme reactions and to gain an overview of biological catalysis we have combined sequence and structural data to generate phylogenetic trees in an analysis of 276 structurally defined enzyme superfamilies, and used these to study how enzyme functions have evolved. We describe in detail the analysis of two superfamilies to illustrate different paradigms of enzyme evolution. Gathering together data from all the superfamilies supports and develops the observation that they have all evolved to act on a diverse set of substrates, whilst the evolution of new chemistry is much less common. Despite that, by bringing together so much data, we can provide a comprehensive overview of the most common and rare types of changes in function. Our analysis demonstrates on a larger scale than previously studied, that modifications in overall chemistry still occur, with all possible changes at the primary level of the Enzyme Commission (E.C.) classification observed to a greater or lesser extent. The phylogenetic trees map out the evolutionary route taken within a superfamily, as well as all the possible changes within a superfamily. This has been used to generate a matrix of observed exchanges from one enzyme function to another, revealing the scale and nature of enzyme evolution and that some types of exchanges between and within E.C. classes are more prevalent than others. Surprisingly a large proportion (71%) of all known enzyme functions are performed by this relatively small set of 276 superfamilies. This reinforces the hypothesis that relatively few ancient enzymatic domain superfamilies were progenitors for most of the chemistry required for life.


Asunto(s)
Enzimas/química , Enzimas/fisiología , Evolución Molecular , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Datos de Secuencia Molecular , Relación Estructura-Actividad
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...