Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Proteins ; 90(7): 1443-1456, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35175626

RESUMO

Petroleum-based plastics are durable and accumulate in all ecological niches. Knowledge on enzymatic degradation is sparse. Today, less than 50 verified plastics-active enzymes are known. First examples of enzymes acting on the polymers polyethylene terephthalate (PET) and polyurethane (PUR) have been reported together with a detailed biochemical and structural description. Furthermore, very few polyamide (PA) oligomer active enzymes are known. In this article, the current known enzymes acting on the synthetic polymers PET and PUR are briefly summarized, their published activity data were collected and integrated into a comprehensive open access database. The Plastics-Active Enzymes Database (PAZy) represents an inventory of known and experimentally verified enzymes that act on synthetic fossil fuel-based polymers. Almost 3000 homologs of PET-active enzymes were identified by profile hidden Markov models. Over 2000 homologs of PUR-active enzymes were identified by BLAST. Based on multiple sequence alignments, conservation analysis identified the most conserved amino acids, and sequence motifs for PET- and PUR-active enzymes were derived.


Assuntos
Plásticos , Polietilenotereftalatos , Biodegradação Ambiental , Hidrólise , Plásticos/metabolismo , Polietilenotereftalatos/metabolismo
2.
J R Soc Interface ; 18(184): 20210389, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34727710

RESUMO

Evolutionary relationships of protein families can be characterized either by networks or by trees. Whereas trees allow for hierarchical grouping and reconstruction of the most likely ancestral sequences, networks lack a time axis but allow for thresholds of pairwise sequence identity to be chosen and, therefore, the clustering of family members with presumably more similar functions. Here, we use the large family of arylsulfatases and phosphonate monoester hydrolases to investigate similarities, strengths and weaknesses in tree and network representations. For varying thresholds of pairwise sequence identity, values of betweenness centrality and clustering coefficients were derived for nodes of the reconstructed ancestors to measure the propensity to act as a bridge in a network. Based on these properties, ancestral protein sequences emerge as bridges in protein sequence networks. Interestingly, many ancestral protein sequences appear close to extant sequences. Therefore, reconstructed ancestor sequences might also be interpreted as yet-to-be-identified homologues. The concept of ancestor reconstruction is compared to consensus sequences, too. It was found that hub sequences in a network, e.g. reconstructed ancestral sequences that are connected to many neighbouring sequences, share closer similarity with derived consensus sequences. Therefore, some reconstructed ancestor sequences can also be interpreted as consensus sequences.


Assuntos
Evolução Molecular , Proteínas , Sequência de Aminoácidos , Evolução Biológica , Filogenia
3.
PLoS One ; 16(10): e0256817, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34699529

RESUMO

The glycoside hydrolase 19 (GH19) is a bifunctional family of chitinases and endolysins, which have been studied for the control of plant fungal pests, the recycle of chitin biomass, and the treatment of multi-drug resistant bacteria. The GH19 domain-containing sequences (22,461) were divided into a chitinase and an endolysin subfamily by analyzing sequence networks, guided by taxonomy and the substrate specificity of characterized enzymes. The chitinase subfamily was split into seventeen groups, thus extending the previous classification. The endolysin subfamily is more diverse and consists of thirty-four groups. Despite their sequence diversity, twenty-six residues are conserved in chitinases and endolysins, which can be distinguished by two specific sequence patterns at six and four positions, respectively. Their location outside the catalytic cleft suggests a possible mechanism for substrate specificity that goes beyond the direct interaction with the substrate. The evolution of the GH19 catalytic domain was investigated by large-scale phylogeny. The inferred evolutionary history and putative horizontal gene transfer events differ from previous works. While no clear patterns were detected in endolysins, chitinases varied in sequence length by up to four loop insertions, causing at least eight distinct presence/absence loop combinations. The annotated GH19 sequences and structures are accessible via the GH19 Engineering Database (GH19ED, https://gh19ed.biocatnet.de). The GH19ED has been developed to support the prediction of substrate specificity and the search for novel GH19 enzymes from neglected taxonomic groups or in regions of the sequence space where few sequences have been described yet.


Assuntos
Quitinases/genética , Endopeptidases/genética , Animais , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Domínio Catalítico , Quitinases/química , Quitinases/metabolismo , Bases de Dados de Proteínas , Endopeptidases/química , Endopeptidases/metabolismo , Evolução Molecular , Fungos/química , Fungos/genética , Fungos/metabolismo , Humanos , Modelos Moleculares , Filogenia , Proteínas de Plantas/química , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Conformação Proteica , Especificidade por Substrato
4.
Proteins ; 89(2): 149-162, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-32862462

RESUMO

Expansins have the remarkable ability to loosen plant cell walls and cellulose material without showing catalytic activity and therefore have potential applications in biomass degradation. To support the study of sequence-structure-function relationships and the search for novel expansins, the Expansin Engineering Database (ExED, https://exed.biocatnet.de) collected sequence and structure data on expansins from Bacteria, Fungi, and Viridiplantae, and expansin-like homologues such as carbohydrate binding modules, glycoside hydrolases, loosenins, swollenins, cerato-platanins, and EXPNs. Based on global sequence alignment and protein sequence network analysis, the sequences are highly diverse. However, many similarities were found between the expansin domains. Newly created profile hidden Markov models of the two expansin domains enable standard numbering schemes, comprehensive conservation analyses, and genome annotation. Conserved key amino acids in the expansin domains were identified, a refined classification of expansins and carbohydrate binding modules was proposed, and new sequence motifs facilitate the search of novel candidate genes and the engineering of expansins.


Assuntos
Actinobacteria/genética , Parede Celular/metabolismo , Bases de Dados de Proteínas , Fungos/genética , Proteínas de Plantas/genética , Plantas/genética , Actinobacteria/metabolismo , Sequência de Aminoácidos , Parede Celular/química , Parede Celular/genética , Sequência Conservada , Fungos/metabolismo , Expressão Gênica , Família Multigênica , Proteínas de Plantas/química , Proteínas de Plantas/classificação , Proteínas de Plantas/metabolismo , Plantas/metabolismo , Domínios Proteicos , Engenharia de Proteínas/métodos , Isoformas de Proteínas/química , Isoformas de Proteínas/classificação , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Software , Homologia Estrutural de Proteína
5.
Chembiochem ; 21(24): 3511-3514, 2020 12 11.
Artigo em Inglês | MEDLINE | ID: mdl-32939899

RESUMO

The ß-hydroxyacid dehydrogenase from Thermocrinus albus (Ta-ßHAD), which catalyzes the NADP+ -dependent oxidation of ß-hydroxyacids, was engineered to accept imines as substrates. The catalytic activity of the proton-donor variant K189D was further increased by the introduction of two nonpolar flanking residues (N192 L, N193 L). Engineering the putative alternative proton donor (D258S) and the gate-keeping residue (F250 A) led to a switched substrate specificity as compared to the single and triple variants. The two most active Ta-ßHAD variants were applied to biocatalytic asymmetric reductions of imines at elevated temperatures and enabled enhanced product formation at a reaction temperature of 50 °C.


Assuntos
Desidrogenases de Carboidrato/metabolismo , Iminas/metabolismo , Engenharia de Proteínas , Temperatura , Bactérias/enzimologia , Desidrogenases de Carboidrato/química , Estabilidade Enzimática , Iminas/química , Modelos Moleculares , Estrutura Molecular , Oxirredução
6.
Front Plant Sci ; 11: 1155, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32849713

RESUMO

The genome of bipartite geminiviruses in the genus Begomovirus comprises two circular DNAs: DNA-A and DNA-B. The DNA-B component encodes a nuclear shuttle protein (NSP) and a movement protein (MP), which cooperate for systemic spread of infectious nucleic acids within host plants and affect pathogenicity. MP mediates multiple functions during intra- and intercellular trafficking, such as binding of viral nucleoprotein complexes, targeting to and modification of plasmodesmata, and release of the cargo after cell-to-cell transfer. For Abutilon mosaic virus (AbMV), phosphorylation of MP expressed in bacteria, yeast, and Nicotiana benthamiana plants, respectively, has been demonstrated in previous studies. Three phosphorylation sites (T221, S223, and S250) were identified in its C-terminal oligomerization domain by mass spectrometry, suggesting a regulation of MP by posttranslational modification. To examine the influence of the three sites on the self-interaction in more detail, MP mutants were tested for their interaction in yeast by two-hybrid assays, or by Förster resonance energy transfer (FRET) techniques in planta. Expression constructs with point mutations leading to simultaneous (triple) exchange of T221, S223, and S250 to either uncharged alanine (MPAAA), or phosphorylation charge-mimicking aspartate residues (MPDDD) were compared. MPDDD interfered with MP-MP binding in contrast to MPAAA. The roles of the phosphorylation sites for the viral life cycle were studied further, using plant-infectious AbMV DNA-B variants with the same triple mutants each. When co-inoculated with wild-type DNA-A, both mutants infected N. benthamiana plants systemically, but were unable to do so for some other plant species of the families Solanaceae or Malvaceae. Systemically infected plants developed symptoms and viral DNA levels different from those of wild-type AbMV for most virus-plant combinations. The results indicate a regulation of diverse MP functions by posttranslational modifications and underscore their biological relevance for a complex host plant-geminivirus interaction.

7.
Proteins ; 88(10): 1329-1339, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32447824

RESUMO

Multicopper oxidases (MCOs) use copper ions as cofactors to oxidize a variety of substrates while reducing oxygen to water. MCOs have been identified in various taxa, with notable occurrences in fungi. The role of these fungal MCOs in lignin degradation sparked an interest due to their potential for application in biofuel production and various other industries. MCOs consist of different protein domains, which led to their classification into two-, three-, and six-domain MCOs. The previously established Laccase and Multicopper Oxidase Engineering Database (https://lcced.biocatnet.de) was updated and now includes 51 058 sequences and 229 structures of MCOs. Sequences and structures of all MCOs were systematically compared. All MCOs consist of cupredoxin-like domains. Two-domain MCOs are formed by the N- and C-terminal domain (domain N and C), while three-domain MCOs have an additional domain (M) in between, homologous to domain C. The six-domain MCOs consist of alternating domains N and C, each three times. Two standard numbering schemes were developed for the copper-binding domains N and C, which facilitated the identification of conserved positions and a comparison to previously reported results from mutagenesis studies. Two sequence motifs for the copper binding sites were identified per domain. Their modularity, depending on the placement of the T1-copper binding site, was demonstrated. Protein sequence networks showed relationships between two- and three-domain MCOs, allowing for family-specific annotation and inference of evolutionary relationships.


Assuntos
Azurina/química , Coenzimas/química , Cobre/química , Proteínas Fúngicas/química , Oxirredutases/química , Sequência de Aminoácidos , Azurina/metabolismo , Sítios de Ligação , Coenzimas/metabolismo , Cobre/metabolismo , Mineração de Dados , Bases de Dados de Proteínas , Evolução Molecular , Proteínas Fúngicas/classificação , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo , Fungos/química , Fungos/enzimologia , Modelos Moleculares , Oxirredução , Oxirredutases/classificação , Oxirredutases/genética , Oxirredutases/metabolismo , Oxigênio/química , Oxigênio/metabolismo , Ligação Proteica , Engenharia de Proteínas , Domínios e Motivos de Interação entre Proteínas , Estrutura Secundária de Proteína , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade , Especificidade por Substrato , Água/química , Água/metabolismo
8.
FEBS J ; 287(5): 1035-1053, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31545554

RESUMO

The α/ß-hydrolase fold family is highly diverse in sequence, structure and biochemical function. To investigate the sequence-structure-function relationships, the Lipase Engineering Database (https://led.biocatnet.de) was updated. Overall, 280 638 protein sequences and 1557 protein structures were analysed. All α/ß-hydrolases consist of the catalytically active core domain, but they might also contain additional structural modules, resulting in 12 different architectures: core domain only, additional lids at three different positions, three different caps, additional N- or C-terminal domains and combinations of N- and C-terminal domains with caps and lids respectively. In addition, the α/ß-hydrolases were distinguished by their oxyanion hole signature (GX-, GGGX- and Y-types). The N-terminal domains show two different folds, the Rossmann fold or the ß-propeller fold. The C-terminal domains show a ß-sandwich fold. The N-terminal ß-propeller domain and the C-terminal ß-sandwich domain are structurally similar to carbohydrate-binding proteins such as lectins. The classification was applied to the newly discovered polyethylene terephthalate (PET)-degrading PETases and MHETases, which are core domain α/ß-hydrolases of the GX- and the GGGX-type respectively. To investigate evolutionary relationships, sequence networks were analysed. The degree distribution followed a power law with a scaling exponent γ = 1.4, indicating a highly inhomogeneous network which consists of a few hubs and a large number of less connected sequences. The hub sequences have many functional neighbours and therefore are expected to be robust toward possible deleterious effects of mutations. The cluster size distribution followed a power law with an extrapolated scaling exponent τ = 2.6, which strongly supports the connectedness of the sequence space of α/ß-hydrolases. DATABASE: Supporting data about domains from other proteins with structural similarity to the N- or C-terminal domains of α/ß-hydrolases are available in Data Repository of the University of Stuttgart (DaRUS) under doi: https://doi.org/10.18419/darus-458.


Assuntos
Hidrolases/química , Hidrolases/metabolismo , Sequência de Aminoácidos , Lipase/química , Lipase/metabolismo , Polietilenotereftalatos/química , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos
9.
Proteins ; 87(9): 774-785, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31070804

RESUMO

Thiamine diphosphate-dependent decarboxylases catalyze both cleavage and formation of CC bonds in various reactions, which have been assigned to different homologous sequence families. This work compares 53 ThDP-dependent decarboxylases with known crystal structures. Both sequence and structural information were analyzed synergistically and data were analyzed for global and local properties by means of statistical approaches (principle component analysis and principal coordinate analysis) enabling complexity reduction. The different results obtained both locally and globally, that is, individual positions compared with the overall protein sequence or structure, revealed challenges in the assignment of separated homologous families. The methods applied herein support the comparison of enzyme families and the identification of functionally relevant positions. The findings for the family of ThDP-dependent decarboxylases underline that global sequence identity alone is not sufficient to distinguish enzyme function. Instead, local sequence similarity, defined by comparisons of structurally equivalent positions, allows for a better navigation within several groups of homologous enzymes. The differentiation between homologous sequences is further enhanced by taking structural information into account, such as BioGPS analysis of the active site properties or pairwise structural superimpositions. The methods applied herein are expected to be transferrable to other enzyme families, to facilitate family assignments for homologous protein sequences.


Assuntos
Carboxiliases/química , Carboxiliases/metabolismo , Sítios de Ligação , Domínio Catalítico , Tiamina Pirofosfato/química
10.
Chembiochem ; 20(13): 1672-1677, 2019 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-30866142

RESUMO

Chorismate and isochorismate constitute branch-point intermediates in the biosynthesis of many aromatic metabolites in microorganisms and plants. To obtain unnatural compounds, we modified the route to menaquinone in Escherichia coli. We propose a model for the binding of isochorismate to the active site of MenD ((1R,2S, 5S,6S)-2-succinyl-5-enolpyruvyl-6-hydroxycyclohex-3-ene-1-carboxylate (SEPHCHC) synthase) that explains the outcome of the native reaction with α-ketoglutarate. We have rationally designed variants of MenD for the conversion of several isochorismate analogues. The double-variant Asn117Arg-Leu478Thr preferentially converts (5S,6S)-5,6-dihydroxycyclohexa-1,3-diene-1-carboxylate (2,3-trans-CHD), the hydrolysis product of isochorismate, with a >70-fold higher ratio than that for the wild type. The single-variant Arg107Ile uses (5S,6S)-6-amino-5-hydroxycyclohexa-1,3-diene-1-carboxylate (2,3-trans-CHA) as substrate with >6-fold conversion compared to wild-type MenD. The novel compounds have been made accessible in vivo (up to 5.3 g L-1 ). Unexpectedly, as the identified residues such as Arg107 are highly conserved (>94 %), some of the designed variations can be found in wild-type SEPHCHC synthases from other bacteria (Arg107Lys, 0.3 %). This raises the question for the possible natural occurrence of as yet unexplored branches of the shikimate pathway.


Assuntos
Ácidos Cicloexanocarboxílicos/metabolismo , Proteínas de Escherichia coli/metabolismo , Piruvato Oxidase/metabolismo , Domínio Catalítico , Ácido Corísmico/química , Ácido Corísmico/metabolismo , Escherichia coli/enzimologia , Escherichia coli/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Simulação de Acoplamento Molecular , Mutação , Ligação Proteica , Engenharia de Proteínas , Piruvato Oxidase/química , Piruvato Oxidase/genética , Especificidade por Substrato
11.
Proteins ; 87(6): 443-451, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-30714194

RESUMO

The Short-chain Dehydrogenases/Reductases Engineering Database (SDRED) covers one of the largest known protein families (168 150 proteins). Assignment to the superfamilies of Classical and Extended SDRs was achieved by global sequence similarity and by identification of family-specific sequence motifs. Two standard numbering schemes were established for Classical and Extended SDRs that allow for the determination of conserved amino acid residues, such as cofactor specificity determining positions or superfamily specific sequence motifs. The comprehensive sequence dataset of the SDRED facilitates the refinement of family-specific sequence motifs. The glycine-rich motifs for Classical and Extended SDRs were refined to improve the precision of superfamily classification. In each superfamily, the majority of sequences formed a tightly connected sequence network and belonged to a large homologous family. Despite their different sequence motifs and their different sequence length, the two sequence networks of Classical and Extended SDRs are not separate, but connected by edges at a threshold of 40% sequence similarity, indicating that all SDRs belong to a large, connected network. The SDRED is accessible at https://sdred.biocatnet.de/.


Assuntos
Ácido Graxo Sintases/metabolismo , NADH NADPH Oxirredutases/metabolismo , Animais , Bases de Dados Genéticas , Ácido Graxo Sintases/genética , Humanos , NADH NADPH Oxirredutases/genética , Engenharia de Proteínas/métodos
12.
Org Biomol Chem ; 17(8): 2092-2098, 2019 02 20.
Artigo em Inglês | MEDLINE | ID: mdl-30706936

RESUMO

Chorismatases catalyse the cleavage of chorismate, yielding (dihydroxy-)benzoate derivatives, which often constitute starter units for pharmaceutically relevant secondary metabolites. Depending on their products, chorismatases have been classified into three different subfamilies. These can be assigned using a set of amino acid residues in the active site. Here, we describe five new chorismatases, two of them members of a new subfamily, which has been discovered through correlation analysis of homologous protein sequences. The enzymes from the new subfamily produce exclusively 4-hydroxybenzoate, the same compound as produced by the structurally unrelated chorismate lyases. This showcase of convergent evolution is an example of the existence of more than one pathway to central building blocks. In contrast to chorismate lyases, however, chorismatases do not suffer from product inhibition (up to 2 mM 4-HBA), while the remaining kinetic parameters are in the same range; this makes them an interesting alternative for biocatalytic applications.


Assuntos
Bactérias/enzimologia , Proteínas de Bactérias/metabolismo , Ácido Corísmico/metabolismo , Oxo-Ácido-Liases/metabolismo , Parabenos/metabolismo , Bactérias/química , Bactérias/metabolismo , Proteínas de Bactérias/química , Ácido Corísmico/química , Modelos Moleculares , Oxo-Ácido-Liases/química , Parabenos/química , Streptomyces/química , Streptomyces/enzimologia , Streptomyces/metabolismo , Especificidade por Substrato
13.
Biotechnol J ; 14(3): e1800183, 2019 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-29999245

RESUMO

The estimation of kinetic parameters provides valuable insights into the function of biocatalysts and is indispensable in optimizing process conditions. Frequently, kinetic analysis relies on the Michaelis-Menten model derived from initial reaction rates at different initial substrate concentrations. However, by analysis of complete progress curves, more complex kinetic models can be identified. This case study compares two previously published experiments on benzaldehyde lyase-catalyzed self-ligation for the substrates benzaldehyde and 3,5-dimethoxybenzaldehyde to investigate 1) the effect of using different kinetic model equations on the kinetic parameter values, and 2) the effect of using models with and without enzyme inactivation on the kinetic parameter values. These analyses first highlight possible pitfalls in the interpretation of kinetic parameter estimates and second suggest a consistent strategy for data management and validation of kinetic models: First, Michaelis-Menten parameters need to be interpreted with care, complete progress curves are necessary to describe the reaction dynamics, and all experimental conditions have to be taken into consideration when interpreting parameter estimates. Second, complete progress curves should be stored together with the respective reaction conditions, to consistently annotate experimental data and avoid misinterpretation of kinetic parameters. Such a data management strategy is provided by the BioCatNet database system.


Assuntos
Enzimas/química , Aldeído Liases/química , Catálise , Cinética , Modelos Biológicos
14.
PLoS One ; 13(8): e0200815, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30067815

RESUMO

The sequence space of five protein superfamilies was investigated by constructing sequence networks. The nodes represent individual sequences, and two nodes are connected by an edge if the global sequence identity of two sequences exceeds a threshold. The networks were characterized by their degree distribution (number of nodes with a given number of neighbors) and by their fractal network dimension. Although the five protein families differed in sequence length, fold, and domain arrangement, their network properties were similar. The fractal network dimension Df was distance-dependent: a high dimension for single and double mutants (Df = 4.0), which dropped to Df = 0.7-1.0 at 90% sequence identity, and increased to Df = 3.5-4.5 below 70% sequence identity. The distance dependency of the network dimension is consistent with evolutionary constraints for functional proteins. While random single and double mutations often result in a functional protein, the accumulation of more than ten mutations is dominated by epistasis. The networks of the five protein families were highly inhomogeneous with few highly connected communities ("hub sequences") and a large number of smaller and less connected communities. The degree distributions followed a power-law distribution with similar scaling exponents close to 1. Because the hub sequences have a large number of functional neighbors, they are expected to be robust toward possible deleterious effects of mutations. Because of their robustness, hub sequences have the potential of high innovability, with additional mutations readily inducing new functions. Therefore, they form hotspots of evolution and are promising candidates as starting points for directed evolution experiments in biotechnology.


Assuntos
Modelos Moleculares , Proteínas/química , Sequência de Aminoácidos , Fractais , Mutação , Domínios Proteicos , Dobramento de Proteína , Proteínas/genética , Proteínas/metabolismo , Alinhamento de Sequência
15.
Proteins ; 86(5): 566-580, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29423963

RESUMO

The ω-Transaminase Engineering Database (oTAED) was established as a publicly accessible resource on sequences and structures of the biotechnologically relevant ω-transaminases (ω-TAs) from Fold types I and IV. The oTAED integrates sequence and structure data, provides a classification based on fold type and sequence similarity, and applies a standard numbering scheme to identify equivalent positions in homologous proteins. The oTAED includes 67 210 proteins (114 655 sequences) which are divided into 169 homologous families based on global sequence similarity. The 44 and 39 highly conserved positions which were identified in Fold type I and IV, respectively, include the known catalytic residues and a large fraction of glycines and prolines in loop regions, which might have a role in protein folding and stability. However, for most of the conserved positions the function is still unknown. Literature information on positions that mediate substrate specificity and stereoselectivity was systematically examined. The standard numbering schemes revealed that many positions which have been described in different enzymes are structurally equivalent. For some positions, multiple functional roles have been suggested based on experimental data in different enzymes. The proposed standard numbering schemes for Fold type I and IV ω-TAs assist with analysis of literature data, facilitate annotation of ω-TAs, support prediction of promising mutation sites, and enable navigation in ω-TA sequence space. Thus, it is a useful tool for enzyme engineering and the selection of novel ω-TA candidates with desired biochemical properties.


Assuntos
Proteínas de Bactérias/química , Bases de Dados de Proteínas , Transaminases/química , Transaminases/classificação , Sequência de Aminoácidos , Aminoácidos/química , Bactérias , Domínio Catalítico , Sequência Conservada , Modelos Moleculares , Mutação , Conformação Proteica , Dobramento de Proteína , Relação Estrutura-Atividade , Especificidade por Substrato
16.
ACS Chem Biol ; 13(1): 225-234, 2018 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-29182315

RESUMO

Esterases receive special attention because of their wide distribution in biological systems and environments and their importance for physiology and chemical synthesis. The prediction of esterases' substrate promiscuity level from sequence data and the molecular reasons why certain such enzymes are more promiscuous than others remain to be elucidated. This limits the surveillance of the sequence space for esterases potentially leading to new versatile biocatalysts and new insights into their role in cellular function. Here, we performed an extensive analysis of the substrate spectra of 145 phylogenetically and environmentally diverse microbial esterases, when tested with 96 diverse esters. We determined the primary factors shaping their substrate range by analyzing substrate range patterns in combination with structural analysis and protein-ligand simulations. We found a structural parameter that helps rank (classify) the promiscuity level of esterases from sequence data at 94% accuracy. This parameter, the active site effective volume, exemplifies the topology of the catalytic environment by measuring the active site cavity volume corrected by the relative solvent accessible surface area (SASA) of the catalytic triad. Sequences encoding esterases with active site effective volumes (cavity volume/SASA) above a threshold show greater substrate spectra, which can be further extended in combination with phylogenetic data. This measure provides also a valuable tool for interrogating substrates capable of being converted. This measure, found to be transferred to phosphatases of the haloalkanoic acid dehalogenase superfamily and possibly other enzymatic systems, represents a powerful tool for low-cost bioprospecting for esterases with broad substrate ranges, in large scale sequence data sets.


Assuntos
Esterases/química , Esterases/metabolismo , Filogenia , Domínio Catalítico , Especificidade por Substrato
17.
PLoS One ; 12(12): e0189646, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-29261740

RESUMO

The currently known protein sequences are not distributed equally in sequence space, but cluster into families. Analyzing the cluster size distribution gives a glimpse of the large and unknown extant protein sequence space, which has been explored during evolution. For six protein superfamilies with different fold and function, the cluster size distributions followed a power law with slopes between 2.4 and 3.3, which represent upper limits to the cluster distribution of extant sequences. The power law distribution of cluster sizes is in accordance with percolation theory and strongly supports connectedness of extant sequence space. Percolation of extant sequence space has three major consequences: (1) It transforms our view of sequence space as a highly connected network where each sequence has multiple neighbors, and each pair of sequences is connected by many different paths. A high degree of connectedness is a necessary condition of efficient evolution, because it overcomes the possible blockage by sign epistasis and reciprocal sign epistasis. (2) The Fisher exponent is an indicator of connectedness and saturation of sequence space of each protein superfamily. (3) All clusters are expected to be connected by extant sequences that become apparent as a higher portion of extant sequence space becomes known. Being linked to biochemically distinct homologous families, bridging sequences are promising enzyme candidates for applications in biotechnology because they are expected to have substrate ambiguity or catalytic promiscuity.


Assuntos
Proteínas/química , Sequência de Aminoácidos , Análise por Conglomerados , Dobramento de Proteína
18.
Chembiochem ; 17(21): 2093-2098, 2016 11 03.
Artigo em Inglês | MEDLINE | ID: mdl-27562499

RESUMO

The development of novel enzymes for biocatalytic processes requires knowledge on substrate profile and selectivity; this can be derived from databases and from publications. Often, these sources lack time-course data for the substrate or product, and an unambiguous link between experiment and enzyme sequence. The lack of integrated, original data hampers the comprehensive analysis of enzyme kinetics and the evaluation of sequence-function relationships. In order to accelerate enzyme engineering, BioCatNet integrates protein sequence, protein structure, and experimental data for a given enzyme family. BioCatNet explicitly assigns the enzyme sequence to the experimental data, which consists of information on reaction conditions and time-course data. BioCatNet facilitates the consistent documentation of reaction conditions, the archiving of time-course data, and the efficient exchange of experimental data among collaborators. Data integration is demonstrated for three case studies by using the TEED (Thiamine diphosphate-dependent Enzymes Engineering Database).


Assuntos
Biocatálise , Bases de Dados de Proteínas , Enzimas/química , Enzimas/metabolismo , Sequência de Aminoácidos , Engenharia de Proteínas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...