Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Proc Natl Acad Sci U S A ; 110(45): E4195-202, 2013 Nov 05.
Artigo em Inglês | MEDLINE | ID: mdl-24145433

RESUMO

Structural Genomics aims to elucidate protein structures to identify their functions. Unfortunately, the variation of just a few residues can be enough to alter activity or binding specificity and limit the functional resolution of annotations based on sequence and structure; in enzymes, substrates are especially difficult to predict. Here, large-scale controls and direct experiments show that the local similarity of five or six residues selected because they are evolutionarily important and on the protein surface can suffice to identify an enzyme activity and substrate. A motif of five residues predicted that a previously uncharacterized Silicibacter sp. protein was a carboxylesterase for short fatty acyl chains, similar to hormone-sensitive-lipase-like proteins that share less than 20% sequence identity. Assays and directed mutations confirmed this activity and showed that the motif was essential for catalysis and substrate specificity. We conclude that evolutionary and structural information may be combined on a Structural Genomics scale to create motifs of mixed catalytic and noncatalytic residues that identify enzyme activity and substrate specificity.


Assuntos
Biologia Computacional/métodos , Enzimas/metabolismo , Proteômica/métodos , Clonagem Molecular , Primers do DNA/genética , Evolução Molecular , Anotação de Sequência Molecular , Relação Estrutura-Atividade , Especificidade por Substrato
2.
Bioinformatics ; 28(9): 1272-3, 2012 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-22419780

RESUMO

SUMMARY: The affordability of high-throughput sequencing has created an unprecedented surge in the use of genomic data in basic, translational and clinical research. The rapid evolution of sequencing technology, coupled with its broad adoption across biology and medicine, necessitates fast, collaborative interdisciplinary discussion. SEQanswers provides a real-time knowledge-sharing resource to address this need, covering experimental and computational aspects of sequencing and sequence analysis. Developers of popular analysis tools are among the >4000 active members, and ~40 peer-reviewed publications have referenced SEQanswers. AVAILABILITY: The SEQanswers community is freely accessible at http://SEQanswers.com/


Assuntos
Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Internet , Comportamento Cooperativo , Genômica/instrumentação , Projeto Genoma Humano , Humanos , Metagenoma
3.
Bioinformatics ; 25(11): 1426-7, 2009 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-19307237

RESUMO

SUMMARY: The Evolutionary Trace Annotation (ETA) Server predicts enzymatic activity. ETA starts with a structure of unknown function, such as those from structural genomics, and with no prior knowledge of its mechanism uses the phylogenetic Evolutionary Trace (ET) method to extract key functional residues and propose a function-associated 3D motif, called a 3D template. ETA then searches previously annotated structures for geometric template matches that suggest molecular and thus functional mimicry. In order to maximize the predictive value of these matches, ETA next applies distinctive specificity filters -- evolutionary similarity, function plurality and match reciprocity. In large scale controls on enzymes, prediction coverage is 43% but the positive predictive value rises to 92%, thus minimizing false annotations. Users may modify any search parameter, including the template. ETA thus expands the ET suite for protein structure annotation, and can contribute to the annotation efforts of metaservers. AVAILABILITY: The ETA Server is a web application available at (http://mammoth.bcm.tmc.edu/eta/).


Assuntos
Biologia Computacional/métodos , Enzimas/química , Proteínas/química , Bases de Dados de Proteínas , Evolução Molecular , Internet , Filogenia , Conformação Proteica , Software
4.
BMC Bioinformatics ; 9: 17, 2008 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-18190718

RESUMO

BACKGROUND: Structural genomics projects such as the Protein Structure Initiative (PSI) yield many new structures, but often these have no known molecular functions. One approach to recover this information is to use 3D templates - structure-function motifs that consist of a few functionally critical amino acids and may suggest functional similarity when geometrically matched to other structures. Since experimentally determined functional sites are not common enough to define 3D templates on a large scale, this work tests a computational strategy to select relevant residues for 3D templates. RESULTS: Based on evolutionary information and heuristics, an Evolutionary Trace Annotation (ETA) pipeline built templates for 98 enzymes, half taken from the PSI, and sought matches in a non-redundant structure database. On average each template matched 2.7 distinct proteins, of which 2.0 share the first three Enzyme Commission digits as the template's enzyme of origin. In many cases (61%) a single most likely function could be predicted as the annotation with the most matches, and in these cases such a plurality vote identified the correct function with 87% accuracy. ETA was also found to be complementary to sequence homology-based annotations. When matches are required to both geometrically match the 3D template and to be sequence homologs found by BLAST or PSI-BLAST, the annotation accuracy is greater than either method alone, especially in the region of lower sequence identity where homology-based annotations are least reliable. CONCLUSION: These data suggest that knowledge of evolutionarily important residues improves functional annotation among distant enzyme homologs. Since, unlike other 3D template approaches, the ETA method bypasses the need for experimental knowledge of the catalytic mechanism, it should prove a useful, large scale, and general adjunct to combine with other methods to decipher protein function in the structural proteome.


Assuntos
Motivos de Aminoácidos/genética , Enzimas , Evolução Molecular , Inteligência Artificial , Bases de Dados de Proteínas , Enzimas/química , Enzimas/genética , Enzimas/metabolismo , Funções Verossimilhança , Modelos Biológicos , Reconhecimento Automatizado de Padrão , Conformação Proteica , Proteoma , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Homologia Estrutural de Proteína , Relação Estrutura-Atividade
5.
Protein Sci ; 15(6): 1530-6, 2006 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-16672239

RESUMO

The annotation of protein function has not kept pace with the exponential growth of raw sequence and structure data. An emerging solution to this problem is to identify 3D motifs or templates in protein structures that are necessary and sufficient determinants of function. Here, we demonstrate the recurrent use of evolutionary trace information to construct such 3D templates for enzymes, search for them in other structures, and distinguish true from spurious matches. Serine protease templates built from evolutionarily important residues distinguish between proteases and other proteins nearly as well as the classic Ser-His-Asp catalytic triad. In 53 enzymes spanning 33 distinct functions, an automated pipeline identifies functionally related proteins with an average positive predictive power of 62%, including correct matches to proteins with the same function but with low sequence identity (the average identity for some templates is only 17%). Although these template building, searching, and match classification strategies are not yet optimized, their sequential implementation demonstrates a functional annotation pipeline which does not require experimental information, but only local molecular mimicry among a small number of evolutionarily important residues.


Assuntos
Evolução Molecular , Modelos Biológicos , Proteínas/química , Proteínas/metabolismo , Algoritmos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Enzimas/química , Enzimas/metabolismo
6.
J Mol Microbiol Biotechnol ; 21(1-2): 36-44, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-22248541

RESUMO

Evolution hinges on the ability of organisms to adapt to their environment. A key regulator of adaptability is mutation rate, which must be balanced to maintain genome fidelity while permitting sufficient plasticity to cope with environmental changes. Multiple mechanisms govern an organism's mutation rate. Constitutive mechanisms include mutator alleles that drive global, permanent increases in mutation rates, but these changes are confined to the subpopulation that carries the mutator allele. Other mechanisms focus mutagenesis in time and space to improve the chances that adaptive mutations can spread through the population. For example, environmental stress can induce mechanisms that transiently relax the fidelity of DNA repair to bring about a temporary increase in mutation rates during times when an organism experiences a reduced fitness for its surroundings, as has been demonstrated for double-strand break repair in Escherichia coli. Still, other mechanisms control the spatial distribution of mutations by directing changes to especially mutable sequences in the genome. In eukaryotic cells, for example, the stress-sensitive chaperone Hsp90 can regulate the length of trinucleotide repeats to fine-tune gene function and can regulate the mobility of transposable elements to enable larger functional changes. Here, we review the regulation of mutation rate, with special emphasis on the roles of tandem repeats and environmental stress in genome evolution.


Assuntos
Adaptação Biológica , Escherichia coli/fisiologia , Mutação , Sequências Repetitivas de Ácido Nucleico , Estresse Fisiológico , Escherichia coli/genética , Evolução Molecular , Genoma Bacteriano , Seleção Genética
7.
J Mol Biol ; 396(5): 1451-73, 2010 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-20036248

RESUMO

By design, structural genomics (SG) solves many structures that cannot be assigned function based on homology to known proteins. Alternative function annotation methods are therefore needed and this study focuses on function prediction with three-dimensional (3D) templates: small structural motifs built of just a few functionally critical residues. Although experimentally proven functional residues are scarce, we show here that Evolutionary Trace (ET) rankings of residue importance are sufficient to build 3D templates, match them, and then assign Gene Ontology (GO) functions in enzymes and non-enzymes alike. In a high-specificity mode, this Evolutionary Trace Annotation (ETA) method covered half (53%) of the 2384 annotated SG protein controls. Three-quarters (76%) of predictions were both correct and complete. The positive predictive value for all GO depths (all-depth PPV) was 84%, and it rose to 94% over GO depths 1-3 (depth 3 PPV). In a high-sensitivity mode, coverage rose significantly (84%), while accuracy fell moderately: 68% of predictions were both correct and complete, all-depth PPV was 75%, and depth 3 PPV was 86%. These data concur with prior mutational experiments showing that ET rank information identifies key functional determinants in proteins. In practice, ETA predicted functions in 42% of 3461 unannotated SG proteins. In 529 cases--including 280 non-enzymes and 21 for metal ion ligands--the expected accuracy is 84% at any GO depth and 94% down to GO depth 3, while for the remaining 931 the expected accuracies are 60% and 71%, respectively. Thus, local structural comparisons of evolutionarily important residues can help decipher protein functions to known reliability levels and without prior assumption on functional mechanisms. ETA is available at http://mammoth.bcm.tmc.edu/eta.


Assuntos
Evolução Molecular , Proteínas/química , Proteínas/genética , Proteoma , Algoritmos , Animais , Bases de Dados de Proteínas , Enzimas/química , Enzimas/genética , Genômica , Humanos , Modelos Moleculares , Conformação Proteica , Proteínas/metabolismo , Alinhamento de Sequência , Análise de Sequência de Proteína , Homologia de Sequência de Aminoácidos , Homologia Estrutural de Proteína
8.
PLoS One ; 5(12): e14286, 2010 Dec 13.
Artigo em Inglês | MEDLINE | ID: mdl-21179190

RESUMO

High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.


Assuntos
Proteínas/química , Staphylococcus aureus/genética , Algoritmos , Bioquímica/métodos , Clonagem Molecular , Análise por Conglomerados , Estudos de Coortes , Bases de Dados de Proteínas , Evolução Molecular , Genômica , Funções Verossimilhança , Modelos Genéticos , Modelos Estatísticos , Proteômica/métodos , Reprodutibilidade dos Testes , Staphylococcus aureus/metabolismo
9.
PLoS One ; 3(5): e2136, 2008 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-18461181

RESUMO

Function prediction frequently relies on comparing genes or gene products to search for relevant similarities. Because the number of protein structures with unknown function is mushrooming, however, we asked here whether such comparisons could be improved by focusing narrowly on the key functional features of protein structures, as defined by the Evolutionary Trace (ET). Therefore a series of algorithms was built to (a) extract local motifs (3D templates) from protein structures based on ET ranking of residue importance; (b) to assess their geometric and evolutionary similarity to other structures; and (c) to transfer enzyme annotation whenever a plurality was reached across matches. Whereas a prototype had only been 80% accurate and was not scalable, here a speedy new matching algorithm enabled large-scale searches for reciprocal matches and thus raised annotation specificity to 100% in both positive and negative controls of 49 enzymes and 50 non-enzymes, respectively-in one case even identifying an annotation error-while maintaining sensitivity ( approximately 60%). Critically, this Evolutionary Trace Annotation (ETA) pipeline requires no prior knowledge of functional mechanisms. It could thus be applied in a large-scale retrospective study of 1218 structural genomics enzymes and reached 92% accuracy. Likewise, it was applied to all 2935 unannotated structural genomics proteins and predicted enzymatic functions in 320 cases: 258 on first pass and 62 more on second pass. Controls and initial analyses suggest that these predictions are reliable. Thus the large-scale evolutionary integration of sequence-structure-function data, here through reciprocal identification of local, functionally important structural features, may contribute significantly to de-orphaning the structural proteome.


Assuntos
Evolução Molecular , Proteoma/genética , Algoritmos , Enzimas/metabolismo , Proteínas/genética , Proteínas/metabolismo , Proteoma/química , Moldes Genéticos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA