Pesquisa | Biblioteca Virtual em Saúde

Analyzing the symmetrical arrangement of structural repeats in proteins with CE-Symm.

Bliven, Spencer E; Lafita, Aleix; Rose, Peter W; Capitani, Guido; Prlic, Andreas; Bourne, Philip E.

PLoS Comput Biol ; 15(4): e1006842, 2019 04.

Artigo em Inglês | MEDLINE | ID: mdl-31009453

RESUMO

Many proteins fold into highly regular and repetitive three dimensional structures. The analysis of structural patterns and repeated elements is fundamental to understand protein function and evolution. We present recent improvements to the CE-Symm tool for systematically detecting and analyzing the internal symmetry and structural repeats in proteins. In addition to the accurate detection of internal symmetry, the tool is now capable of i) reporting the type of symmetry, ii) identifying the smallest repeating unit, iii) describing the arrangement of repeats with transformation operations and symmetry axes, and iv) comparing the similarity of all the internal repeats at the residue level. CE-Symm 2.0 helps the user investigate proteins with a robust and intuitive sequence-to-structure analysis, with many applications in protein classification, functional annotation and evolutionary studies. We describe the algorithmic extensions of the method and demonstrate its applications to the study of interesting cases of protein evolution.

Assuntos

Algoritmos , Biologia Computacional/métodos , Proteínas/química , Software , Sequência de Aminoácidos , Bases de Dados de Proteínas , Modelos Moleculares , Análise de Sequência de Proteína

Systems biology of the structural proteome.

Brunk, Elizabeth; Mih, Nathan; Monk, Jonathan; Zhang, Zhen; O'Brien, Edward J; Bliven, Spencer E; Chen, Ke; Chang, Roger L; Bourne, Philip E; Palsson, Bernhard O.

BMC Syst Biol ; 10: 26, 2016 Mar 11.

Artigo em Inglês | MEDLINE | ID: mdl-26969117

RESUMO

BACKGROUND: The success of genome-scale models (GEMs) can be attributed to the high-quality, bottom-up reconstructions of metabolic, protein synthesis, and transcriptional regulatory networks on an organism-specific basis. Such reconstructions are biochemically, genetically, and genomically structured knowledge bases that can be converted into a mathematical format to enable a myriad of computational biological studies. In recent years, genome-scale reconstructions have been extended to include protein structural information, which has opened up new vistas in systems biology research and empowered applications in structural systems biology and systems pharmacology. RESULTS: Here, we present the generation, application, and dissemination of genome-scale models with protein structures (GEM-PRO) for Escherichia coli and Thermotoga maritima. We show the utility of integrating molecular scale analyses with systems biology approaches by discussing several comparative analyses on the temperature dependence of growth, the distribution of protein fold families, substrate specificity, and characteristic features of whole cell proteomes. Finally, to aid in the grand challenge of big data to knowledge, we provide several explicit tutorials of how protein-related information can be linked to genome-scale models in a public GitHub repository ( https://github.com/SBRG/GEMPro/tree/master/GEMPro_recon/). CONCLUSIONS: Translating genome-scale, protein-related information to structured data in the format of a GEM provides a direct mapping of gene to gene-product to protein structure to biochemical reaction to network states to phenotypic function. Integration of molecular-level details of individual proteins, such as their physical, chemical, and structural properties, further expands the description of biochemical network-level properties, and can ultimately influence how to model and predict whole cell phenotypes as well as perform comparative systems biology approaches to study differences between organisms. GEM-PRO offers insight into the physical embodiment of an organism's genotype, and its use in this comparative framework enables exploration of adaptive strategies for these organisms, opening the door to many new lines of research. With these provided tools, tutorials, and background, the reader will be in a position to run GEM-PRO for their own purposes.

Assuntos

Escherichia coli/genética , Escherichia coli/metabolismo , Proteômica , Biologia de Sistemas/métodos , Thermotoga maritima/genética , Thermotoga maritima/metabolismo , Escherichia coli/crescimento & desenvolvimento , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Modelos Biológicos , Modelos Moleculares , Conformação Proteica , Homologia de Sequência de Aminoácidos , Temperatura , Thermotoga maritima/crescimento & desenvolvimento

Detection of circular permutations within protein structures using CE-CP.

Bliven, Spencer E; Bourne, Philip E; Prlic, Andreas.

Bioinformatics ; 31(8): 1316-8, 2015 Apr 15.

Artigo em Inglês | MEDLINE | ID: mdl-25505094

RESUMO

MOTIVATION: Circular permutation is an important type of protein rearrangement. Natural circular permutations have implications for protein function, stability and evolution. Artificial circular permutations have also been used for protein studies. However, such relationships are difficult to detect for many sequence and structure comparison algorithms and require special consideration. RESULTS: We developed a new algorithm, called Combinatorial Extension for Circular Permutations (CE-CP), which allows the structural comparison of circularly permuted proteins. CE-CP was designed to be user friendly and is integrated into the RCSB Protein Data Bank. It was tested on two collections of circularly permuted proteins. Pairwise alignments can be visualized both in a desktop application or on the web using Jmol and exported to other programs in a variety of formats. AVAILABILITY AND IMPLEMENTATION: The CE-CP algorithm can be accessed through the RCSB website at http://www.rcsb.org/pdb/workbench/workbench.do. Source code is available under the LGPL 2.1 as part of BioJava 3 (http://biojava.org; http://github.com/biojava/biojava). CONTACT: sbliven@ucsd.edu or info@rcsb.org.

Assuntos

Algoritmos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Dinaminas/química , Homologia Estrutural de Proteína , Humanos , Linguagens de Programação , Estrutura Terciária de Proteína , Análise de Sequência de Proteína/métodos

Systematic detection of internal symmetry in proteins using CE-Symm.

Myers-Turnbull, Douglas; Bliven, Spencer E; Rose, Peter W; Aziz, Zaid K; Youkharibache, Philippe; Bourne, Philip E; Prlic, Andreas.

J Mol Biol ; 426(11): 2255-68, 2014 May 29.

Artigo em Inglês | MEDLINE | ID: mdl-24681267

RESUMO

Symmetry is an important feature of protein tertiary and quaternary structures that has been associated with protein folding, function, evolution, and stability. Its emergence and ensuing prevalence has been attributed to gene duplications, fusion events, and subsequent evolutionary drift in sequence. This process maintains structural similarity and is further supported by this study. To further investigate the question of how internal symmetry evolved, how symmetry and function are related, and the overall frequency of internal symmetry, we developed an algorithm, CE-Symm, to detect pseudo-symmetry within the tertiary structure of protein chains. Using a large manually curated benchmark of 1007 protein domains, we show that CE-Symm performs significantly better than previous approaches. We use CE-Symm to build a census of symmetry among domain superfamilies in SCOP and note that 18% of all superfamilies are pseudo-symmetric. Our results indicate that more domains are pseudo-symmetric than previously estimated. We establish a number of recurring types of symmetry-function relationships and describe several characteristic cases in detail. With the use of the Enzyme Commission classification, symmetry was found to be enriched in some enzyme classes but depleted in others. CE-Symm thus provides a methodology for a more complete and detailed study of the role of symmetry in tertiary protein structure [availability: CE-Symm can be run from the Web at http://source.rcsb.org/jfatcatserver/symmetry.jsp. Source code and software binaries are also available under the GNU Lesser General Public License (version 2.1) at https://github.com/rcsb/symmetry. An interactive census of domains identified as symmetric by CE-Symm is available from http://source.rcsb.org/jfatcatserver/scopResults.jsp].

Assuntos

Algoritmos , Estrutura Terciária de Proteína , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Software , Sequência de Aminoácidos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Humanos , Modelos Moleculares , Dobramento de Proteína , Homologia de Sequência de Aminoácidos

BioJava: an open-source framework for bioinformatics in 2012.

Prlic, Andreas; Yates, Andrew; Bliven, Spencer E; Rose, Peter W; Jacobsen, Julius; Troshin, Peter V; Chapman, Mark; Gao, Jianjiong; Koh, Chuan Hock; Foisy, Sylvain; Holland, Richard; Rimsa, Gediminas; Heuer, Michael L; Brandstätter-Müller, H; Bourne, Philip E; Willis, Scooter.

Bioinformatics ; 28(20): 2693-5, 2012 Oct 15.

Artigo em Inglês | MEDLINE | ID: mdl-22877863

RESUMO

UNLABELLED: BioJava is an open-source project for processing of biological data in the Java programming language. We have recently released a new version (3.0.5), which is a major update to the code base that greatly extends its functionality. RESULTS: BioJava now consists of several independent modules that provide state-of-the-art tools for protein structure comparison, pairwise and multiple sequence alignments, working with DNA and protein sequences, analysis of amino acid properties, detection of protein modifications and prediction of disordered regions in proteins as well as parsers for common file formats using a biologically meaningful data model. AVAILABILITY: BioJava is an open-source project distributed under the Lesser GPL (LGPL). BioJava can be downloaded from the BioJava website (http://www.biojava.org). BioJava requires Java 1.6 or higher. All inquiries should be directed to the BioJava mailing lists. Details are available at http://biojava.org/wiki/BioJava:MailingLists.

Assuntos

Proteínas/química , Análise de Sequência , Software , Aminoácidos/química , Biologia Computacional , Genômica , Conformação Proteica , Processamento de Proteína Pós-Traducional , Alinhamento de Sequência , Análise de Sequência de DNA , Análise de Sequência de Proteína

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA