Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
BMC Genomics ; 14 Suppl 5: S13, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24564380

RESUMO

BACKGROUND: Small bioinformatics databases, unlike institutionally funded large databases, are vulnerable to discontinuation and many reported in publications are no longer accessible. This leads to irreproducible scientific work and redundant effort, impeding the pace of scientific progress. RESULTS: We describe a Web-accessible system, available online at http://biodb100.apbionet.org, for archival and future on demand re-instantiation of small databases within minutes. Depositors can rebuild their databases by downloading a Linux live operating system (http://www.bioslax.com), preinstalled with bioinformatics and UNIX tools. The database and its dependencies can be compressed into an ".lzm" file for deposition. End-users can search for archived databases and activate them on dynamically re-instantiated BioSlax instances, run as virtual machines over the two popular full virtualization standard cloud-computing platforms, Xen Hypervisor or vSphere. The system is adaptable to increasing demand for disk storage or computational load and allows database developers to use the re-instantiated databases for integration and development of new databases. CONCLUSIONS: Herein, we demonstrate that a relatively inexpensive solution can be implemented for archival of bioinformatics databases and their rapid re-instantiation should the live databases disappear.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Internet , Arquivos , Software , Interface Usuário-Computador
2.
BMC Bioinformatics ; 10 Suppl 15: S2, 2009 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-19958512

RESUMO

BACKGROUND: Amino-terminal signal peptides (SPs) are short regions that guide the targeting of secretory proteins to the correct subcellular compartments in the cell. They are cleaved off upon the passenger protein reaching its destination. The explosive growth in sequencing technologies has led to the deposition of vast numbers of protein sequences necessitating rapid functional annotation techniques, with subcellular localization being a key feature. Of the myriad software prediction tools developed to automate the task of assigning the SP cleavage site of these new sequences, we review here, the performance and reliability of commonly used SP prediction tools. RESULTS: The available signal peptide data has been manually curated and organized into three datasets representing eukaryotes, Gram-positive and Gram-negative bacteria. These datasets are used to evaluate thirteen prediction tools that are publicly available. SignalP (both the HMM and ANN versions) maintains consistency and achieves the best overall accuracy in all three benchmarking experiments, ranging from 0.872 to 0.914 although other prediction tools are narrowing the performance gap. CONCLUSION: The majority of the tools evaluated in this study encounter no difficulty in discriminating between secretory and non-secretory proteins. The challenge clearly remains with pinpointing the correct SP cleavage site. The composite scoring schemes employed by SignalP may help to explain its accuracy. Prediction task is divided into a number of separate steps, thus allowing each score to tackle a particular aspect of the prediction.


Assuntos
Biologia Computacional/métodos , Sinais Direcionadores de Proteínas/genética , Análise de Sequência de Proteína/métodos , Bases de Dados de Proteínas , Proteínas/química
3.
BMC Bioinformatics ; 9 Suppl 12: S15, 2008 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-19091014

RESUMO

BACKGROUND: Signal peptides (SPs) mediate the targeting of secretory precursor proteins to the correct subcellular compartments in prokaryotes and eukaryotes. Identifying these transient peptides is crucial to the medical, food and beverage and biotechnology industries yet our understanding of these peptides remains limited. This paper examines the most common type of signal peptides cleavable by the endoprotease signal peptidase I (SPase I), and the residues flanking the cleavage sites of three groups of signal peptide sequences, namely (i) eukaryotes (Euk) (ii) Gram-positive (Gram+) bacteria, and (iii) Gram-negative (Gram-) bacteria. RESULTS: In this study, 2352 secretory peptide sequences from a variety of organisms with amino-terminal SPs are extracted from the manually curated SPdb database for analysis based on physicochemical properties such as pI, aliphatic index, GRAVY score, hydrophobicity, net charge and position-specific residue preferences. Our findings show that the three groups share several similarities in general, but they display distinctive features upon examination in terms of their amino acid compositions and frequencies, and various physico-chemical properties. Thus, analysis or prediction of their sequences should be separated and treated as distinct groups. CONCLUSION: We conclude that the peptide segment recognized by SPase I extends to the start of the mature protein to a limited extent, upon our survey of the amino acid residues surrounding the cleavage processing site. These flanking residues possibly influence the cleavage processing and contribute to non-canonical cleavage sites. Our findings are applicable in defining more accurate prediction tools for recognition and identification of cleavage site of SPs.


Assuntos
Biologia Computacional/métodos , Proteínas de Membrana/química , Sinais Direcionadores de Proteínas , Serina Endopeptidases/química , Proteínas de Bactérias/química , Físico-Química/métodos , Bases de Dados de Proteínas , Dimerização , Ponto Isoelétrico , Modelos Estatísticos , Modelos Teóricos , Peptídeos/química , Sinais Direcionadores de Proteínas/genética , Estrutura Terciária de Proteína
4.
BMC Bioinformatics ; 9 Suppl 1: S15, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18315846

RESUMO

BACKGROUND: Type I signal peptidases (SPases) are essential membrane-bound serine proteases responsible for the cleavage of signal peptides from proteins that are translocated across biological membranes. The crystal structure of SPase in complex with signal peptide has not been solved and their substrate-binding site and binding specificities remain poorly understood. We report here a structure-based model for Escherichia coli DsbA 13-25 in complex with its endogenous type I SPase. RESULTS: The bound structure of DsbA 13-25 in complex with its endogenous type I SPase reported here reveals the existence of an extended conformation of the precursor protein with a pronounced backbone twist between positions P3 and P1'. Residues 13-25 of DsbA occupy, and thereby define 13 subsites, S7 to S6', within the SPase substrate-binding site. The newly defined subsites, S1' to S6' play critical roles in the substrate specificities of E. coli SPase. Our results are in accord with available experimental data. CONCLUSION: Collectively, the results of this study provide interesting new insights into the binding conformation of signal peptides and the substrate-binding site of E. coli SPase. This is the first report on the modeling of a precursor protein into the entire SPase binding site. Together with the conserved precursor protein binding conformation, the existing and newly identified substrate binding sites readily explain SPase cleavage fidelity, consistent with existing biochemical results and solution structures of inhibitors in complex with E. coli SPase. Our data suggests that both signal and mature moiety sequences play important roles and should be considered in the development of predictive tools.


Assuntos
Algoritmos , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/ultraestrutura , Modelos Químicos , Modelos Moleculares , Isomerases de Dissulfetos de Proteínas/química , Isomerases de Dissulfetos de Proteínas/ultraestrutura , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Sítios de Ligação , Simulação por Computador , Ativação Enzimática , Dados de Sequência Molecular , Ligação Proteica , Conformação Proteica , Relação Estrutura-Atividade
5.
Bioinformatics ; 23(4): 504-6, 2007 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-17150996

RESUMO

UNLABELLED: Assessment of potential allergenicity and patterns of cross-reactivity is necessary whenever novel proteins are introduced into human food chain. Current bioinformatic methods in allergology focus mainly on the prediction of allergenic proteins, with no information on cross-reactivity patterns among known allergens. In this study, we present AllerTool, a web server with essential tools for the assessment of predicted as well as published cross-reactivity patterns of allergens. The analysis tools include graphical representation of allergen cross-reactivity information; a local sequence comparison tool that displays information of known cross-reactive allergens; a sequence similarity search tool for assessment of cross-reactivity in accordance to FAO/WHO Codex alimentarius guidelines; and a method based on support vector machine (SVM). A 10-fold cross-validation results showed that the area under the receiver operating curve (A(ROC)) of SVM models is 0.90 with 86.00% sensitivity (SE) at specificity (SP) of 86.00%. AVAILABILITY: AllerTool is freely available at http://research.i2r.a-star.edu.sg/AllerTool/.


Assuntos
Alérgenos/química , Alérgenos/imunologia , Reações Cruzadas/imunologia , Proteínas/química , Proteínas/imunologia , Análise de Sequência de Proteína/métodos , Software , Algoritmos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Dados de Sequência Molecular , Interface Usuário-Computador
6.
J Bioinform Comput Biol ; 5(6): 1319-37, 2007 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-18172931

RESUMO

Rich information on point mutation studies is scattered across heterogeneous data sources. This paper presents an automated workflow for mining mutation annotations from full-text biomedical literature using natural language processing (NLP) techniques as well as for their subsequent reuse in protein structure annotation and visualization. This system, called mSTRAP (Mutation extraction and STRucture Annotation Pipeline), is designed for both information aggregation and subsequent brokerage of the mutation annotations. It facilitates the coordination of semantically related information from a series of text mining and sequence analysis steps into a formal OWL-DL ontology. The ontology is designed to support application-specific data management of sequence, structure, and literature annotations that are populated as instances of object and data type properties. mSTRAPviz is a subsystem that facilitates the brokerage of structure information and the associated mutations for visualization. For mutated sequences without any corresponding structure available in the Protein Data Bank (PDB), an automated pipeline for homology modeling is developed to generate the theoretical model. With mSTRAP, we demonstrate a workable system that can facilitate automation of the workflow for the retrieval, extraction, processing, and visualization of mutation annotations -- tasks which are well known to be tedious, time-consuming, complex, and error-prone. The ontology and visualization tool are available at (http://datam.i2r.a-star.edu.sg/mstrap).


Assuntos
Biologia Computacional , Mutação Puntual , Simulação por Computador , Bases de Dados Genéticas , Armazenamento e Recuperação da Informação , Modelos Moleculares , Processamento de Linguagem Natural , Proteínas/química , Proteínas/genética , PubMed , Design de Software
7.
BMC Bioinformatics ; 6: 249, 2005 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-16221310

RESUMO

BACKGROUND: The signal peptide plays an important role in protein targeting and protein translocation in both prokaryotic and eukaryotic cells. This transient, short peptide sequence functions like a postal address on an envelope by targeting proteins for secretion or for transfer to specific organelles for further processing. Understanding how signal peptides function is crucial in predicting where proteins are translocated. To support this understanding, we present SPdb signal peptide database http://proline.bic.nus.edu.sg/spdb, a repository of experimentally determined and computationally predicted signal peptides. RESULTS: SPdb integrates information from two sources (a) Swiss-Prot protein sequence database which is now part of UniProt and (b) EMBL nucleotide sequence database. The database update is semi-automated with human checking and verification of the data to ensure the correctness of the data stored. The latest release SPdb release 3.2 contains 18,146 entries of which 2,584 entries are experimentally verified signal sequences; the remaining 15,562 entries are either signal sequences that fail to meet our filtering criteria or entries that contain unverified signal sequences. CONCLUSION: SPdb is a manually curated database constructed to support the understanding and analysis of signal peptides. SPdb tracks the major updates of the two underlying primary databases thereby ensuring that its information remains up-to-date.


Assuntos
Bases de Dados de Ácidos Nucleicos , Sinais Direcionadores de Proteínas/fisiologia , Internet , Peptídeos/classificação , Transporte Proteico/fisiologia , Análise de Sequência de Proteína , Integração de Sistemas , Interface Usuário-Computador , Vocabulário Controlado
8.
Genomics Proteomics Bioinformatics ; 2(2): 84-96, 2004 May.
Artigo em Inglês | MEDLINE | ID: mdl-15629048

RESUMO

This paper examines recent developments and applications of Hidden Markov Models (HMMs) to various problems in computational biology, including multiple sequence alignment, homology detection, protein sequences classification, and genomic annotation.


Assuntos
Biologia Computacional , Cadeias de Markov , Modelos Biológicos , Conformação Proteica , Alinhamento de Sequência , Homologia de Sequência
9.
PLoS One ; 4(1): e4261, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19156204

RESUMO

BACKGROUND: Little is known about the immunopathogenesis of Chikungunya virus. Circulating levels of immune mediators and growth factors were analyzed from patients infected during the first Singaporean Chikungunya fever outbreak in early 2008 to establish biomarkers associated with infection and/or disease severity. METHODS AND FINDINGS: Adult patients with laboratory-confirmed Chikungunya fever infection, who were referred to the Communicable Disease Centre/Tan Tock Seng Hospital during the period from January to February 2008, were included in this retrospective study. Plasma fractions were analyzed using a multiplex-microbead immunoassay. Among the patients, the most common clinical features were fever (100%), arthralgia (90%), rash (50%) and conjunctivitis (40%). Profiles of 30 cytokines, chemokines, and growth factors were able to discriminate the clinical forms of Chikungunya from healthy controls, with patients classified as non-severe and severe disease. Levels of 8 plasma cytokines and 4 growth factors were significantly elevated. Statistical analysis showed that an increase in IL-1beta, IL-6 and a decrease in RANTES were associated with disease severity. CONCLUSIONS: This is the first comprehensive report on the production of cytokines, chemokines, and growth factors during acute Chikungunya virus infection. Using these biomarkers, we were able to distinguish between mild disease and more severe forms of Chikungunya fever, thus enabling the identification of patients with poor prognosis and monitoring of the disease.


Assuntos
Biomarcadores/metabolismo , Quimiocina CCL5/fisiologia , Regulação Viral da Expressão Gênica , Interleucina-1beta/fisiologia , Interleucina-6/fisiologia , Infecções por Togaviridae/sangue , Infecções por Togaviridae/diagnóstico , Adulto , Idoso , Algoritmos , Estudos de Casos e Controles , Vírus Chikungunya , Análise por Conglomerados , Febre , Humanos , Pessoa de Meia-Idade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA