Pesquisa | BVS Economia da Saúde

Resources and tools for investigating biomolecular networks in mammals.

Dietmann, Sabine; Aguilar, Daniel; Mader, Michael; Oesterheld, Matthias; Ruepp, Andreas; Stuempflen, Volker; Mewes, Hans-Werner.

Curr Pharm Des ; 12(29): 3723-34, 2006.

Artigo em Inglês | MEDLINE | ID: mdl-17073671

RESUMO

Molecular databases serve as primary information resources for the analysis of biological networks providing an essential and invaluable treasure for information exploration. Tools for projecting experimental data sets onto known functional information are a major need to support the analysis of samples produced in clinical research. A new concept is the notation of functional modules, i.e. the characterisation of sets of proteins that perform a defined biological function in cooperation. The determination and analysis of functional modules overcome the limitations of the analysis of individual genes and their properties. Although functional modules are not suitable to fully capture systems properties, they have the potential to unify the information generated by different types of experiments. We describe advances related to the problem of integrating heterogeneous data sets into functional modules for mouse and/or human cellular networks based on publicly available data resources, including advances in the design of ontologies for functional classification, problems of automatic protein functional annotation and integration of microarray data.

Assuntos

Bases de Dados Genéticas , Proteínas/classificação , Integração de Sistemas , Algoritmos , Animais , Análise por Conglomerados , Humanos , Cadeias de Markov , Camundongos , Redes Neurais de Computação , Análise de Sequência com Séries de Oligonucleotídeos , Proteínas/química , Proteínas/genética

Super paramagnetic clustering of protein sequences.

Tetko, Igor V; Facius, Axel; Ruepp, Andreas; Mewes, Hans-Werner.

BMC Bioinformatics ; 6: 82, 2005 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-15804359

RESUMO

BACKGROUND: Detection of sequence homologues represents a challenging task that is important for the discovery of protein families and the reliable application of automatic annotation methods. The presence of domains in protein families of diverse function, inhomogeneity and different sizes of protein families create considerable difficulties for the application of published clustering methods. RESULTS: Our work analyses the Super Paramagnetic Clustering (SPC) and its extension, global SPC (gSPC) algorithm. These algorithms cluster input data based on a method that is analogous to the treatment of an inhomogeneous ferromagnet in physics. For the SwissProt and SCOP databases we show that the gSPC improves the specificity and sensitivity of clustering over the original SPC and Markov Cluster algorithm (TRIBE-MCL) up to 30%. The three algorithms provided similar results for the MIPS FunCat 1.3 annotation of four bacterial genomes, Bacillus subtilis, Helicobacter pylori, Listeria innocua and Listeria monocytogenes. However, the gSPC covered about 12% more sequences compared to the other methods. The SPC algorithm was programmed in house using C++ and it is available at http://mips.gsf.de/proj/spc. The FunCat annotation is available at http://mips.gsf.de. CONCLUSION: The gSPC calculated to a higher accuracy or covered a larger number of sequences than the TRIBE-MCL algorithm. Thus it is a useful approach for automatic detection of protein families and unsupervised annotation of full genomes.

Assuntos

Biologia Computacional/métodos , Bases de Dados de Proteínas , Proteínas/química , Algoritmos , Bacillus subtilis/metabolismo , Análise por Conglomerados , Gráficos por Computador , Simulação por Computador , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Bases de Dados de Ácidos Nucleicos , Genes Bacterianos , Genoma , Genoma Bacteriano , Helicobacter pylori/metabolismo , Armazenamento e Recuperação da Informação , Listeria/metabolismo , Listeria monocytogenes/metabolismo , Magnetismo , Cadeias de Markov , Método de Monte Carlo , Linguagens de Programação , Estrutura Terciária de Proteína , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Software

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA