Your browser doesn't support javascript.
loading
Shortest triplet clustering: reconstructing large phylogenies using representative sets.
Vinh, Le Sy; von Haeseler, Arndt.
Afiliação
  • Vinh le S; Heinrich-Heine-Universität Düsseldorf, WE Informatik, Universitätstr. 1, D-040225 Düsseldorf, Germany. vinh@cs.uni-duesseldorf.de
BMC Bioinformatics ; 6: 92, 2005 Apr 08.
Article em En | MEDLINE | ID: mdl-15819989
ABSTRACT

BACKGROUND:

Understanding the evolutionary relationships among species based on their genetic information is one of the primary objectives in phylogenetic analysis. Reconstructing phylogenies for large data sets is still a challenging task in Bioinformatics.

RESULTS:

We propose a new distance-based clustering method, the shortest triplet clustering algorithm (STC), to reconstruct phylogenies. The main idea is the introduction of a natural definition of so-called k-representative sets. Based on k-representative sets, shortest triplets are reconstructed and serve as building blocks for the STC algorithm to agglomerate sequences for tree reconstruction in O(n2) time for n sequences. Simulations show that STC gives better topological accuracy than other tested methods that also build a first starting tree. STC appears as a very good method to start the tree reconstruction. However, all tested methods give similar results if balanced nearest neighbor interchange (BNNI) is applied as a post-processing step. BNNI leads to an improvement in all instances. The program is available at http//www.bi.uni-duesseldorf.de/software/stc/.

CONCLUSION:

The results demonstrate that the new approach efficiently reconstructs phylogenies for large data sets. We found that BNNI boosts the topological accuracy of all methods including STC, therefore, one should use BNNI as a post-processing step to get better topological accuracy.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Interpretação Estatística de Dados / Biologia Computacional Tipo de estudo: Risk_factors_studies Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2005 Tipo de documento: Article País de afiliação: Alemanha

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Interpretação Estatística de Dados / Biologia Computacional Tipo de estudo: Risk_factors_studies Idioma: En Revista: BMC Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2005 Tipo de documento: Article País de afiliação: Alemanha