Ortholog identification in the presence of domain architecture rearrangement.
Brief Bioinform
; 12(5): 413-22, 2011 Sep.
Article
em En
| MEDLINE
| ID: mdl-21712343
Ortholog identification is used in gene functional annotation, species phylogeny estimation, phylogenetic profile construction and many other analyses. Bioinformatics methods for ortholog identification are commonly based on pairwise protein sequence comparisons between whole genomes. Phylogenetic methods of ortholog identification have also been developed; these methods can be applied to protein data sets sharing a common domain architecture or which share a single functional domain but differ outside this region of homology. While promiscuous domains represent a challenge to all orthology prediction methods, overall structural similarity is highly correlated with proximity in a phylogenetic tree, conferring a degree of robustness to phylogenetic methods. In this article, we review the issues involved in orthology prediction when data sets include sequences with structurally heterogeneous domain architectures, with particular attention to automated methods designed for high-throughput application, and present a case study to illustrate the challenges in this area.
Texto completo:
1
Base de dados:
MEDLINE
Assunto principal:
Filogenia
/
Genoma
/
Biologia Computacional
Idioma:
En
Ano de publicação:
2011
Tipo de documento:
Article
País de afiliação:
Estados Unidos