Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure.
Mol Biol Evol
; 31(9): 2251-66, 2014 Sep.
Article
in En
| MEDLINE
| ID: mdl-24899668
For sequences that are highly divergent, there is often insufficient information to infer accurate alignments, and phylogenetic uncertainty may be high. One way to address this issue is to make use of protein structural information, since structures generally diverge more slowly than sequences. In this work, we extend a recently developed stochastic model of pairwise structural evolution to multiple structures on a tree, analytically integrating over ancestral structures to permit efficient likelihood computations under the resulting joint sequence-structure model. We observe that the inclusion of structural information significantly reduces alignment and topology uncertainty, and reduces the number of topology and alignment errors in cases where the true trees and alignments are known. In some cases, the inclusion of structure results in changes to the consensus topology, indicating that structure may contain additional information beyond that which can be obtained from sequences. We use the model to investigate the order of divergence of cytoglobins, myoglobins, and hemoglobins and observe a stabilization of phylogenetic inference: although a sequence-based inference assigns significant posterior probability to several different topologies, the structural model strongly favors one of these over the others and is more robust to the choice of data set.
Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Globins
/
Hemoglobins
/
Bayes Theorem
/
Computational Biology
/
Myoglobin
Type of study:
Health_economic_evaluation
/
Prognostic_studies
Limits:
Animals
/
Humans
Language:
En
Journal:
Mol Biol Evol
Journal subject:
BIOLOGIA MOLECULAR
Year:
2014
Type:
Article
Affiliation country:
United kingdom