Your browser doesn't support javascript.
loading
Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure.
Herman, Joseph L; Challis, Christopher J; Novák, Ádám; Hein, Jotun; Schmidler, Scott C.
Affiliation
  • Herman JL; Department of Statistics, University of Oxford, Oxford, United KingdomDivision of Mathematical Biology, National Institute of Medical Research, London, United Kingdom herman@stats.ox.ac.uk.
  • Challis CJ; Department of Statistical Science, Duke University.
  • Novák Á; Department of Statistics, University of Oxford, Oxford, United Kingdom.
  • Hein J; Department of Statistics, University of Oxford, Oxford, United Kingdom.
  • Schmidler SC; Department of Statistical Science, Duke UniversityDepartment of Computer Science, Duke University.
Mol Biol Evol ; 31(9): 2251-66, 2014 Sep.
Article in En | MEDLINE | ID: mdl-24899668
For sequences that are highly divergent, there is often insufficient information to infer accurate alignments, and phylogenetic uncertainty may be high. One way to address this issue is to make use of protein structural information, since structures generally diverge more slowly than sequences. In this work, we extend a recently developed stochastic model of pairwise structural evolution to multiple structures on a tree, analytically integrating over ancestral structures to permit efficient likelihood computations under the resulting joint sequence-structure model. We observe that the inclusion of structural information significantly reduces alignment and topology uncertainty, and reduces the number of topology and alignment errors in cases where the true trees and alignments are known. In some cases, the inclusion of structure results in changes to the consensus topology, indicating that structure may contain additional information beyond that which can be obtained from sequences. We use the model to investigate the order of divergence of cytoglobins, myoglobins, and hemoglobins and observe a stabilization of phylogenetic inference: although a sequence-based inference assigns significant posterior probability to several different topologies, the structural model strongly favors one of these over the others and is more robust to the choice of data set.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Globins / Hemoglobins / Bayes Theorem / Computational Biology / Myoglobin Type of study: Health_economic_evaluation / Prognostic_studies Limits: Animals / Humans Language: En Journal: Mol Biol Evol Journal subject: BIOLOGIA MOLECULAR Year: 2014 Type: Article Affiliation country: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Globins / Hemoglobins / Bayes Theorem / Computational Biology / Myoglobin Type of study: Health_economic_evaluation / Prognostic_studies Limits: Animals / Humans Language: En Journal: Mol Biol Evol Journal subject: BIOLOGIA MOLECULAR Year: 2014 Type: Article Affiliation country: United kingdom