Your browser doesn't support javascript.
loading
Gradients Do Grow on Trees: A Linear-Time O(N)-Dimensional Gradient for Statistical Phylogenetics.
Ji, Xiang; Zhang, Zhenyu; Holbrook, Andrew; Nishimura, Akihiko; Baele, Guy; Rambaut, Andrew; Lemey, Philippe; Suchard, Marc A.
Afiliação
  • Ji X; Department of Biomathematics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA.
  • Zhang Z; Department of Mathematics, School of Science & Engineering, Tulane University, New Orleans, LA.
  • Holbrook A; Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA.
  • Nishimura A; Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA.
  • Baele G; Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD.
  • Rambaut A; Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium.
  • Lemey P; Institute of Evolutionary Biology, Centre for Immunology, Infection and Evolution, University of Edinburgh, Edinburgh, United Kingdom.
  • Suchard MA; Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium.
Mol Biol Evol ; 37(10): 3047-3060, 2020 10 01.
Article em En | MEDLINE | ID: mdl-32458974
ABSTRACT
Calculation of the log-likelihood stands as the computational bottleneck for many statistical phylogenetic algorithms. Even worse is its gradient evaluation, often used to target regions of high probability. Order O(N)-dimensional gradient calculations based on the standard pruning algorithm require O(N2) operations, where N is the number of sampled molecular sequences. With the advent of high-throughput sequencing, recent phylogenetic studies have analyzed hundreds to thousands of sequences, with an apparent trend toward even larger data sets as a result of advancing technology. Such large-scale analyses challenge phylogenetic reconstruction by requiring inference on larger sets of process parameters to model the increasing data heterogeneity. To make these analyses tractable, we present a linear-time algorithm for O(N)-dimensional gradient evaluation and apply it to general continuous-time Markov processes of sequence substitution on a phylogenetic tree without a need to assume either stationarity or reversibility. We apply this approach to learn the branch-specific evolutionary rates of three pathogenic viruses West Nile virus, Dengue virus, and Lassa virus. Our proposed algorithm significantly improves inference efficiency with a 126- to 234-fold increase in maximum-likelihood optimization and a 16- to 33-fold computational performance increase in a Bayesian framework.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Filogenia / Evolução Molecular / Modelos Genéticos Tipo de estudo: Evaluation_studies Idioma: En Revista: Mol Biol Evol Assunto da revista: BIOLOGIA MOLECULAR Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Canadá

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Filogenia / Evolução Molecular / Modelos Genéticos Tipo de estudo: Evaluation_studies Idioma: En Revista: Mol Biol Evol Assunto da revista: BIOLOGIA MOLECULAR Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Canadá