Bayesian Phylogenetic Analysis on Multi-Core Compute Architectures: Implementation and Evaluation of BEAGLE in RevBayes With MPI.
Smith, Killian; Ayres, Daniel; Neumaier, René; Wörheide, Gert; Höhna, Sebastian.
Affiliation
  • Smith K; GeoBio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner Straße 10, 80333 Munich, Germany.
  • Ayres D; Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner Straße 10, 80333 Munich, Germany.
  • Neumaier R; Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA.
  • Wörheide G; GeoBio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner Straße 10, 80333 Munich, Germany.
  • Höhna S; GeoBio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner Straße 10, 80333 Munich, Germany.
Syst Biol ; 73(2): 455-469, 2024 Jul 27.
Article in English | MEDLINE | ID: mdl-38284268
ABSTRACT
Phylogenies are central to many research areas in biology and are commonly estimated using likelihood-based methods. Unfortunately, any likelihood-based method, including Bayesian inference, can be restrictively slow for large datasets (with many taxa and/or many sites in the sequence alignment) or complex substitution models. The primary limiting factor when using large datasets and/or complex models in probabilistic phylogenetic analyses is the likelihood calculation, which dominates the total computation time. To address this bottleneck, we incorporated the high-performance phylogenetic library BEAGLE into RevBayes, which enables multi-threading on multi-core CPUs and GPUs, as well as hardware-specific vectorized instructions for faster likelihood calculations. Our new implementation of RevBayes+BEAGLE retains the flexibility and dynamic nature that users expect from vanilla RevBayes. In addition, we implemented native parallelization within RevBayes, without an external library, using the message passing interface (MPI): RevBayes+MPI. We evaluated our new implementation of RevBayes+BEAGLE using multi-threading on CPUs and 2 different powerful GPUs (NVIDIA Titan V and NVIDIA A100) against our native implementation of RevBayes+MPI. We found good improvements in speedup when multiple cores were used, with up to 20-fold speedup when using multiple CPU cores and over 90-fold speedup when using multiple GPU cores. The improvement depended on the data type used (DNA or amino acids) and on the size of the alignment, but less on the size of the tree. We additionally investigated the cost of rescaling partial likelihoods to avoid numerical underflow and showed that unnecessarily frequent and inefficient rescaling can increase runtimes up to 4-fold. Finally, we presented and compared a new approach that stores partial likelihoods on branches instead of nodes, which can speed up computations up to 1.7 times but doubles the memory requirements.
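The rescaling of partial likelihoods mentioned in the abstract can be illustrated with a minimal sketch. This is not code from RevBayes or BEAGLE. It is a plain-Python toy version of the standard pruning recursion under the Jukes-Cantor model, and it assumes one simple rescaling scheme (dividing by the largest entry at every internal node). The tree encoding (a leaf is a sequence string, an internal node is a list of `(child, branch_length)` pairs) and all function names are hypothetical choices made for this illustration.

```python
import math

def jc69_matrix(t):
    """4x4 Jukes-Cantor transition probabilities for branch length t."""
    e = math.exp(-4.0 * t / 3.0)
    same, diff = 0.25 + 0.75 * e, 0.25 - 0.25 * e
    return [[same if i == j else diff for j in range(4)] for i in range(4)]

def partial(node, site, rescale):
    """Return (partial likelihood vector, accumulated log scaling factor)."""
    if isinstance(node, str):  # leaf: the node is the observed sequence
        state = "ACGT".index(node[site])
        return [1.0 if s == state else 0.0 for s in range(4)], 0.0
    vec, log_scale = [1.0] * 4, 0.0
    for child, t in node:
        child_vec, child_log = partial(child, site, rescale)
        p = jc69_matrix(t)
        # Multiply in this child's contribution for every parent state i.
        vec = [vec[i] * sum(p[i][j] * child_vec[j] for j in range(4))
               for i in range(4)]
        log_scale += child_log
    if rescale:
        # Assumed scheme: divide by the largest entry and remember its log.
        m = max(vec)
        vec = [v / m for v in vec]
        log_scale += math.log(m)
    return vec, log_scale

def log_likelihood(tree, n_sites, rescale=True):
    """Sum per-site log-likelihoods with uniform base frequencies."""
    total = 0.0
    for site in range(n_sites):
        vec, log_scale = partial(tree, site, rescale)
        total += math.log(sum(0.25 * v for v in vec)) + log_scale
    return total

def balanced_tree(depth, t):
    """Hypothetical helper: balanced tree with 2**depth identical 'A' leaves."""
    if depth == 0:
        return "A"
    return [(balanced_tree(depth - 1, t), t), (balanced_tree(depth - 1, t), t)]
```

On a small tree the rescaled and unscaled results agree, since rescaling only moves magnitude into the accumulated log factor. On a large tree such as `balanced_tree(10, 10.0)` the unscaled partial likelihoods underflow to zero and `math.log` raises a `ValueError`, while the rescaled version stays finite. Rescaling at every node, as done here, is the kind of unnecessarily frequent rescaling whose runtime cost the abstract discusses.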

Full text: 1 Database: MEDLINE Main subject: Phylogeny / Software / Bayes Theorem Type of study: Prognostic_studies Language: En Journal: Syst Biol Journal subject: BIOLOGY Year: 2024 Document type: Article