Your browser doesn't support javascript.
loading
A machine-learning-based alternative to phylogenetic bootstrap.
Ecker, Noa; Huchon, Dorothée; Mansour, Yishay; Mayrose, Itay; Pupko, Tal.
  • Ecker N; The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
  • Huchon D; School of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
  • Mansour Y; The Steinhardt Museum of Natural History and National Research Center, Tel Aviv University, Tel Aviv 6997801, Israel.
  • Mayrose I; The Blavatnik School of Computer Science, Raymond & Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
  • Pupko T; School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
Bioinformatics ; 40(Supplement_1): i208-i217, 2024 Jun 28.
Article en En | MEDLINE | ID: mdl-38940166
ABSTRACT
MOTIVATION Currently used methods for estimating branch support in phylogenetic analyses often rely on the classic Felsenstein's bootstrap, parametric tests, or their approximations. As these branch support scores are widely used in phylogenetic analyses, having accurate, fast, and interpretable scores is of high importance.

RESULTS:

Here, we employed a data-driven approach to estimate branch support values with a probabilistic interpretation. To this end, we simulated thousands of realistic phylogenetic trees and the corresponding multiple sequence alignments. Each of the obtained alignments was used to infer the phylogeny using state-of-the-art phylogenetic inference software, which was then compared to the true tree. Using these extensive data, we trained machine-learning algorithms to estimate branch support values for each bipartition within the maximum-likelihood trees obtained by each software. Our results demonstrate that our model provides fast and more accurate probability-based branch support values than commonly used procedures. We demonstrate the applicability of our approach on empirical datasets. AVAILABILITY AND IMPLEMENTATION The data supporting this work are available in the Figshare repository at https//doi.org/10.6084/m9.figshare.25050554.v1, and the underlying code is accessible via GitHub at https//github.com/noaeker/bootstrap_repo.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Filogenia / Algoritmos / Programas Informáticos / Aprendizaje Automático Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Filogenia / Algoritmos / Programas Informáticos / Aprendizaje Automático Idioma: En Año: 2024 Tipo del documento: Article