Your browser doesn't support javascript.
loading
The influence of the number of tree searches on maximum likelihood inference in phylogenomics.
Liu, Chao; Zhou, Xiaofan; Li, Yuanning; Hittinger, Chris Todd; Pan, Ronghui; Huang, Jinyan; Chen, Xue-Xin; Rokas, Antonis; Chen, Yun; Shen, Xing-Xing.
Affiliation
  • Liu C; Key Laboratory of Biology of Crop Pathogens and Insects of Zhejiang Province, Department of Plant Protection, Zhejiang University, Hangzhou 310058, China.
  • Zhou X; Centre for Evolutionary & Organismal Biology, Zhejiang University, Hangzhou 310058, China.
  • Li Y; Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou 510642, China.
  • Hittinger CT; Institute of Marine Science and Technology, Shandong University, Qingdao 266237, China.
  • Pan R; Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA.
  • Huang J; Laboratory of Genetics, Wisconsin Energy Institute, Center for Genomic Science Innovation, DOE Great Lakes Bioenergy Research Center, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, WI 53706, USA.
  • Chen XX; ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 310027, China.
  • Rokas A; Zhejiang Provincial Key Laboratory of Pancreatic Disease, Zhejiang University School of Medicine First Affiliated Hospital, Hangzhou 310003, China.
  • Chen Y; Key Laboratory of Biology of Crop Pathogens and Insects of Zhejiang Province, Department of Plant Protection, Zhejiang University, Hangzhou 310058, China.
  • Shen XX; Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA.
Syst Biol ; 2024 Jun 28.
Article in En | MEDLINE | ID: mdl-38940001
ABSTRACT
Maximum likelihood (ML) phylogenetic inference is widely used in phylogenomics. As heuristic searches most likely find suboptimal trees, it is recommended to conduct multiple (e.g., ten) tree searches in phylogenetic analyses. However, beyond its positive role, how and to what extent multiple tree searches aid ML phylogenetic inference remains poorly explored. Here, we found that a random starting tree was not as effective as the BioNJ and parsimony starting trees in inferring ML gene tree and that RAxML-NG and PhyML were less sensitive to different starting trees than IQ-TREE. We then examined the effect of the number of tree searches on ML tree inference with IQ-TREE and RAxML-NG, by running 100 tree searches on 19,414 gene alignments from 15 animal, plant, and fungal phylogenomic datasets. We found that the number of tree searches substantially impacted the recovery of the best-of-100 ML gene tree topology among 100 searches for a given ML program. In addition, all of the concatenation-based trees were topologically identical if the number of tree searches was ≥ 10. Quartet-based ASTRAL trees inferred from 1 to 80 tree searches differed topologically from those inferred from 100 tree searches for 6 /15 phylogenomic datasets. Lastly, our simulations showed that gene alignments with lower difficulty scores had a higher chance of finding the best-of-100 gene tree topology and were more likely to yield the correct trees.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Syst Biol Journal subject: BIOLOGIA Year: 2024 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Syst Biol Journal subject: BIOLOGIA Year: 2024 Type: Article Affiliation country: China