Your browser doesn't support javascript.
loading
PhyKIT: a broadly applicable UNIX shell toolkit for processing and analyzing phylogenomic data.
Steenwyk, Jacob L; Buida, Thomas J; Labella, Abigail L; Li, Yuanning; Shen, Xing-Xing; Rokas, Antonis.
Afiliação
  • Steenwyk JL; Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA.
  • Buida TJ; 9 City Place #312, Nashville, TN 37209, USA.
  • Labella AL; Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA.
  • Li Y; Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA.
  • Shen XX; Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou 310058, China.
  • Rokas A; Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA.
Bioinformatics ; 37(16): 2325-2331, 2021 Aug 25.
Article em En | MEDLINE | ID: mdl-33560364
ABSTRACT
MOTIVATION Diverse disciplines in biology process and analyze multiple sequence alignments (MSAs) and phylogenetic trees to evaluate their information content, infer evolutionary events and processes and predict gene function. However, automated processing of MSAs and trees remains a challenge due to the lack of a unified toolkit. To fill this gap, we introduce PhyKIT, a toolkit for the UNIX shell environment with 30 functions that process MSAs and trees, including but not limited to estimation of mutation rate, evaluation of sequence composition biases, calculation of the degree of violation of a molecular clock and collapsing bipartitions (internal branches) with low support.

RESULTS:

To demonstrate the utility of PhyKIT, we detail three use cases (1) summarizing information content in MSAs and phylogenetic trees for diagnosing potential biases in sequence or tree data; (2) evaluating gene-gene covariation of evolutionary rates to identify functional relationships, including novel ones, among genes and (3) identify lack of resolution events or polytomies in phylogenetic trees, which are suggestive of rapid radiation events or lack of data. We anticipate PhyKIT will be useful for processing, examining and deriving biological meaning from increasingly large phylogenomic datasets. AVAILABILITY AND IMPLEMENTATION PhyKIT is freely available on GitHub (https//github.com/JLSteenwyk/PhyKIT), PyPi (https//pypi.org/project/phykit/) and the Anaconda Cloud (https//anaconda.org/JLSteenwyk/phykit) under the MIT license with extensive documentation and user tutorials (https//jlsteenwyk.com/PhyKIT). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2021 Tipo de documento: Article