Your browser doesn't support javascript.
loading
matOptimize: a parallel tree optimization method enables online phylogenetics for SARS-CoV-2.
Ye, Cheng; Thornlow, Bryan; Hinrichs, Angie; Kramer, Alexander; Mirchandani, Cade; Torvi, Devika; Lanfear, Robert; Corbett-Detig, Russell; Turakhia, Yatish.
Afiliación
  • Ye C; Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, CA 92093, USA.
  • Thornlow B; Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Hinrichs A; Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Kramer A; Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Mirchandani C; Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Torvi D; Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Lanfear R; Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Corbett-Detig R; Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
  • Turakhia Y; Department of Bioengineering, University of California, San Diego, San Diego, CA 92093, USA.
Bioinformatics ; 38(15): 3734-3740, 2022 08 02.
Article en En | MEDLINE | ID: mdl-35731204
ABSTRACT
MOTIVATION Phylogenetic tree optimization is necessary for precise analysis of evolutionary and transmission dynamics, but existing tools are inadequate for handling the scale and pace of data produced during the coronavirus disease 2019 (COVID-19) pandemic. One transformative approach, online phylogenetics, aims to incrementally add samples to an ever-growing phylogeny, but there are no previously existing approaches that can efficiently optimize this vast phylogeny under the time constraints of the pandemic.

RESULTS:

Here, we present matOptimize, a fast and memory-efficient phylogenetic tree optimization tool based on parsimony that can be parallelized across multiple CPU threads and nodes, and provides orders of magnitude improvement in runtime and peak memory usage compared to existing state-of-the-art methods. We have developed this method particularly to address the pressing need during the COVID-19 pandemic for daily maintenance and optimization of a comprehensive SARS-CoV-2 phylogeny. matOptimize is currently helping refine on a daily basis possibly the largest-ever phylogenetic tree, containing millions of SARS-CoV-2 sequences. AVAILABILITY AND IMPLEMENTATION The matOptimize code is freely available as part of the UShER package (https//github.com/yatisht/usher) and can also be installed via bioconda (https//bioconda.github.io/recipes/usher/README.html). All scripts we used to perform the experiments in this manuscript are available at https//github.com/yceh/matOptimize-experiments. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: SARS-CoV-2 / COVID-19 Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: SARS-CoV-2 / COVID-19 Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos