Your browser doesn't support javascript.
loading
Reproducibly sampling SARS-CoV-2 genomes across time, geography, and viral diversity.
Bolyen, Evan; Dillon, Matthew R; Bokulich, Nicholas A; Ladner, Jason T; Larsen, Brendan B; Hepp, Crystal M; Lemmer, Darrin; Sahl, Jason W; Sanchez, Andrew; Holdgraf, Chris; Sewell, Chris; Choudhury, Aakash G; Stachurski, John; McKay, Matthew; Simard, Anthony; Engelthaler, David M; Worobey, Michael; Keim, Paul; Caporaso, J Gregory.
Afiliação
  • Bolyen E; Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Dillon MR; School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA.
  • Bokulich NA; Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Ladner JT; Laboratory of Food Systems Biotechnology, Institute of Food, Nutrition and Health, ETH Zurich, Switzerland.
  • Larsen BB; Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Hepp CM; Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA.
  • Lemmer D; School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA.
  • Sahl JW; Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Sanchez A; Pathogen and Microbiome Division, Translational Genomics Research Institute, Flagstaff, AZ, USA.
  • Holdgraf C; Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Sewell C; Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA.
  • Choudhury AG; Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Stachurski J; Department of Statistics, University of California at Berkeley, Berkeley, CA, USA.
  • McKay M; Theory and Simulation of Materials, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
  • Simard A; Research School of Economics, Australian National University, ACT, Australia.
  • Engelthaler DM; Research School of Economics, Australian National University, ACT, Australia.
  • Worobey M; Research School of Economics, Australian National University, ACT, Australia.
  • Keim P; Center for Applied Microbiome Science, Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA.
  • Caporaso JG; Pathogen and Microbiome Division, Translational Genomics Research Institute, Flagstaff, AZ, USA.
F1000Res ; 9: 657, 2020.
Article em En | MEDLINE | ID: mdl-33500774
ABSTRACT
The COVID-19 pandemic has led to a rapid accumulation of SARS-CoV-2 genomes, enabling genomic epidemiology on local and global scales. Collections of genomes from resources such as GISAID must be subsampled to enable computationally feasible phylogenetic and other analyses. We present genome-sampler, a software package that supports sampling collections of viral genomes across multiple axes including time of genome isolation, location of genome isolation, and viral diversity. The software is modular in design so that these or future sampling approaches can be applied independently and combined (or replaced with a random sampling approach) to facilitate custom workflows and benchmarking. genome-sampler is written as a QIIME 2 plugin, ensuring that its application is fully reproducible through QIIME 2's unique retrospective data provenance tracking system. genome-sampler can be installed in a conda environment on macOS or Linux systems. A complete default pipeline is available through a Snakemake workflow, so subsampling can be achieved using a single command. genome-sampler is open source, free for all to use, and available at https//caporasolab.us/genome-sampler. We hope that this will facilitate SARS-CoV-2 research and support evaluation of viral genome sampling approaches for genomic epidemiology.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Filogenia / Genoma Viral / SARS-CoV-2 Tipo de estudo: Observational_studies Limite: Humans Idioma: En Revista: F1000Res Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Filogenia / Genoma Viral / SARS-CoV-2 Tipo de estudo: Observational_studies Limite: Humans Idioma: En Revista: F1000Res Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos