Your browser doesn't support javascript.
loading
Recommendations for Uniform Variant Calling of SARS-CoV-2 Genome Sequence across Bioinformatic Workflows.
Connor, Ryan; Shakya, Migun; Yarmosh, David A; Maier, Wolfgang; Martin, Ross; Bradford, Rebecca; Brister, J Rodney; Chain, Patrick S G; Copeland, Courtney A; di Iulio, Julia; Hu, Bin; Ebert, Philip; Gunti, Jonathan; Jin, Yumi; Katz, Kenneth S; Kochergin, Andrey; LaRosa, Tré; Li, Jiani; Li, Po-E; Lo, Chien-Chi; Rashid, Sujatha; Maiorova, Evguenia S; Xiao, Chunlin; Zalunin, Vadim; Purcell, Lisa; Pruitt, Kim D.
Afiliación
  • Connor R; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • Shakya M; Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
  • Yarmosh DA; American Type Culture Collection, Manassas, VA 20110, USA.
  • Maier W; BEI Resources, Manassas, VA 20110, USA.
  • Martin R; Galaxy Europe Team, University of Freiburg, 79085 Freiburg, Germany.
  • Bradford R; Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA.
  • Brister JR; American Type Culture Collection, Manassas, VA 20110, USA.
  • Chain PSG; BEI Resources, Manassas, VA 20110, USA.
  • Copeland CA; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • di Iulio J; Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
  • Hu B; Deloitte Consulting LLP, Rosslyn, VA 22209, USA.
  • Ebert P; Vir Biotechnology Inc., San Francisco, CA 94158, USA.
  • Gunti J; Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
  • Jin Y; Eli Lilly and Company, Indianapolis, IN 46225, USA.
  • Katz KS; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • Kochergin A; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • LaRosa T; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • Li J; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • Li PE; Deloitte Consulting LLP, Rosslyn, VA 22209, USA.
  • Lo CC; Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA.
  • Rashid S; Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
  • Maiorova ES; Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
  • Xiao C; American Type Culture Collection, Manassas, VA 20110, USA.
  • Zalunin V; Clinical Virology Department, Gilead Sciences, Foster City, CA 94404, USA.
  • Purcell L; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
  • Pruitt KD; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Viruses ; 16(3)2024 03 11.
Article en En | MEDLINE | ID: mdl-38543795
ABSTRACT
Genomic sequencing of clinical samples to identify emerging variants of SARS-CoV-2 has been a key public health tool for curbing the spread of the virus. As a result, an unprecedented number of SARS-CoV-2 genomes were sequenced during the COVID-19 pandemic, which allowed for rapid identification of genetic variants, enabling the timely design and testing of therapies and deployment of new vaccine formulations to combat the new variants. However, despite the technological advances of deep sequencing, the analysis of the raw sequence data generated globally is neither standardized nor consistent, leading to vastly disparate sequences that may impact identification of variants. Here, we show that for both Illumina and Oxford Nanopore sequencing platforms, downstream bioinformatic protocols used by industry, government, and academic groups resulted in different virus sequences from same sample. These bioinformatic workflows produced consensus genomes with differences in single nucleotide polymorphisms, inclusion and exclusion of insertions, and/or deletions, despite using the same raw sequence as input datasets. Here, we compared and characterized such discrepancies and propose a specific suite of parameters and protocols that should be adopted across the field. Consistent results from bioinformatic workflows are fundamental to SARS-CoV-2 and future pathogen surveillance efforts, including pandemic preparation, to allow for a data-driven and timely public health response.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: SARS-CoV-2 / COVID-19 Límite: Humans Idioma: En Revista: Viruses Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: SARS-CoV-2 / COVID-19 Límite: Humans Idioma: En Revista: Viruses Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos