Your browser doesn't support javascript.
loading
A gold standard dataset and evaluation of methods for lineage abundance estimation from wastewater.
Ferdous, Jannatul; Kunkleman, Samuel; Taylor, William; Harris, April; Gibas, Cynthia J; Schlueter, Jessica A.
Affiliation
  • Ferdous J; Department of Bioinformatics and Genomics, UNC Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA.
  • Kunkleman S; Department of Bioinformatics and Genomics, UNC Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA.
  • Taylor W; Department of Bioinformatics and Genomics, UNC Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA.
  • Harris A; Department of Bioinformatics and Genomics, UNC Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA.
  • Gibas CJ; Department of Bioinformatics and Genomics, UNC Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA.
  • Schlueter JA; Department of Bioinformatics and Genomics, UNC Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA. Electronic address: jschluet@charlotte.edu.
Sci Total Environ ; 948: 174515, 2024 Oct 20.
Article in En | MEDLINE | ID: mdl-38971244
ABSTRACT
During the SARS-CoV-2 pandemic, genome-based wastewater surveillance sequencing has been a powerful tool for public health to monitor circulating and emerging viral variants. As a medium, wastewater is very complex because of its mixed matrix nature, which makes the deconvolution of wastewater samples more difficult. Here we introduce a gold standard dataset constructed from synthetic viral control mixtures of known composition, spiked into a wastewater RNA matrix and sequenced on the Oxford Nanopore Technologies platform. We compare the performance of eight of the most commonly used deconvolution tools in identifying SARS-CoV-2 variants present in these mixtures. The software evaluated was primarily chosen for its relevance to the CDC wastewater surveillance reporting protocol, which until recently employed a pipeline that incorporates results from four deconvolution

methods:

Freyja, kallisto, Kraken 2/Bracken, and LCS. We also tested Lollipop, a deconvolution method used by the Swiss SARS-CoV-2 Sequencing Consortium, and three additional methods not used in the C-WAP pipeline lineagespot, Alcov, and VaQuERo. We found that the commonly used software Freyja outperformed the other CDC pipeline tools in correct identification of lineages present in the control mixtures, and that the VaQuERo method was similarly accurate, with minor differences in the ability of the two methods to avoid false negatives and suppress false positives. Our results also provide insight into the effect of the tiling primer scheme and wastewater RNA extract matrix on viral sequencing and data deconvolution outcomes.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Wastewater / SARS-CoV-2 / COVID-19 Language: En Journal: Sci Total Environ Year: 2024 Document type: Article Affiliation country: Country of publication:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Wastewater / SARS-CoV-2 / COVID-19 Language: En Journal: Sci Total Environ Year: 2024 Document type: Article Affiliation country: Country of publication: