RESUMO
The 2022 multicountry mpox outbreak concurrent with the ongoing Coronavirus Disease 2019 (COVID-19) pandemic further highlighted the need for genomic surveillance and rapid pathogen whole-genome sequencing. While metagenomic sequencing approaches have been used to sequence many of the early mpox infections, these methods are resource intensive and require samples with high viral DNA concentrations. Given the atypical clinical presentation of cases associated with the outbreak and uncertainty regarding viral load across both the course of infection and anatomical body sites, there was an urgent need for a more sensitive and broadly applicable sequencing approach. Highly multiplexed amplicon-based sequencing (PrimalSeq) was initially developed for sequencing of Zika virus, and later adapted as the main sequencing approach for Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Here, we used PrimalScheme to develop a primer scheme for human monkeypox virus that can be used with many sequencing and bioinformatics pipelines implemented in public health laboratories during the COVID-19 pandemic. We sequenced clinical specimens that tested presumptively positive for human monkeypox virus with amplicon-based and metagenomic sequencing approaches. We found notably higher genome coverage across the virus genome, with minimal amplicon drop-outs, in using the amplicon-based sequencing approach, particularly in higher PCR cycle threshold (Ct) (lower DNA titer) samples. Further testing demonstrated that Ct value correlated with the number of sequencing reads and influenced the percent genome coverage. To maximize genome coverage when resources are limited, we recommend selecting samples with a PCR Ct below 31 Ct and generating 1 million sequencing reads per sample. To support national and international public health genomic surveillance efforts, we sent out primer pool aliquots to 10 laboratories across the United States, United Kingdom, Brazil, and Portugal. These public health laboratories successfully implemented the human monkeypox virus primer scheme in various amplicon sequencing workflows and with different sample types across a range of Ct values. Thus, we show that amplicon-based sequencing can provide a rapidly deployable, cost-effective, and flexible approach to pathogen whole-genome sequencing in response to newly emerging pathogens. Importantly, through the implementation of our primer scheme into existing SARS-CoV-2 workflows and across a range of sample types and sequencing platforms, we further demonstrate the potential of this approach for rapid outbreak response.
Assuntos
COVID-19 , Mpox , Infecção por Zika virus , Zika virus , Humanos , COVID-19/epidemiologia , Pandemias , SARS-CoV-2/genética , GenômicaRESUMO
The sharing of genome sequences in online data repositories allows for large scale analyses of specific genes or gene families. This can result in the detection of novel gene subtypes as well as the development of improved detection methods. Here, we used publicly available WGS data to detect a novel Stx subtype, Stx2n in two clinical E. coli strains isolated in the USA. During this process, additional Stx2 subtypes were detected; six Stx2j, one Stx2m strain, and one Stx2o, were all analyzed for variability from the originally described subtypes. Complete genome sequences were assembled from short- or long-read sequencing and analyzed for serotype, and ST types. The WGS data from Stx2n- and Stx2o-producing STEC strains were further analyzed for virulence genes pro-phage analysis and phage insertion sites. Nucleotide and amino acid maximum parsimony trees showed expected clustering of the previously described subtypes and a clear separation of the novel Stx2n subtype. WGS data were used to design OMNI PCR primers for the detection of all known stx1 (283 bp amplicon), stx2 (400 bp amplicon), intimin encoded by eae (221 bp amplicon), and stx2f (438 bp amplicon) subtypes. These primers were tested in three different laboratories, using standard reference strains. An analysis of the complete genome sequence showed variability in serogroup, virulence genes, and ST type, and Stx2 pro-phages showed variability in size, gene composition, and phage insertion sites. The strains with Stx2j, Stx2m, Stx2n, and Stx2o showed toxicity to Vero cells. Stx2j carrying strain, 2012C-4221, was induced when grown with sub-inhibitory concentrations of ciprofloxacin, and toxicity was detected. Taken together, these data highlight the need to reinforce genomic surveillance to identify the emergence of potential new Stx2 or Stx1 variants. The importance of this surveillance has a paramount impact on public health. Per our description in this study, we suggest that 2017C-4317 be designated as the Stx2n type-strain.
RESUMO
We have adopted an open bioinformatics ecosystem to address the challenges of bioinformatics implementation in public health laboratories (PHLs). Bioinformatics implementation for public health requires practitioners to undertake standardized bioinformatic analyses and generate reproducible, validated and auditable results. It is essential that data storage and analysis are scalable, portable and secure, and that implementation of bioinformatics fits within the operational constraints of the laboratory. We address these requirements using Terra, a web-based data analysis platform with a graphical user interface connecting users to bioinformatics analyses without the use of code. We have developed bioinformatics workflows for use with Terra that specifically meet the needs of public health practitioners. These Theiagen workflows perform genome assembly, quality control, and characterization, as well as construction of phylogeny for insights into genomic epidemiology. Additonally, these workflows use open-source containerized software and the WDL workflow language to ensure standardization and interoperability with other bioinformatics solutions, whilst being adaptable by the user. They are all open source and publicly available in Dockstore with the version-controlled code available in public GitHub repositories. They have been written to generate outputs in standardized file formats to allow for further downstream analysis and visualization with separate genomic epidemiology software. Testament to this solution meeting the requirements for bioinformatic implementation in public health, Theiagen workflows have collectively been used for over 5 million sample analyses in the last 2 years by over 90 public health laboratories in at least 40 different countries. Continued adoption of technological innovations and development of further workflows will ensure that this ecosystem continues to benefit PHLs.
Assuntos
Ecossistema , Saúde Pública , Software , Biologia Computacional/métodos , GenômicaRESUMO
The capacity for pathogen genomics in public health expanded rapidly during the coronavirus disease 2019 (COVID-19) pandemic, but many public health laboratories did not have the infrastructure in place to handle the vast amount of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequence data generated. The California Department of Public Health, in partnership with Theiagen Genomics, was an early adopter of cloud-based resources for bioinformatics and genomic epidemiology, resulting in the creation of a SARS-CoV-2 genomic surveillance system that combined the efforts of more than 40 sequencing laboratories across government, academia and industry to form California COVIDNet, California's SARS-CoV-2 Whole-Genome Sequencing Initiative. Open-source bioinformatics workflows, ongoing training sessions for the public health workforce, and automated data transfer to visualization tools all contributed to the success of California COVIDNet. While challenges remain for public health genomic surveillance worldwide, California COVIDNet serves as a framework for a scaled and successful bioinformatics infrastructure that has expanded beyond SARS-CoV-2 to other pathogens of public health importance.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , COVID-19/epidemiologia , Saúde Pública , Laboratórios , Genômica , California/epidemiologiaRESUMO
The 2022 multi-country monkeypox (mpox) outbreak concurrent with the ongoing COVID-19 pandemic has further highlighted the need for genomic surveillance and rapid pathogen whole genome sequencing. While metagenomic sequencing approaches have been used to sequence many of the early mpox infections, these methods are resource intensive and require samples with high viral DNA concentrations. Given the atypical clinical presentation of cases associated with the outbreak and uncertainty regarding viral load across both the course of infection and anatomical body sites, there was an urgent need for a more sensitive and broadly applicable sequencing approach. Highly multiplexed amplicon-based sequencing (PrimalSeq) was initially developed for sequencing of Zika virus, and later adapted as the main sequencing approach for SARS-CoV-2. Here, we used PrimalScheme to develop a primer scheme for human monkeypox virus that can be used with many sequencing and bioinformatics pipelines implemented in public health laboratories during the COVID-19 pandemic. We sequenced clinical samples that tested presumptive positive for human monkeypox virus with amplicon-based and metagenomic sequencing approaches. We found notably higher genome coverage across the virus genome, with minimal amplicon drop-outs, in using the amplicon-based sequencing approach, particularly in higher PCR cycle threshold (lower DNA titer) samples. Further testing demonstrated that Ct value correlated with the number of sequencing reads and influenced the percent genome coverage. To maximize genome coverage when resources are limited, we recommend selecting samples with a PCR cycle threshold below 31 Ct and generating 1 million sequencing reads per sample. To support national and international public health genomic surveillance efforts, we sent out primer pool aliquots to 10 laboratories across the United States, United Kingdom, Brazil, and Portugal. These public health laboratories successfully implemented the human monkeypox virus primer scheme in various amplicon sequencing workflows and with different sample types across a range of Ct values. Thus, we show that amplicon based sequencing can provide a rapidly deployable, cost-effective, and flexible approach to pathogen whole genome sequencing in response to newly emerging pathogens. Importantly, through the implementation of our primer scheme into existing SARS-CoV-2 workflows and across a range of sample types and sequencing platforms, we further demonstrate the potential of this approach for rapid outbreak response.
RESUMO
In the United States, reports of Salmonella enterica carrying mcr-1 remain rare in humans, but when observed, the infection is often associated with travel. Here, we report 14 mcr-1-positive Salmonella enterica isolates from patients in the United States that reported travel to the Dominican Republic within the 12 months before illness.
RESUMO
Laboratories that run Whole Genome Sequencing (WGS) produce a tremendous amount of data, up to 10 gigabytes for some common instruments. There is a need to standardize the quality assurance and quality control process (QA/QC). Therefore we have created SneakerNet to automate the QA/QC for WGS.
RESUMO
We have developed and implemented an undergraduate microbiology course in which students isolate, characterize, and perform whole genome assembly and analysis of Salmonella enterica from stream sediments and poultry litter. In the development of the course and over three semesters, successive teams of undergraduate students collected field samples and performed enrichment and isolation techniques specific for the detection of S. enterica. Eighty-eight strains were confirmed using standard microbiological methods and PCR of the invA gene. The isolates' genomes were Illumina-sequenced by the Center for Food Safety and Applied Nutrition at the FDA and the Virginia state Division of Consolidated Laboratory Services as part of the GenomeTrakr program. Students used GalaxyTrakr and other web- and non-web-based platforms and tools to perform quality control on raw and assembled sequence data, assemble, and annotate genomes, identify antimicrobial resistance and virulence genes, putative plasmids, and other mobile genetic elements. Strains with putative plasmid-borne antimicrobial resistance genes were further sequenced by students in our research lab using the Oxford Nanopore MinIONTM platform. Strains of Salmonella that were isolated include human infectious serotypes such as Typhimurium and Infantis. Over 31 of the isolates possessed antibiotic resistance genes, some of which were located on large, multidrug resistance plasmids. Plasmid pHJ-38, identified in a Typhimurium isolate, is an apparently self-transmissible 183 kb IncA/C2 plasmid that possesses multiple antimicrobial resistance and heavy-metal resistance genes. Plasmid pFHS-02, identified in an Infantis isolate, is an apparently self-transmissible 303 kb IncF1B plasmid that also possesses numerous heavy-metal and antimicrobial resistance genes. Using direct and indirect measures to assess student outcomes, results indicate that course participation contributed to cognitive gains in relevant content knowledge and research skills such as field sampling, molecular techniques, and computational analysis. Furthermore, participants self-reported a deeper interest in scientific research and careers as well as psychosocial outcomes (e.g., sense of belonging and self-efficacy) commonly associated with student success and persistence in STEM. Overall, this course provided a powerful combination of field, wet lab, and computational biology experiences for students, while also providing data potentially useful in pathogen surveillance, epidemiological tracking, and for the further study of environmental reservoirs of S. enterica.
RESUMO
Salmonella enterica subsp. enterica serovar Corvallis is commonly reported in avian populations and avian by-products. We report the draft genome sequence of a multidrug-resistant S. Corvallis strain (NPHL 15376). To our knowledge, this is the first reported case of this serovar isolated from human blood in the United States.