RESUMO
BACKGROUND: Implementation of clinical metagenomics and pathogen genomic surveillance can be particularly challenging due to the lack of bioinformatics tools and/or expertise. In order to face this challenge, we have previously developed INSaFLU, a free web-based bioinformatics platform for virus next-generation sequencing data analysis. Here, we considerably expanded its genomic surveillance component and developed a new module (TELEVIR) for metagenomic virus identification. RESULTS: The routine genomic surveillance component was strengthened with new workflows and functionalities, including (i) a reference-based genome assembly pipeline for Oxford Nanopore technologies (ONT) data; (ii) automated SARS-CoV-2 lineage classification; (iii) Nextclade analysis; (iv) Nextstrain phylogeographic and temporal analysis (SARS-CoV-2, human and avian influenza, monkeypox, respiratory syncytial virus (RSV A/B), as well as a "generic" build for other viruses); and (v) algn2pheno for screening mutations of interest. Both INSaFLU pipelines for reference-based consensus generation (Illumina and ONT) were benchmarked against commonly used command line bioinformatics workflows for SARS-CoV-2, and an INSaFLU snakemake version was released. In parallel, a new module (TELEVIR) for virus detection was developed, after extensive benchmarking of state-of-the-art metagenomics software and following up-to-date recommendations and practices in the field. TELEVIR allows running complex workflows, covering several combinations of steps (e.g., with/without viral enrichment or host depletion), classification software (e.g., Kaiju, Kraken2, Centrifuge, FastViromeExplorer), and databases (RefSeq viral genome, Virosaurus, etc.), while culminating in user- and diagnosis-oriented reports. Finally, to potentiate real-time virus detection during ONT runs, we developed findONTime, a tool aimed at reducing costs and the time between sample reception and diagnosis. CONCLUSIONS: The accessibility, versatility, and functionality of INSaFLU-TELEVIR are expected to supply public and animal health laboratories and researchers with a user-oriented and pan-viral bioinformatics framework that promotes a strengthened and timely viral metagenomic detection and routine genomics surveillance. INSaFLU-TELEVIR is compatible with Illumina, Ion Torrent, and ONT data and is freely available at https://insaflu.insa.pt/ (online tool) and https://github.com/INSaFLU (code).
Assuntos
COVID-19 , Biologia Computacional , Genoma Viral , Metagenômica , SARS-CoV-2 , Software , Metagenômica/métodos , Humanos , SARS-CoV-2/genética , SARS-CoV-2/classificação , COVID-19/virologia , Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Internet , Genômica/métodosRESUMO
Pathogen genome sequencing during epidemics enhances our ability to identify and understand suspected clusters and investigate their relationships. Here, we combine genomic and epidemiological data of the 2022 mpox outbreak to better understand early viral spread, diversification and transmission dynamics. By sequencing 52% of the confirmed cases in Portugal, we identified the mpox virus sublineages with the highest impact on case numbers and fitted them into a global context, finding evidence that several international sublineages probably emerged or spread early in Portugal. We estimated a 62% infection reporting rate and that 1.3% of the population of men who have sex with men in Portugal were infected. We infer the critical role played by sexual networks and superspreader gatherings, such as sauna attendance, in the dissemination of mpox virus. Overall, our findings highlight genomic epidemiology as a tool for the real-time monitoring and control of mpox epidemics, and can guide future vaccine policy in a highly susceptible population.
Assuntos
Mpox , Minorias Sexuais e de Gênero , Masculino , Humanos , Portugal/epidemiologia , Homossexualidade Masculina , Surtos de Doenças , Análise por ConglomeradosRESUMO
The largest monkeypox virus (MPXV) outbreak described so far in non-endemic countries was identified in May 2022 (refs. 1-6). In this study, shotgun metagenomics allowed the rapid reconstruction and phylogenomic characterization of the first MPXV outbreak genome sequences, showing that this MPXV belongs to clade 3 and that the outbreak most likely has a single origin. Although 2022 MPXV (lineage B.1) clustered with 2018-2019 cases linked to an endemic country, it segregates in a divergent phylogenetic branch, likely reflecting continuous accelerated evolution. An in-depth mutational analysis suggests the action of host APOBEC3 in viral evolution as well as signs of potential MPXV human adaptation in ongoing microevolution. Our findings also indicate that genome sequencing may provide resolution to track the spread and transmission of this presumably slow-evolving double-stranded DNA virus.