Your browser doesn't support javascript.
loading
Using earth mover's distance for viral outbreak investigations.
Melnyk, Andrew; Knyazev, Sergey; Vannberg, Fredrik; Bunimovich, Leonid; Skums, Pavel; Zelikovsky, Alex.
Afiliação
  • Melnyk A; Computer Science Department, Georgia State University, 25 Park Place NE, Atlanta, GA, 30303, USA. andrew.s.melnyk@gmail.com.
  • Knyazev S; Computer Science Department, Georgia State University, 25 Park Place NE, Atlanta, GA, 30303, USA.
  • Vannberg F; Georgia Institute of Technology, North Ave NW, Atlanta, GA, 30332, USA.
  • Bunimovich L; Georgia Institute of Technology, North Ave NW, Atlanta, GA, 30332, USA.
  • Skums P; Computer Science Department, Georgia State University, 25 Park Place NE, Atlanta, GA, 30303, USA.
  • Zelikovsky A; Computer Science Department, Georgia State University, 25 Park Place NE, Atlanta, GA, 30303, USA.
BMC Genomics ; 21(Suppl 5): 582, 2020 Dec 16.
Article em En | MEDLINE | ID: mdl-33327932
BACKGROUND: RNA viruses mutate at extremely high rates, forming an intra-host viral population of closely related variants, which allows them to evade the host's immune system and makes them particularly dangerous. Viral outbreaks pose a significant threat for public health, and, in order to deal with it, it is critical to infer transmission clusters, i.e., decide whether two viral samples belong to the same outbreak. Next-generation sequencing (NGS) can significantly help in tackling outbreak-related problems. While NGS data is first obtained as short reads, existing methods rely on assembled sequences. This requires reconstruction of the entire viral population, which is complicated, error-prone and time-consuming. RESULTS: The experimental validation using sequencing data from HCV outbreaks shows that the proposed algorithm can successfully identify genetic relatedness between viral populations, infer transmission direction, transmission clusters and outbreak sources, as well as decide whether the source is present in the sequenced outbreak sample and identify it. CONCLUSIONS: Introduced algorithm allows to cluster genetically related samples, infer transmission directions and predict sources of outbreaks. Validation on experimental data demonstrated that algorithm is able to reconstruct various transmission characteristics. Advantage of the method is the ability to bypass cumbersome read assembly, thus eliminating the chance to introduce new errors, and saving processing time by allowing to use raw NGS reads.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Vírus de RNA / Hepacivirus Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Vírus de RNA / Hepacivirus Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2020 Tipo de documento: Article