Your browser doesn't support javascript.
loading
Machine learning-based approach KEVOLVE efficiently identifies SARS-CoV-2 variant-specific genomic signatures.
Lebatteux, Dylan; Soudeyns, Hugo; Boucoiran, Isabelle; Gantt, Soren; Diallo, Abdoulaye Baniré.
Afiliación
  • Lebatteux D; Department of Computer Science, Université du Québec à Montréal, Montréal, Québec, Canada.
  • Soudeyns H; CHU Sainte-Justine Research Centre, Montréal, Québec, Canada.
  • Boucoiran I; Department of Microbiology, Infectious Diseases and Immunology, Faculty of Medicine, Université de Montréal, Montréal, Québec, Canada.
  • Gantt S; Department of Pediatrics, Faculty of Medicine, Université du Québec à Montréal, Montréal, Québec, Canada.
  • Diallo AB; Department of Obstetrics and Gynecology, Faculty of Medicine, Université de Montréal, Montreal, Quebec, Canada.
PLoS One ; 19(1): e0296627, 2024.
Article en En | MEDLINE | ID: mdl-38241279
ABSTRACT
Machine learning was shown to be effective at identifying distinctive genomic signatures among viral sequences. These signatures are defined as pervasive motifs in the viral genome that allow discrimination between species or variants. In the context of SARS-CoV-2, the identification of these signatures can assist in taxonomic and phylogenetic studies, improve in the recognition and definition of emerging variants, and aid in the characterization of functional properties of polymorphic gene products. In this paper, we assess KEVOLVE, an approach based on a genetic algorithm with a machine-learning kernel, to identify multiple genomic signatures based on minimal sets of k-mers. In a comparative study, in which we analyzed large SARS-CoV-2 genome dataset, KEVOLVE was more effective at identifying variant-discriminative signatures than several gold-standard statistical tools. Subsequently, these signatures were characterized using a new extension of KEVOLVE (KANALYZER) to highlight variations of the discriminative signatures among different classes of variants, their genomic location, and the mutations involved. The majority of identified signatures were associated with known mutations among the different variants, in terms of functional and pathological impact based on available literature. Here we showed that KEVOLVE is a robust machine learning approach to identify discriminative signatures among SARS-CoV-2 variants, which are frequently also biologically relevant, while bypassing multiple sequence alignments. The source code of the method and additional resources are available at https//github.com/bioinfoUQAM/KEVOLVE.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: SARS-CoV-2 / COVID-19 Tipo de estudio: Diagnostic_studies / Prognostic_studies Límite: Humans Idioma: En Revista: PLoS One Asunto de la revista: CIENCIA / MEDICINA Año: 2024 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: SARS-CoV-2 / COVID-19 Tipo de estudio: Diagnostic_studies / Prognostic_studies Límite: Humans Idioma: En Revista: PLoS One Asunto de la revista: CIENCIA / MEDICINA Año: 2024 Tipo del documento: Article País de afiliación: Canadá
...