Your browser doesn't support javascript.
loading
Machine Learning Methods as a Tool for Predicting Risk of Illness Applying Next-Generation Sequencing Data.
Njage, Patrick Murigu Kamau; Henri, Clementine; Leekitcharoenphon, Pimlapas; Mistou, Michel-Yves; Hendriksen, Rene S; Hald, Tine.
Afiliación
  • Njage PMK; Division for Epidemiology and Microbial Genomics, National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark.
  • Henri C; Université PARIS-EST, Agence Nationale de Sécurité Sanitaire de L'Alimentation, de L'Environnement et du Travail (ANSES), Laboratory for Food Safety, Maisons-Alfort, France.
  • Leekitcharoenphon P; Division for Epidemiology and Microbial Genomics, National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark.
  • Mistou MY; Université PARIS-EST, Agence Nationale de Sécurité Sanitaire de L'Alimentation, de L'Environnement et du Travail (ANSES), Laboratory for Food Safety, Maisons-Alfort, France.
  • Hendriksen RS; Division for Epidemiology and Microbial Genomics, National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark.
  • Hald T; Division for Epidemiology and Microbial Genomics, National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark.
Risk Anal ; 39(6): 1397-1413, 2019 06.
Article en En | MEDLINE | ID: mdl-30462833
ABSTRACT
Next-generation sequencing (NGS) data present an untapped potential to improve microbial risk assessment (MRA) through increased specificity and redefinition of the hazard. Most of the MRA models do not account for differences in survivability and virulence among strains. The potential of machine learning algorithms for predicting the risk/health burden at the population level while inputting large and complex NGS data was explored with Listeria monocytogenes as a case study. Listeria data consisted of a percentage similarity matrix from genome assemblies of 38 and 207 strains of clinical and food origin, respectively. Basic Local Alignment (BLAST) was used to align the assemblies against a database of 136 virulence and stress resistance genes. The outcome variable was frequency of illness, which is the percentage of reported cases associated with each strain. These frequency data were discretized into seven ordinal outcome categories and used for supervised machine learning and model selection from five ensemble algorithms. There was no significant difference in accuracy between the models, and support vector machine with linear kernel was chosen for further inference (accuracy of 89% [95% CI 68%, 97%]). The virulence genes FAM002725, FAM002728, FAM002729, InlF, InlJ, Inlk, IisY, IisD, IisX, IisH, IisB, lmo2026, and FAM003296 were important predictors of higher frequency of illness. InlF was uniquely truncated in the sequence type 121 strains. Most important risk predictor genes occurred at highest prevalence among strains from ready-to-eat, dairy, and composite foods. We foresee that the findings and approaches described offer the potential for rethinking the current approaches in MRA.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Contexto en salud: 3_ND Problema de salud: 3_neglected_diseases / 3_zoonosis Asunto principal: Medición de Riesgo / Secuenciación de Nucleótidos de Alto Rendimiento / Aprendizaje Automático / Listeriosis / Listeria monocytogenes Tipo de estudio: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Risk Anal Año: 2019 Tipo del documento: Article País de afiliación: Dinamarca

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Contexto en salud: 3_ND Problema de salud: 3_neglected_diseases / 3_zoonosis Asunto principal: Medición de Riesgo / Secuenciación de Nucleótidos de Alto Rendimiento / Aprendizaje Automático / Listeriosis / Listeria monocytogenes Tipo de estudio: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Risk Anal Año: 2019 Tipo del documento: Article País de afiliación: Dinamarca
...