Your browser doesn't support javascript.
loading
Predicting hosts and cross-species transmission of Streptococcus agalactiae by interpretable machine learning.
Ren, Yunxiao; Li, Carmen; Nanayakkara Sapugahawatte, Dulmini; Zhu, Chendi; Spänig, Sebastian; Jamrozy, Dorota; Rothen, Julian; Daubenberger, Claudia A; Bentley, Stephen D; Ip, Margaret; Heider, Dominik.
Afiliación
  • Ren Y; Department for Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg, Germany.
  • Li C; Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China.
  • Nanayakkara Sapugahawatte D; Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China.
  • Zhu C; Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China.
  • Spänig S; Department for Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg, Germany.
  • Jamrozy D; Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.
  • Rothen J; Swiss Tropical and Public Health Institute (Swiss TPH) Basel, Department of Medical Parasitology and Infection Biology, 4002, Basel, Switzerland; University of Basel, 4002, Basel, Switzerland.
  • Daubenberger CA; Swiss Tropical and Public Health Institute (Swiss TPH) Basel, Department of Medical Parasitology and Infection Biology, 4002, Basel, Switzerland; University of Basel, 4002, Basel, Switzerland.
  • Bentley SD; Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.
  • Ip M; Department of Microbiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China.
  • Heider D; Department for Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg, Germany; Institute for Computer Science, University of Düsseldorf, 40211, Düsseldorf, Germany; Center for Digital Health, Heinrich Heine University Düsseldorf, Moorenstr.
Comput Biol Med ; 171: 108185, 2024 Mar.
Article en En | MEDLINE | ID: mdl-38401454
ABSTRACT

BACKGROUND:

Streptococcus agalactiae, commonly known as Group B Streptococcus (GBS), exhibits a broad host range, manifesting as both a beneficial commensal and an opportunistic pathogen across various species. In humans, it poses significant risks, causing neonatal sepsis and meningitis, along with severe infections in adults. Additionally, it impacts livestock by inducing mastitis in bovines and contributing to epidemic mortality in fish populations. Despite its wide host spectrum, the mechanisms enabling GBS to adapt to specific hosts remain inadequately elucidated. Therefore, the development of a rapid and accurate method differentiates GBS strains associated with particular animal hosts based on genome-wide information holds immense potential. Such a tool would not only bolster the identification and containment efforts during GBS outbreaks but also deepen our comprehension of the bacteria's host adaptations spanning humans, livestock, and other natural animal reservoirs. METHODS AND

RESULTS:

Here, we developed three machine learning models-random forest (RF), logistic regression (LR), and support vector machine (SVM) based on genome-wide mutation data. These models enabled precise prediction of the host origin of GBS, accurately distinguishing between human, bovine, fish, and pig hosts. Moreover, we conducted an interpretable machine learning using SHapley Additive exPlanations (SHAP) and variant annotation to uncover the most influential genomic features and associated genes for each host. Additionally, by meticulously examining misclassified samples, we gained valuable insights into the dynamics of host transmission and the potential for zoonotic infections.

CONCLUSIONS:

Our study underscores the effectiveness of random forest (RF) and logistic regression (LR) models based on mutation data for accurately predicting GBS host origins. Additionally, we identify the key features associated with each GBS host, thereby enhancing our understanding of the bacteria's host-specific adaptations.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Infecciones Estreptocócicas / Streptococcus agalactiae Idioma: En Revista: Comput Biol Med Año: 2024 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Infecciones Estreptocócicas / Streptococcus agalactiae Idioma: En Revista: Comput Biol Med Año: 2024 Tipo del documento: Article