Your browser doesn't support javascript.
loading
Forecasting tuberculosis using diabetes-related google trends data.
Frauenfeld, Leonie; Nann, Dominik; Sulyok, Zita; Feng, You-Shan; Sulyok, Mihály.
Afiliación
  • Frauenfeld L; Institute for Pathology and Neuropathology, Eberhard Karls University, University Hospital of Tübingen , Tübingen 72076, Germany.
  • Nann D; Institute for Pathology and Neuropathology, Eberhard Karls University, University Hospital of Tübingen , Tübingen 72076, Germany.
  • Sulyok Z; Institute of Tropical Medicine, Eberhard Karls University, University Hospital of Tübingen , Tübingen 72074, Germany.
  • Feng YS; Department of Clinical Epidemiology and Applied Biometry, University Hospital of Tübingen , Tübingen 72076, Germany.
  • Sulyok M; Institute for Pathology and Neuropathology, Eberhard Karls University, University Hospital of Tübingen , Tübingen 72076, Germany.
Pathog Glob Health ; 114(5): 236-241, 2020 07.
Article en En | MEDLINE | ID: mdl-32453658
ABSTRACT
Online activity-based data can be used to aid infectious disease forecasting. Our aim was to exploit the converging nature of the tuberculosis (TB) and diabetes epidemics to forecast TB case numbers. Thus, we extended TB prediction models based on traditional data with diabetes-related Google searches. We obtained data on the weekly case numbers of TB in Germany from June 8th, 2014, to May 5th, 2019. Internet search data were obtained from a Google Trends (GTD) search for 'diabetes' to the corresponding interval. A seasonal autoregressive moving average (SARIMA) model (0,1,1) (1,0,0) [52] was selected to describe the weekly TB case numbers with and without GTD as an external regressor. We cross-validated the SARIMA models to obtain the root mean squared errors (RMSE). We repeated this procedure with autoregressive feed-forward neural network (NNAR) models using 5-fold cross-validation. To simulate a data-poor surveillance setting, we also tested traditional and GTD-extended models against a hold-out dataset using a decreased 52-week-long period with missing values for training. Cross-validation resulted in an RMSE of 20.83 for the traditional model and 18.56 for the GTD-extended model. Cross-validation of the NNAR models showed a mean RMSE of 19.49 for the traditional model and 18.99 for the GTD-extended model. When we tested the models trained on a decreased dataset with missing values, the GTD-extended models achieved significantly better prediction than the traditional models (p < 0.001). The GTD-extended models outperformed the traditional models in all assessed model evaluation parameters. Using online activity-based data regarding diabetes can improve TB forecasting, but further validation is warranted.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Tuberculosis / Redes Neurales de la Computación / Diabetes Mellitus / Epidemias Tipo de estudio: Evaluation_studies / Prognostic_studies / Screening_studies Límite: Humans País/Región como asunto: Europa Idioma: En Revista: Pathog Glob Health Año: 2020 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Tuberculosis / Redes Neurales de la Computación / Diabetes Mellitus / Epidemias Tipo de estudio: Evaluation_studies / Prognostic_studies / Screening_studies Límite: Humans País/Región como asunto: Europa Idioma: En Revista: Pathog Glob Health Año: 2020 Tipo del documento: Article País de afiliación: Alemania